Next Step : Digital collections into Primo

NU Primo – What’s Next
“What to do with Digital Objects in Primo”
IGeLU 2014
Michael North
Sr. Systems Analyst / Programmer
Lead, Systems Team
AGENDA
.1. Digital Objects and Digital Collections :
What is our Plan ? Why Use Primo ?
.2. Fedora and Primo workflow / Integrated with Blackboard
.3. Fun Item : Dublin Core : YouTube Videos and Blackboard Canvas ???
.4. MODS schema extract and Ingestion (Winterton Collection)
.5. EAD to Primo (Finding Aids)
.6. METS schema : African Maps Collection
.7. MODS schema : African Posters
.8. LibGuides into Primo
What is our Plan? … ya gotta have a plan !
Heaviest Traffic: Blackboard
2nd Heaviest Traffic: Library
Fastest Growing Traffic: Repositories
The “sweet spot” = Develop integrated solution
ISSUES:
Metadata choice ?
Repository solution ?
Primo Configuration ?
Working with DC Silos
First Hurdle
Moving to …… Fedora and Primo
* Consistent Delivery
* All different inside
Digital Collections Searching and Cross Collection Searching
Why use Primo for Digital Collections Viewing ?
Built-in facet search capability.
Boolean search capability
Value of cross collection searching.
Facets work (select 2+)
Advanced searching
Integration into other applications (such as Blackboard CMS)
#1 reason: Access points increased (more than one way to find).
Entry point for patron into a collection/website they never knew about.
NOTE: Use Primo instead of hiring a team of developers to build a repository website capable of cross-collection searching!!!
NEW : Repository Cross Searching VIEW
Cross Collection Searching
More Options : to facilitate Xsearching
Advanced Search
Select specific Digital Collection
Repository Metadata Workflow into Primo So Far!
EAD/MODS to PNX/Primo (Winterton / Africana Posters)
METS/MODS to PNX/Primo (Africana Maps)
EAD to PNX/Primo (Archon - Finding Aids)
XML to PNX/Primo (LibGuides)
Dublin Core to PNX/Primo (YouTube Videos)
Fedora to Primo Overview
.1. Extract PIDs File: Use an ITQL or SPARQL query against FEDORA to create a file with PIDs. Output: PIDs file.
.2. Extract Fedora Metadata: A Ruby/Rails script uses the PIDs file to extract the Fedora Objects metadata schema (EAD, MODS, METS, DC, etc.) to xml files. This is either a MODS metadata export or a nonMODS metadata export. Output: Fedora exported file.
.3. Schema Conversions: A Ruby/Rails script with XSLT converts nonMODS metadata to MODS xml files. Output: MODS datafile (xml).
.4. MODS to PNX: A Ruby/Rails script with XSLT converts the MODS xml files into a single PNX file. Output: PNX datafile.
.5. Primo Harvest: XML File Splitter & Harvest. Primo BackOffice loads the PNX datafile using a basic XML pipe. Output: PNX DB in PRIMO.
.6. Add Collection Updates: Use above procedure – only use the add_content.rb script.
* 4 Ruby Scripts (export, convert, mods_2_pnx, add_content.rb)
* Fedora = modular repository system for management and dissemination of digital content.
* Why do we run 3 scripts ? It will be one script eventually!!!
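The PID extraction step can be sketched roughly as follows. This is a Python illustration, not the Ruby script used in the project. It assumes a Fedora 3.x resource index (risearch) endpoint that returns CSV, and it assumes the collection is modelled with the isMemberOfCollection relationship; the host name and the collection PID are placeholders.

```python
import csv
import io
from urllib.parse import urlencode

# Assumption: Fedora 3.x risearch endpoint; the host is a placeholder.
RISEARCH_URL = "http://localhost:8080/fedora/risearch"

# Assumption: objects point at their collection with isMemberOfCollection.
MEMBER_OF = "info:fedora/fedora-system:def/relations-external#isMemberOfCollection"


def build_pid_query_url(collection_pid, base_url=RISEARCH_URL):
    """Build a risearch URL whose ITQL query lists every object in one collection."""
    itql = (
        "select $object from <#ri> "
        f"where $object <{MEMBER_OF}> <info:fedora/{collection_pid}>"
    )
    params = {"type": "tuples", "lang": "itql", "format": "CSV", "query": itql}
    return base_url + "?" + urlencode(params)


def parse_pids(csv_text):
    """Turn a CSV response (header row, then one object URI per row) into bare PIDs."""
    rows = csv.reader(io.StringIO(csv_text))
    next(rows, None)  # skip the header row
    prefix = "info:fedora/"
    return [row[0][len(prefix):] for row in rows if row and row[0].startswith(prefix)]
```

Writing the returned list to a text file, one PID per line, would give the "PIDs file" that the later export step reads.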
Something Special for our Teaching Faculty
With Primo and Blackboard interoperability
Dublin Core: Use DC to manually add YouTube Videos for Faculty Classroom use with Blackboard CMS.
Example: DC record from Faculty Form
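The slide shows the record as an image, so the actual fields are not recoverable here. As a rough idea of what a hand-built Dublin Core record for a YouTube video could look like, here is a minimal Python sketch. The choice of fields, the oai_dc wrapper element, and the sample values are all assumptions for illustration.

```python
import xml.etree.ElementTree as ET

DC_NS = "http://purl.org/dc/elements/1.1/"
OAI_DC_NS = "http://www.openarchives.org/OAI/2.0/oai_dc/"
ET.register_namespace("dc", DC_NS)
ET.register_namespace("oai_dc", OAI_DC_NS)


def build_dc_record(title, creator, video_url, description=""):
    """Return a small Dublin Core record (as XML text) describing one video."""
    # Assumption: records are wrapped in an oai_dc:dc element.
    root = ET.Element(f"{{{OAI_DC_NS}}}dc")
    fields = [
        ("title", title),
        ("creator", creator),
        ("type", "MovingImage"),  # term from the DCMI Type Vocabulary
        ("identifier", video_url),
        ("description", description),
    ]
    for name, value in fields:
        if value:  # leave out empty fields
            ET.SubElement(root, f"{{{DC_NS}}}{name}").text = value
    return ET.tostring(root, encoding="unicode")


# Hypothetical values, standing in for what a faculty member enters on the form.
record = build_dc_record(
    "Sample lecture video",
    "Prof. Example",
    "https://www.youtube.com/watch?v=VIDEO_ID",
)
```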
PULL E-Shelf folders into Blackboard
Primo E-Shelf Folders built by Faculty Member
PULL feature
E-Shelf Folders in Blackboard Course Documents Page
With additional instructions
* Dynamically linked!!!
YouTube Videos
History
The professor was thrilled.
PUSH feature of Primo/Blackboard interoperability
Side note: We added Primo Searching (and Library Resources) into Blackboard
How the Primo/Bb Project Comes Around "FULL CIRCLE" with our repository development
• Teachers provide digital objects or request one (NU scanning project). These objects are made available through Primo via "harvesting."
• Teachers pull or push this material into their Blackboard Course Materials pages.
• Students use the material in a classroom environment.
Primo/Blackboard Videos
YouTube : Northwestern Primo/Blackboard PULL Integration Demo http://youtu.be/dq_C4AAWF2U
YouTube : Northwestern Primo/Blackboard PUSH Integration Demo http://youtu.be/B6rSV4hgy5g
PLEASE USE BOTH LINKS (there are older vids on this site)
CANVAS ????
Northwestern is moving to Canvas this fall and will no longer be using Blackboard, so we are now looking at replicating these features in Canvas, possibly using LTI.
Our next major undertaking ! ! ! !
Blackboard
MODS Harvesting: MODS → Primo (Winterton Collection)
The Winterton Collection (using MODS)
Why MODS ?
We chose MODS because it lets us document “hierarchical” metadata.
Provenance!!! Important to Special Collections and Archives.
Winterton case.
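One common way MODS expresses hierarchy is by nesting relatedItem elements with type="host", so that an item record carries its parent and grandparent with it. The slides do not show how the Winterton records encode this, so the Python sketch below is only an assumed illustration, with an invented sample record.

```python
import xml.etree.ElementTree as ET

NS = {"mods": "http://www.loc.gov/mods/v3"}

# Invented sample: a photograph inside an album inside a collection.
SAMPLE = """<mods xmlns="http://www.loc.gov/mods/v3">
  <titleInfo><title>Photograph 12</title></titleInfo>
  <relatedItem type="host">
    <titleInfo><title>Album 3</title></titleInfo>
    <relatedItem type="host">
      <titleInfo><title>Example Collection</title></titleInfo>
    </relatedItem>
  </relatedItem>
</mods>"""


def hierarchy(mods_xml):
    """Return titles from the item upward through each nested host relatedItem."""
    node = ET.fromstring(mods_xml)
    titles = []
    while node is not None:
        # Only look at this level's own titleInfo, not the nested ones.
        title = node.find("mods:titleInfo/mods:title", NS)
        if title is not None and title.text:
            titles.append(title.text)
        node = node.find("mods:relatedItem[@type='host']", NS)
    return titles
```

For the sample above, hierarchy(SAMPLE) walks from the photograph to the album to the collection, which is the provenance trail that a flat record would lose.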
EAD/MODS to PNX/Primo: Winterton Collection
.1. Extract PIDs File: Use an ITQL or SPARQL query against FEDORA to create a file with PIDs. Output: PIDs file.
.2. Extract Fedora Metadata (MODS metadata export): A Ruby/Rails script extracts the Fedora Objects metadata schema to xml files. Inside the Fedora Objects, XSLT produces MODS from EAD. Output: MODS datafile (xml).
.4. MODS to PNX: A Ruby/Rails script with XSLT converts the MODS xml files into a single PNX file. Output: PNX datafile.
.5. Primo Harvest: XML File Splitter & Harvest. Primo BackOffice loads the PNX datafile using the file splitter and a basic XML pipe. Output: PNX DB in PRIMO.
Winterton Process
The Winterton records in Fedora are stored, along with corresponding images, as Fedora Objects. These objects include multiple files, including descriptive metadata. A Fedora object might (theoretically) have a tiff file, a jpeg file, and a pdf.
First, we created an XSLT script that created the MODS portion of the Fedora objects; this is the record form one sees on the Digital Collections site.
Then we created a Ruby on Rails script that exports the MODS files out of Fedora.
Then we created a Ruby/Rails script that uses an XSLT script to transform the MODS records into PNX records.
Finally, we ingest the PNX records into Primo using the default XML file splitter and pipe.
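The MODS-to-PNX step can be pictured with the Python sketch below. It is a stand-in for the Ruby/XSLT script, not a copy of it. The PNX section and field names used (control, display, links, sourceid, sourcerecordid, recordid, linktorsrc) are standard PNX names, but which MODS elements feed them, the "image" resource type, and the records wrapper element are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

NS = {"mods": "http://www.loc.gov/mods/v3"}


def mods_to_pnx(mods_xml, source_id):
    """Map a handful of MODS fields onto a minimal PNX-style record element."""
    mods = ET.fromstring(mods_xml)
    record = ET.Element("record")
    control = ET.SubElement(record, "control")
    display = ET.SubElement(record, "display")
    links = ET.SubElement(record, "links")

    # Assumption: the MODS identifier holds the Fedora PID.
    ident = mods.findtext("mods:identifier", default="", namespaces=NS)
    ET.SubElement(control, "sourceid").text = source_id
    ET.SubElement(control, "sourcerecordid").text = ident
    ET.SubElement(control, "recordid").text = f"{source_id}{ident}"

    ET.SubElement(display, "type").text = "image"  # assumed resource type
    ET.SubElement(display, "title").text = mods.findtext(
        "mods:titleInfo/mods:title", default="", namespaces=NS
    )

    url = mods.findtext("mods:location/mods:url", default="", namespaces=NS)
    if url:
        # PNX link fields carry the URL after a $$U subfield marker.
        ET.SubElement(links, "linktorsrc").text = f"$$U{url}"
    return record


def build_pnx_file(mods_documents, source_id):
    """Combine many MODS documents into one XML file of PNX-style records."""
    root = ET.Element("records")  # assumed wrapper element for the file splitter
    for doc in mods_documents:
        root.append(mods_to_pnx(doc, source_id))
    return ET.tostring(root, encoding="unicode")
```

Producing one combined file matches the "single PNX file" described in the workflow; the file splitter then breaks it back into individual records at harvest time.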
Primo Record Discovery – Details Tab
Primo Record Discovery – Online Resource Tab
Parent of image
Actual image (item record)
Actual Photo display
Click on “thumbnail” or image displayed in Online Resource or Details tabs.
Parent of actual image record (hierarchy provided)
Click on the “Link to source in Winterton Collection of East African Photographs” link in the
“Online Resource” or “Details” tabs. This will display the parent page of the individual record
(shows where the image came from).
Image on parent page
Clicking on Component Page provides Picture detail
…. Plus metadata for the picture.
So, Why Is This So GREAT!!!
This harvesting method incorporates hierarchical metadata into Primo for our collections, and provides another access point for patrons who never knew about the Winterton Collection. So now, patrons can look at the digital object directly in a webpage, or they can go to the Winterton Repository website via Primo and be placed right into the collection website so they can look at other related objects.
EAD Finding Aids (Archon) into Primo
EAD to PNX/Primo: Archon – Finding Aids
.1. Extract PIDs File: Use an ITQL or SPARQL query against FEDORA to create a file with PIDs. Output: PIDs file.
.2. Extract Fedora Metadata (nonMODS metadata export): A Ruby/Rails script extracts the Fedora Objects metadata schema (EAD) to xml files. Output: Fedora export file.
.3. Schema Conversions: A Ruby/Rails script with XSLT converts the nonMODS metadata to MODS xml files. Output: MODS datafile (xml).
.4. MODS to PNX: A Ruby/Rails script with XSLT converts the MODS xml files into a single PNX file. Output: PNX datafile.
.5. Primo Harvest: XML File Splitter & Harvest. Primo BackOffice loads the PNX datafile using a basic XML pipe. Output: PNX DB in PRIMO.
ARCHON (Finding Aids): publishes archival descriptive information
EAD: metadata for the Finding Aids (inventories, indexes or guides)
Archival and Manuscript Collection Portal: Over 400 collections are accessible using EAD Finding Aids
Just listings – indexes – guides – inventories
Getting from here…. To here….
Archon EAD record ingested into Fedora
Extract PIDs file from FEDORA using ITQL query
Ruby on Rails script uses PIDs file to extract the entire EAD record from FEDORA
EAD record extracted from FEDORA
MODS records created from descriptive section of EAD (XSLT script)
XSLT script transforms records into PNX data file
Set up pipe and simple normalization rules that copy the PNX data file into Primo
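The "MODS records created from descriptive section of EAD" step is done with XSLT in the project. A rough Python equivalent is sketched below to show the idea. It assumes DTD-style EAD 2002 without an XML namespace, and the particular field mapping (unittitle, unitdate, abstract, unitid) is an illustrative guess rather than the project's actual mapping.

```python
import xml.etree.ElementTree as ET

MODS_NS = "http://www.loc.gov/mods/v3"
ET.register_namespace("mods", MODS_NS)


def _clean(node):
    """Collapse an element's text (including nested markup) onto one line."""
    return " ".join("".join(node.itertext()).split()) if node is not None else ""


def ead_to_mods(ead_xml):
    """Map a few fields from the EAD archdesc/did block to a MODS record."""
    # Assumption: EAD 2002 in DTD form, so element names carry no namespace.
    did = ET.fromstring(ead_xml).find("archdesc/did")
    if did is None:
        raise ValueError("no archdesc/did section found")
    mods = ET.Element(f"{{{MODS_NS}}}mods")

    title = _clean(did.find("unittitle"))
    if title:
        title_info = ET.SubElement(mods, f"{{{MODS_NS}}}titleInfo")
        ET.SubElement(title_info, f"{{{MODS_NS}}}title").text = title

    date = _clean(did.find("unitdate"))
    if date:
        origin = ET.SubElement(mods, f"{{{MODS_NS}}}originInfo")
        ET.SubElement(origin, f"{{{MODS_NS}}}dateCreated").text = date

    abstract = _clean(did.find("abstract"))
    if abstract:
        ET.SubElement(mods, f"{{{MODS_NS}}}abstract").text = abstract

    unitid = _clean(did.find("unitid"))
    if unitid:
        ET.SubElement(mods, f"{{{MODS_NS}}}identifier").text = unitid

    return ET.tostring(mods, encoding="unicode")


# Invented finding aid, reduced to the descriptive block.
SAMPLE_EAD = """<ead><archdesc level="collection"><did>
  <unittitle>Example Family Papers</unittitle>
  <unitdate>1900-1950</unitdate>
  <unitid>EX-001</unitid>
  <abstract>Letters and   photographs.</abstract>
</did></archdesc></ead>"""
```

Only the collection-level description is mapped here; the container lists further down a finding aid are left out, which fits the idea of Primo as an access point that hands the patron over to the full finding aid.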
Finding aids in Primo
Another ACCESS POINT ! ! !
REVIEW SLIDE: EAD to PNX/Primo (Archon – Finding Aids)
.1. Extract PIDs File: ITQL or SPARQL query against FEDORA creates the PIDs file.
.2. Extract Fedora Metadata (nonMODS metadata export): Ruby/Rails script extracts the EAD metadata to xml files (Fedora export file).
.3. Schema Conversions: Ruby/Rails script with XSLT converts the nonMODS metadata to MODS xml files (MODS datafile).
.4. MODS to PNX: Ruby/Rails script with XSLT converts the MODS xml files into a single PNX file (PNX datafile).
.5. Primo Harvest: Primo BackOffice loads the PNX datafile using the XML File Splitter and a basic XML pipe (PNX DB).
METS Metadata Schema into Primo (African Maps)
METS/MODS to PNX/Primo: Africana Maps Collection
.1. Extract PIDs File: Use an ITQL or SPARQL query against FEDORA to create a file with PIDs. Output: PIDs file.
.2. Extract Fedora Metadata (MODS metadata export): A Ruby/Rails script extracts the Fedora Objects metadata schema to xml files. The Africana Maps Fedora Objects hold METS, MODS and DC. Output: MODS datafile (xml).
.4. MODS to PNX: A Ruby/Rails script with XSLT converts the MODS xml files into a single PNX file. Output: PNX datafile.
.5. Primo Harvest: XML File Splitter & Harvest. Primo BackOffice loads the PNX datafile using the file splitter and a basic XML pipe. Output: PNX DB in PRIMO.
Africana Maps Process
In our Fedora Africana Maps Collection, the Fedora Objects contain METS files that function as a wrapper around MODS and Dublin Core metadata.
The METS file contains a descriptive metadata section (dmdSec) that contains MODS metadata and Dublin Core metadata. The MODS and DC metadata were created by using the VB script behind Gary Strawn’s Cataloger’s Toolkit program (not the actual program menu itself).
.1. In short, MODS and DC metadata are contained within the METS metadata schema.
.2. Then, a Ruby/Rails script accesses the MODS metadata (within METS) and creates a MODS XML file.
.3. Then, a Ruby/Rails script is run with an XSLT file that converts the MODS XML to a PNX file.
.4. Using the Primo XML File Splitter, the Africana Maps PNX file is harvested by Primo.
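Pulling the MODS out of its METS wrapper, as in the second step above, might look like the Python sketch below. It relies on the standard METS layout (dmdSec, mdWrap with an MDTYPE attribute, xmlData); the sample document is invented, and the project's own script is written in Ruby.

```python
import xml.etree.ElementTree as ET

NS = {
    "mets": "http://www.loc.gov/METS/",
    "mods": "http://www.loc.gov/mods/v3",
}
ET.register_namespace("mods", NS["mods"])


def extract_mods(mets_xml):
    """Return every MODS record wrapped inside a METS document, as XML text."""
    root = ET.fromstring(mets_xml)
    # Only mdWrap elements flagged as MODS are followed; DC wrappers are skipped.
    path = ".//mets:mdWrap[@MDTYPE='MODS']/mets:xmlData/mods:mods"
    return [ET.tostring(m, encoding="unicode") for m in root.findall(path, NS)]


# Invented METS wrapper holding one MODS record and one Dublin Core record.
SAMPLE_METS = """<mets:mets xmlns:mets="http://www.loc.gov/METS/"
    xmlns:mods="http://www.loc.gov/mods/v3"
    xmlns:dc="http://purl.org/dc/elements/1.1/">
  <mets:dmdSec ID="DMD1">
    <mets:mdWrap MDTYPE="MODS"><mets:xmlData>
      <mods:mods><mods:titleInfo><mods:title>Sample map</mods:title></mods:titleInfo></mods:mods>
    </mets:xmlData></mets:mdWrap>
  </mets:dmdSec>
  <mets:dmdSec ID="DMD2">
    <mets:mdWrap MDTYPE="DC"><mets:xmlData>
      <dc:title>Sample map</dc:title>
    </mets:xmlData></mets:mdWrap>
  </mets:dmdSec>
</mets:mets>"""
```

Each string returned by extract_mods(SAMPLE_METS) is a standalone MODS record that can be written to the MODS XML file for the PNX conversion.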
AFRI-MAPS : METS Wrapper with MODS
Primo Record for African Map Collection
View and Zoom in Primo VIEW ONLINE tab
MODS: African Posters into Primo
EAD/MODS to PNX/Primo: African Posters Collection
.1. Extract PIDs File: Use an ITQL or SPARQL query against FEDORA to create a file with PIDs. Output: PIDs file.
.2. Extract Fedora Metadata (MODS metadata export): A Ruby/Rails script extracts the Fedora Objects metadata schema to xml files. Inside the Fedora Objects, XSLT produces MODS from EAD. Output: MODS datafile (xml).
.4. MODS to PNX: A Ruby/Rails script with XSLT converts the MODS xml files into a single PNX file. Output: PNX datafile.
.5. Primo Harvest: XML File Splitter & Harvest. Primo BackOffice loads the PNX datafile using the file splitter and a basic XML pipe. Output: PNX DB in PRIMO.
Image appears in the Primo Online Resource tab
Linking takes the user to the image inside the Collection website (new “access”)
LibGuides into Primo
Springshare (LibGuides records): Send a link that will generate an XML file. Output: XML datafile.
XSLT script: Builds a mapping table from the libguide accounts (Excel table that links guide ID number with guide creator name).
PRIMO: Primo BackOffice loads the records using the file splitter and a basic XML pipe. Output: PNX DB.
LibGuides into Primo – why good!
• Use XSLT script to create mapping table that will link LibGuide creator to guide (XSLT script inputs: Accounts, Categories, Guides)
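The mapping table pairs each guide ID with the name of the guide's creator. The project builds it with XSLT; the Python sketch below shows the same join in miniature. The element names (account, guide, id, name, owner_id) are hypothetical, since the slides do not show the layout of the Springshare XML export.

```python
import csv
import io
import xml.etree.ElementTree as ET


def build_mapping_rows(export_xml):
    """Join guides to account names, giving (guide_id, creator_name) pairs."""
    root = ET.fromstring(export_xml)
    # Hypothetical layout: each <account> has <id> and <name>.
    owners = {a.findtext("id"): a.findtext("name") for a in root.iter("account")}
    rows = []
    for guide in root.iter("guide"):
        # Hypothetical layout: each <guide> points at its creator via <owner_id>.
        creator = owners.get(guide.findtext("owner_id"))
        if creator:
            rows.append((guide.findtext("id"), creator))
    return rows


def rows_to_csv(rows):
    """Write the pairs as a two-column table ready to load into a spreadsheet."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(["guide_id", "creator_name"])
    writer.writerows(rows)
    return buf.getvalue()


# Invented export with one account and two guides, one of them orphaned.
SAMPLE_EXPORT = """<libguides>
  <accounts><account><id>10</id><name>Pat Example</name></account></accounts>
  <guides>
    <guide><id>501</id><name>African Studies</name><owner_id>10</owner_id></guide>
    <guide><id>502</id><name>Orphan Guide</name><owner_id>99</owner_id></guide>
  </guides>
</libguides>"""
```

Guides whose owner cannot be found are left out of the table rather than mapped to an empty name, so a lookup against the table never yields a blank creator.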
• Upload mapping table in Primo Back Office
• Configure file splitter
• Create normalization rules
• Configure data source
• Define pipe
Records appear in Primo
Records appear in Primo “VIEW ONLINE” tab
DID I SAY YET. . . . . . . . . . . .
What I have shown you today provides additional ACCESS POINTS to your digital collections such that users can find your digital collection, LibGuides, AND Finding Aids, without ever knowing where they are!!! And once Primo links take them to the collection website (that they did not know existed), they will discover MORE!!!
Session Review
.1. Digital Objects and Digital Collections :
What is our Plan ? Why Use Primo ?
.2. Fedora and Primo workflow / Integrated with Blackboard
.3. Fun Item : Dublin Core : YouTube Videos and Blackboard Canvas ???
.4. MODS schema extract and Ingestion (Winterton Collection)
.5. EAD to Primo (Finding Aids)
.6. METS to Primo : African Maps Collection
.7. MODS to Primo : African Posters
.8. LibGuides XML into Primo
What’s Next
.1. Deep Search setup with News Archives from Vanderbilt University.
.2. More Digital Collections at Northwestern.
.3. Looking at ingesting external collections of interest (Africana).
Example : DPLA collections harvest ??? Smithsonian ???
Thank You
Michael North