IMPLEMENTATION EXPERIENCE OF OPEN ACCESS INITIATIVES AT THE UNIVERSITY Digital Library Research Group Faculty of Computer Science & Information Technology University of Malaya.

Download Report

Transcript IMPLEMENTATION EXPERIENCE OF OPEN ACCESS INITIATIVES AT THE UNIVERSITY Digital Library Research Group Faculty of Computer Science & Information Technology University of Malaya.

IMPLEMENTATION EXPERIENCE
OF OPEN ACCESS INITIATIVES AT
THE UNIVERSITY
Digital Library Research Group
Faculty of Computer Science & Information Technology
University of Malaya
Open Access Publication
Condition 1: The Golden Road
1 The author(s) and copyright holder(s) grant(s) to
all users a free, irrevocable, worldwide,
perpetual right of access to, and a license to
copy, use, distribute, transmit and display the
work publicly and to make and distribute
derivative works, in any digital medium for any
responsible purpose, subject to proper
attribution of authorship, as well as the right to
make small numbers of printed copies for their
personal use.
Open Access Publication
 Condition 2: The Green Road (for IR)
 A complete version of the work and all supplemental
materials, including a copy of the permission as
stated above, in a suitable standard electronic
format is deposited immediately upon initial
publication in at least one online repository that is
supported by an academic institution, scholarly
society, government agency, or other wellestablished organization that seeks to enable open
access, unrestricted distribution, interoperability,
and long-term archiving
OPEN ACCESS INITIATIVES AT FSKTM
OPEN ACCESS DIGITAL LIBRARY : http://dspace.fsktm.um.edu.my
MyAIS : http://myais.fsktm.um.edu.my
DIGITAL LIBRARY OF MALAY MANUSKRIP : http:/mymanuskrip.fsktm.um.edu.my
Why Open Access…..
(for researchers)
• Dissemination
• Increased visibility (Google, Google Scholar
OAI…)
• More visibility leads to more citations
• Research impact
• Preservation
• Control / Monitoring of one’s own Publications
Why Open Access…….
(for institutions)
• Pooling the Organizational Intellectual Capital
- One Stop Source / Point for the research
output of an Institution
• Scope for Introspection / Strategies / Action
Plan
• Generation of reports
• Long term preservation
Introducing Dspace :
The Road Map
Digital Library Research Group@ FSKTM
welcomes you
to the
Wonderful World of DSpace
Outline
• What is DSpace and what does it do?
• The DSpace information model
• Components & features of DSpace
What is DSpace?
• DSpace is a Digital Repository System
– Institutional Repositories
– Learning Object Repositories
• Open source development model
• At the moment (2009-01-12) there are approximately
– 334 instances
– in 56 countries
– with 2.716.897 documents
(source: Dspace Wiki –
http://wiki.dspace.org/DspaceInstances)
What does it do?
• Captures
– Digital research material directly from the creators
• Describes
– Allows descriptive, technical, and rights metadata
– Assigns persistent identifiers
• Distributes
– Searches metadata & full text
– Delivers content over the web
• Preserves
– Content in supported formats for long term
preservation
The DSpace Information Model
Information Model
• Communities & Collections
– Hierarchical organization of items in the repository
• Items
– Logical units of content
– Receive persistent identifiers
• Bitstreams & Bundles
– Individual digital files
Community & Collection
Relationships
Community
Community
Collection
Collection
Item
Item
Collection
Item
Item
Item
Item
Item
Communities & Collections
• Collections and Communities organize items
into a hierarchical form
• Metadata:
– Limited descriptive metadata available
• Name, description, license, etc…
• Example:
Communities
— Faculty of Education
— Digital Library Research Group
Collections
— A Faculty’s Academic Exercise
— A research group’s publications
Item Composition
Dublin Core
metadata
Item
Bundle
Bitstream
Bundle
Bitstream
Bundle
Bitstream
Items
• Items are logical units of content
• Metadata (Descriptive)
– All items have qualified Dublin Core metadata
– May contain metadata in other formats encoded as a
bitstream
• Example:
— Theses & Dissertations
— Book
— Web page (Images, CSS, HTML)
— Photographs
Bitstreams
• Bitstreams are Individual Digital files
• Metadata (Technical)
– Limited descriptive metadata available
• name, file format, size, etc…
• Example:
— PDF file
— Word document
— JPEG picture
— Executable program
— HTML file
— CSS file
Bundles
• Bundles group related bitstreams together
• Metadata:
– No metadata
• Example:
– HTML files and images that compose a single
HTML document may be organized into a bundle
– Typical bundles are:
— ORIGINAL
— THUMBNAILS
— TEXT
— LICENSE
— CC_LICENSE
Components & Features of
DSpace
Item Metadata
• Descriptive
– Qualified Dublin Core
– Any other format may be added as a bitstream
• However, it will not be searchable
• Administrative
– Who can access, remove, or modify an item
– Stored in the database, no standard format used
• Structural
– Very basic
– What bitstreams are contained in an item
– What collections and communities does an item belong
too
Dublin Core registry
• Maintain what metadata fields may exist for an
item in DSpace.
• Three components
– Schema (new)
– Element
– Qualifier
– Scope Note
Format Registry
• Maintain a registry of file formats for
bitstreams e.g. TIFF, SGML/XML,PDF
• Three levels:
– Supported (fully supports the format)
– Known (recognize the format, cannot guarantee
full support
– Unknown (cannot recognize a format; these will
be listed as "application/octet-stream)
Handle System
• Provides a persistent identifier
• Standard URL’s change in cases of:
– Hardware or software changes
– Political changes
– Network changes
• Handles attempt to address these problems by creating a
permanent URL independent of the repository.
• Example:
– http://hdl.handle.net/1969.1/3356
E-People
• DSpace user accounts are called E-people
• If given the permission, an e-person may:
– Login to the site
– Sign up to receive notifications about changes to a
collection
– Submit new items to collections
– Administer collections/communities
– Administer the DSpace site.
Authorization
• The DSpace authorization system enables administrators to
give e-people the ability to perform the following operations
on an object.
– Add / Remove
• Enable an e-person to add or remove any object (community, collection,
item)
– Collection Administrator
• Enable an e-person to edit an item’s metadata, withdraw items, or map
items into the collection.
– Write
• Enable an e-person to add or remove bitstreams
– Read
• Enable an e-person to read bitstreams
Ingestion
• Ingestion = getting stuff into DSpace
• Batch import
– Many at a time
– Needs to be in a specific format
• XML encoded metadata
• Bitstreams
• Web based submission
– One at a time
– Workflow processes
Workflow
(Approval/Moderation)
• Step 1: May reject the submission
• Step 2: Edit metadata or reject
• Step 3: Edit Metadata
Search & Browse
• Users may browse any item in DSpace
–
–
–
–
–
Title
Author
Date
Community / Collection
Subject (new)
• Users may search for any item in DSpace based
upon any Dublin Core value or a full text search.
OAI-PMH
• Enables other sites to harvest metadata from
a DSpace repository
• Collections are exposed as OAI sets
• Only Dublin Core metadata is available
Statistics
• Analyses the DSpace logs to generate a set of statistics on
how DSpace is being used.
• Metrics collected:
–
–
–
–
–
–
Number of item visits
Number of collection visits
Number of community visits
Number of OAI requests
Number of logins
Most popular searches
• Presented in a by-month form or in-total form.
Strengths of DSpace
• Communities / collections
• Backed up by MIT and HP
• Strong workflow support
• Handle-based identifier
• Better articulation of preservation strategy
• Default support for qualified DC
• User Groups, Lists …
Current Challenges – what we found out…..
 No reporting facilities
• Usage of content
• Repository content
• Repository management (Community/Collection administrators)
• Submission workflow (abandoned submissions)
 Cumbersome Community/Collection structure for browsing content
• Structure inconsistent
 Empty Collections
 Metadata inconsistencies
 Browsing options with limited utility (e.g. Date)
 No user-defined sort order for search results
 Limited metadata displayed in search results
 Authors must be ‘authorized’ to submit
 Little visible evolution in functionality or service
 Promotion and advocacy building
Future work for Dspace
 What services might provide added value?
• Citation Management System?
-
Easy maintenance of Research Publications on DLC/faculty web-sites
Easy ‘push’ facilities to the Dspace IR
Links to repository content and/or OpenURL for published version
Easily distributable to peers
• Citation Analysis Services?
-
Who’s citing my work?
How long after publication does it take for my work to be cited by others?
How long does it take for my work to become cited in developing countries?
Where are my peers publishing worldwide? With open-access publishers?
• Data Management Services?
- Long-term archiving of research data
- Repository to meet funding agency requirements
• Personal/collaborative workspaces?
• Discipline-based, harvested content?
• Metadata creation/harvesting services?
What libraries can do.....
 About DSpace…
• DSpace is a product. Libraries should think about this as a core
infrastructure component to support the evolving service needs.
• DSpace@library is a service. The current service is defined by:
- Policies for collection content, open access, contributor authorization,
long-term archiving mission, etc.
 What should the goal be?
• To achieve the widespread adoption of the service by the faculty and
research communities to secure and provide open access to the research
output of the Institute.
 How should libraries do this?
• Focus on the needs of the customers
• Embed the repository into a broader suite of services that meet faculty
needs
 The Dspace@library can support the needs of…
• Authors
• Researchers
• Educators
• Advisors
• Administrators
• Collaborators
• Subject experts within a
global community
• Grant funders
“It must be remembered that there is nothing more difficult to
plan, more doubtful of success, nor more dangerous to
manage, than the creation of a new system.
For the initiator has the enmity of all who would profit by the
preservation of the old institutions and merely lukewarm
defenders in those who would gain by the new ones.”
Machiavelli
Thank you
Associate Prof Dr Abrizah Abdullah
[email protected]