Transcript Document

Giri Palanisamy
Oak Ridge National Laboratory
&
Lorrie Apple Johnson
U.S. Department of Energy
October 16, 2013
• Located at Oak Ridge
National Laboratory
(ORNL)
• Part of Climate Change
Science Institute
• ARM – www.arm.gov
OSTI has the corporate responsibility for ensuring
appropriate access to the U.S. Department of
Energy’s (DOE) R&D results.
• DOE invests over $10 billion/year in basic sciences, clean energy
technology, nuclear research.
• The immediate output from this investment is information… knowledge…
R&D results in many formats, including digital data.
• OSTI’s mission is to accelerate scientific progress by accelerating access
to this information.
Energy Policy Act of 2005
“The Secretary, through the Office of Scientific and Technical Information, shall maintain within
the Department publicly available collections of scientific and technical information resulting
from research, development, demonstration, and commercial applications activities supported
by the Department.”
Type of Data – Atmospheric processes, cloud
dynamics
Products - > 3,000
Archive Size - > 300 TB
Users/year - ~ 1,500
Year Started - 1991
 Southern Great Plains (1993)
 North Slope of Alaska: Barrow (1998)
and Atqasuk (1999)
 Tropical Western Pacific: Manus
(1996), Nauru (1998), and Darwin
(2002)
 First ARM Mobile Facility (2005);
Second ARM Mobile Facility (2010)
 ARM Aerial Facility (2007)
Hard to FIND
Hard to NAVIGATE
Hard to CITE
Millions of data files from over 3,000 data
products.
Most of them are continuous data
streams.
Large user community and complex use of
data (climate change modeling).
Data is also published via other portals.
Data should be cited in just the same way that other sources of
information, such as articles and books, are cited.
Data citation can help by:
 enabling easy reuse and verification of data
 allowing the impact of data to be tracked
 creating a scholarly structure that recognizes and rewards data producers
To allow users to cite the exact ARM data used in
their research publications
To allow future data users, and the project, to
easily track the data used in various articles
Strategy:
 DOI’s assigned at the ARM data product level,
and presented in the ARM data stream pages
and field campaign readme files
 DOI’s also sent via Archive data notification
emails
What is DataCite?
 A global consortium composed
of local institutions focused on
improving the scholarly
infrastructure around datasets
and other non-textual
information.
 A service for assigning Digital
Object Identification (DOIs) and
metadata to datasets.
DataCite (www.datacite.org) helps researchers find, access and
reuse data.
DOE Data ID Service
•
DOE/OSTI is the only U.S. federal member of DataCite.
•
Interagency agreement in place with NIH project; in
discussions with eight agencies representing 15 projects.
•
OSTI Partnered with Oak Ridge National Laboratory to pioneer procedure.
•
First DOI for a DOE dataset was minted and registered with DataCite
on 8/10/2011.
•
DOE Atmospheric Radiation Measurement (ARM) has now registered over
545 datastreams, each representing hundreds of subordinate data files.
•
Currently working with 6 DOE data centers, including ARM. Two are fully
integrated; 4 others in testing or planning phases.
 Easier identification and access of datasets across the
international community of researchers via DataCite’s
resolving tools
 Linkage between DOE’s R&D documents and the
underlying datasets generated by the research
 Standard format for including
data in the accepted bibliographic
citation framework
 Aid researchers in locating exact
datasets used in previous work,
thus allowing verification of
results or new uses for the data
•Originating Research
Organization
•Dataset Type
Data Citation
metadata submitted to
DOE-OSTI
=
•Dataset Title
•Dataset Creator/Author or
Principal Investigator
•Dataset Product Number
•DOE Contract/Award Number
Web
Service
API
•Publication/ Issue Date
•Sponsoring Organization
•URL where the Dataset is
posted for access
•Contact information
241.6
AN
DOI Assigned By
DOE-OSTI
DOE-OSTI submits nightly
feed of new
DOIs to DataCite
DataCite
Registers DOI
Creator/Author, Primary
Investigator, or
Submitter notified of
Data Citation availability
Data Citation
submitted to
search engines
for indexing
DOE-OSTI updates
metadata record with DOI
creating a full
Data Citation
DataCite validates
DOI registration with
DOE-OSTI
•Dataset Type
•Dataset Title
•Dataset Creator/Author
or Principal Investigator
•Dataset Product Number
•DOE Contract/Award
Number
•Originating Research
Organization
•Publication/ Issue Date
•Sponsoring Organization
•URL where the Dataset
is posted for access
•Contact information
Federated Searching
Since science is not bound by agency,
organization, or geography…
• We integrate or aggregate multiple government R&D-related
databases into single-search portals.
• Innovative technology drills down to selected databases and
websites in parallel, then presents ranked search results.
WorldWideScience.org
Enabling Access to Global R&D Results
Research results from 70+ countries are
searchable via single-query global
science portal.
•
•
Multilingual translations capability for 10
languages.
More than 400 million pages of scientific and
technical information, including:
•
•
•
Text
Multimedia
Data
Several citation formats are possible using DOI’s. ARM
encourages users to include the following information
when citing ARM data:
Author
Original publication date
Update period, if applicable (daily, monthly, etc.)
Dataset name
Dates used
Location (latitude/longitude, site name, and facility identifier)
Editor(s) or compiler(s)
Place of publication
Publisher
Date accessed
DOI
ORNL DAAC: Data Products used in literature
ORNL DAAC requests
that data be cited in
list of references;
some authors “refer”
to data in text or
acknowledgements
Thank you!
Giri Palanisamy
Oak Ridge National Laboratory
[email protected]
Lorrie Johnson
U.S. Department of Energy
Office of Scientific and Technical Information
[email protected]