The WorldWideScience Alliance: An International Partnership to Improve Access to Scientific and Technical Information Lorrie A.

The WorldWideScience Alliance:
An International Partnership to Improve Access
to Scientific and Technical Information
Lorrie A. Johnson
United States Department of Energy
Office of Scientific and Technical Information
• What is WorldWideScience.org?
• Launch of WorldWideScience.org and Formation of the
WorldWideScience Alliance
• Search Technology
• Multilingual Translations
• Integration of Multimedia and Scientific Research Data
• Role in Public Access
• Future Directions
What is WorldWideScience.org?
• A global science gateway comprising
more than 100 national and international
scientific databases and portals
• More than 70 countries are
represented
History and Formation of the
WorldWideScience Alliance
• The WorldWideScience.org concept grew out of the
Science.gov model (2006)
• Initial partnership between the U.S. Department of Energy
and the British Library (2007)
• Transition to multilateral governance (the WorldWideScience
Alliance) and ICSTI* sponsorship (2008)
*International Council for Scientific
and Technical Information
The Search Technology Behind
WorldWideScience.org
Federated search technology:
• Searches the “deep web,” which may not be searchable by
major search engines
• Performs a real-time, simultaneous search of participating
databases
• Removes the need for users to know about every resource in advance
• Places little or no burden on database owners
• Returns a consolidated, relevance-ranked results list
• Links to original records at source databases, including full text
if available
• Integrates complementary technologies to further accelerate
scientific discovery
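The federated search flow listed above (simultaneous queries, one merged and ranked list, links back to each source) can be sketched in a few lines of Python. This is a minimal illustration and not the WorldWideScience.org implementation: the `SOURCES` data, the `example.org` URLs, and the term-frequency score are all assumptions made for the example.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical in-memory stand-ins for remote databases. A real gateway would
# send the query to each source's own search interface over the network.
SOURCES = {
    "source_a": [
        {"title": "Solar cell efficiency", "url": "https://example.org/a/1"},
        {"title": "Wind turbine blade fatigue", "url": "https://example.org/a/2"},
    ],
    "source_b": [
        {"title": "Perovskite solar cell stability under heat", "url": "https://example.org/b/1"},
        {"title": "Solar energy storage systems", "url": "https://example.org/b/2"},
    ],
}


def search_source(name, query):
    """Return the records of one source that mention any query term."""
    terms = query.lower().split()
    hits = []
    for record in SOURCES[name]:
        words = record["title"].lower().split()
        # Toy relevance score (assumption): share of title words that match.
        score = sum(words.count(term) for term in terms) / len(words)
        if score > 0:
            hits.append({**record, "source": name, "score": score})
    return hits


def federated_search(query):
    """Query every source at the same time, then merge and rank the hits."""
    with ThreadPoolExecutor(max_workers=len(SOURCES)) as pool:
        per_source = list(pool.map(lambda name: search_source(name, query), SOURCES))
    merged = [hit for hits in per_source for hit in hits]
    return sorted(merged, key=lambda hit: hit["score"], reverse=True)


results = federated_search("solar cell")
for hit in results:
    print(f'{hit["score"]:.2f}  {hit["source"]}  {hit["title"]}  {hit["url"]}')
```

Each hit keeps its source name and URL, so the merged list can still link back to the original record.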
A Measure of WorldWideScience.org’s
Uniqueness
• 33 sample queries launched
in Google, Google Scholar,
and WorldWideScience.org
• Similar numbers of results
from each, but very little overlap
• Among the “top 50” results
from each search engine,
only ~10% overlap (that is, 90%
uniqueness) in
WorldWideScience.org results
(Chart label: 90% “Unique”)
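The arithmetic behind an overlap figure of this kind can be illustrated as follows. The slide does not say how records were matched across engines, so this sketch assumes each result is reduced to a comparable identifier; the function name `uniqueness` and the sample identifiers are invented for the example and are not data from the study.

```python
def uniqueness(own_results, other_result_lists, k=50):
    """Share of the top-k results that appear in no other engine's top-k."""
    own_top = own_results[:k]
    if not own_top:
        return 0.0
    seen_elsewhere = set()
    for results in other_result_lists:
        seen_elsewhere.update(results[:k])
    unique = [item for item in own_top if item not in seen_elsewhere]
    return len(unique) / len(own_top)


# Made-up identifiers: 5 of the 50 gateway results also appear elsewhere.
gateway = [f"doc{i}" for i in range(50)]
engine_one = ["doc0", "doc1", "doc2"] + [f"one{i}" for i in range(47)]
engine_two = ["doc3", "doc4"] + [f"two{i}" for i in range(48)]

share = uniqueness(gateway, [engine_one, engine_two])
print(f"{share:.0%} unique")
```

A study over many queries would average this share across all of them.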
WorldWideScience.org Topic Pages
Utilizing the power of Google, Bing,
Yahoo, and other search engines:
• Millions of Topic Pages created
• Topic Pages expose distributed
WorldWideScience.org content to
commercial search engines via
sitemap protocols
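As a rough illustration of exposing pages through the sitemap protocol, the sketch below builds a minimal sitemap file with Python's standard library. The namespace is the one defined by the sitemaps.org protocol; the `build_sitemap` helper and the `example.org` topic page URLs are hypothetical and do not reflect how the actual Topic Pages are published.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls):
    """Return a sitemap XML string with one <url><loc> entry per page."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in urls:
        entry = ET.SubElement(urlset, "url")
        ET.SubElement(entry, "loc").text = url
    return ET.tostring(urlset, encoding="unicode")


# Hypothetical topic page addresses, for illustration only.
topic_urls = [
    "https://example.org/topicpages/solar-cells.html",
    "https://example.org/topicpages/gene-sequencing.html",
]
sitemap_xml = build_sitemap(topic_urls)
print(sitemap_xml)
```

A crawler that reads this file learns about every listed page without having to discover it through links.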
Multilingual Translations
The world’s first “one to many”
and “many to one” multilingual
translation tool in science
• Most automatic translation tools are
limited to translating from a single
language into another single
language
• WorldWideScience.org’s partnership
with Microsoft® Translator enables
true multilingual functionality
Multilingual Translations
Translating ten languages, with potential for more:
Arabic, Chinese (中文), German (Deutsch), English, Spanish (Español), French (Français), Japanese (日本語), Korean (한국어), Portuguese (Português), Russian (Русский)
User enters search term and selects language
Query is translated into languages of all databases, searches are completed, and
results are combined into a single, relevance-ranked results list
User translates results into their preferred language
Translated bibliographic record (English to German)
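The three steps above (translate the query into each database's language, search and merge, then translate results back for the user) can be sketched as follows. The production service relies on Microsoft Translator; this example does not call it and instead uses a tiny hypothetical word glossary, so `GLOSSARY`, `DATABASES`, and every function name here are assumptions for illustration.

```python
# Hypothetical word-level glossary standing in for a machine translation service.
GLOSSARY = {
    ("en", "de"): {"energy": "energie", "storage": "speicher"},
    ("en", "fr"): {"energy": "énergie", "storage": "stockage"},
}

# Hypothetical databases, keyed by the language of their records.
DATABASES = {
    "de": ["energie speicher in batterien", "windkraft anlagen"],
    "fr": ["stockage thermique", "physique nucléaire"],
    "en": ["energy storage review"],
}


def translate(text, src, dst):
    """Translate word by word; unknown words pass through unchanged."""
    if src == dst:
        return text
    table = GLOSSARY.get((src, dst))
    if table is None:
        # Fall back to the reversed glossary for the opposite direction.
        table = {v: k for k, v in GLOSSARY.get((dst, src), {}).items()}
    return " ".join(table.get(word, word) for word in text.lower().split())


def multilingual_search(query, user_lang):
    """One to many: translate the query per database, then merge and rank."""
    results = []
    for db_lang, records in DATABASES.items():
        terms = translate(query, user_lang, db_lang).split()
        for title in records:
            score = sum(term in title.split() for term in terms)
            if score:
                results.append({"title": title, "lang": db_lang, "score": score})
    return sorted(results, key=lambda r: r["score"], reverse=True)


def translate_results(results, user_lang):
    """Many to one: translate every result into the user's language."""
    return [translate(r["title"], r["lang"], user_lang) for r in results]


hits = multilingual_search("energy storage", "en")
print(translate_results(hits, "en"))
```

Keeping query translation and result translation as separate functions mirrors the workflow, where the user asks for translated results only after the merged list is shown.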
Multimedia and Speech Indexing
• Multimedia (e.g., video, audio, images)
represents a major emerging form of
scientific information
• Multimedia presents special opportunities
and challenges: lack of written transcripts,
minimal metadata,
scientific/technical/medical terminology,
and lengthy videos (>1 hour)
Multimedia Search Results
Videos accessible
Search terms identified via snippets
Users can play the exact segments of the videos where search terms were spoken
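One way to picture how a search can jump to the moment a term is spoken is a speech index of timestamped transcript segments. The sketch below is an assumption about the general approach, not a description of the indexing system used by WorldWideScience.org; the `Segment` type and the sample transcript are invented.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    start: float  # seconds from the beginning of the video
    end: float    # seconds from the beginning of the video
    text: str     # words recognized by speech-to-text for this span


def find_spoken(segments, term):
    """Return (start, end, snippet) for each segment where the term is spoken."""
    needle = term.lower()
    return [(s.start, s.end, s.text) for s in segments if needle in s.text.lower()]


# Invented transcript of a lecture recording.
transcript = [
    Segment(0.0, 12.5, "Welcome to this lecture on particle physics"),
    Segment(12.5, 30.0, "We begin with the Higgs boson and its discovery"),
    Segment(30.0, 48.0, "Detector calibration is covered next"),
    Segment(48.0, 65.0, "The Higgs field gives particles their mass"),
]

matches = find_spoken(transcript, "higgs")
for start, end, snippet in matches:
    print(f"{start:>6.1f}s to {end:>6.1f}s  {snippet}")
```

The start time of each match is what a player would seek to, and the segment text doubles as the snippet shown in the results list.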
Integration of Scientific Research Data
Scientific Research Data can be:
• Difficult to find, and
• Hard to navigate
(Slide image: excerpt of a raw codon table, e.g. TTT F Phe, TTC F Phe, TTA L Leu, shown as an example of hard-to-navigate data)
Results containing research and numeric datasets
A data result, showing the landing page for this dataset
Research data is accessible via the landing page
Data can be viewed or downloaded
WorldWideScience.org and Public Access
Within the United States:
• White House Office of Science and Technology Policy
(OSTP) Directive, February 2013 (addressed both
scholarly publications and digital research data)
• U.S. Department of Energy released its
public access plan in July 2014, and
launched the Public Access Gateway
for Energy & Science (PAGESBeta)
in August 2014
Public access results from the
U.S. Department of Energy’s PAGESBeta
Record from PAGESBeta
The Accepted Manuscript will be available 12 months after publication
Public Access as a Growth Opportunity for
WorldWideScience.org
Other U.S. federal agencies with public access plans include:
• U.S. Department of Agriculture
• Department of Defense
• National Aeronautics and Space Administration
• National Institute of Standards and Technology
• Centers for Disease Control
• Food and Drug Administration
• Agency for Healthcare Research and Quality
• National Institutes of Health
• National Oceanic and Atmospheric Administration
• National Science Foundation
• Office of the Assistant Secretary for Preparedness and Response
Canada: “Tri-agency Open Access Policy on Publications” released in February 2015
United Kingdom: Research Councils’ open access policy, in place for several years
Other countries: policies in progress
• As public and open access content grows, new resources will be
incorporated into WorldWideScience.org (scholarly publications and
digital research data)
• By extending federated searching to an international scale and
combining it with multilingual translations, WorldWideScience.org enables
access to a diverse array of scientific and technical content in traditional
and non-traditional formats
Thank you!
WorldWideScience.org Operating Agent
Lorrie Johnson
[email protected]