Transcript Document

Apple’s Knowledge Navigator (1989)
If the play button does not display, please click the link below:
http://video.google.com/videoplay?docid=-5144094928842683632#
1
Science, Government, and Information:
Finding and Communicating
the Results of Research
Bonnie C. Carroll
President, IIa
Feb. 25, 2011
College of Communication and Information
33rd Annual Symposium to Explore Science,
Risk Communication Research
What’s In the Title
Science, Government, and Information
A Report of the President’s Science Advisory Committee 1964
 Chaired by Alvin Weinberg, Director, ORNL
 Distinguished Panel of the Best and Brightest in Science
…”strong science and technology is a national necessity and
adequate communication is a prerequisite for strong science…”
– President John F. Kennedy
“Transfer of information is an inseparable
part of research and development.”
Challenge of coping with the growing volume
and variable quality of information.
 Information Analysis Centers as a solution
New roles for Scientists, documentalists, information
scientists
3
Alvin Weinberg,
Director, ORNL
Why Me On This?
SGI launched my interest in STI
Worked with Dr. Weinberg and ORNL ICs in
1970s
4 decades experience in STI and data
CENDI -- Executive Director for 20 years
 14 Federal agency STI managers - 97+% US
R&D budget
 White House Science Office (OSTP), NSTC
Interagency Working Groups
 Biodiversity and Ecosystems Informatics WG -Executive Secretary
 Interagency Working Group on Digital Data -Executive Secretary
Member of NAS/NRC BRDI/CODATA
 US National Rep to CODATA
4
Objectives of the Talk
Provide overview of government context in scientific
and technical information (STI) management
Briefly describe key factors driving change
Discuss key challenges in STI today
Describe some experiments going on
Start with some historical and local context
Make you smile
Disclaimer: I never promised you a research garden
5
A Primer on Federal STI
Executive
President
Legislative
PCAST
OMB
OSTP
Committees
NSTC
Mission Agencies
Agency
A
Agency
N…
Agency
NSF
NSB
Administration
•
•
•
•
R&D
CIO
Public
Affairs
Data
DOC
STI
Web
LC
National Libraries
Centralized Clearinghouses
Decentralized / Specialized Data / Info Center
Libraries / Digital Libraries
6
Authorization
Appropriations
Historical Perspective
By the end of the 1960’s, we had evolved an
effective STI Infrastructure.
 Published literature, technical reports, reviews,
A&I services
Weinberg report contributions
 Foresaw the issues
Followed by a series of transitions and reports
7
The Need for the Next
Generation System (1)
Transition from information systems to decision support
Forces are both technological push - user pull
Move from passive to active systems - extend
intellectual power
Needs of our age are in crisis management and decision
making
 Need to integrate STI and societal information (STSI)
1978 National Science Foundation Report,
“Passing the Threshold into the Information Age”
8
The Need for the Next
Generation System (2)
Need for interconnected national information technology network
New information infrastructure
 Stores information in forms accessible
Provided means of communication
Aids in information manipulation to create new insights and
knowledge
Provides human and computer assistant; i.e., “user friendly”
“Infory” of the Future:
 Distribute in physical sense
 Professional career structure to identify and respect
 Dissemination of reliable, personalized software
1978 National Science Foundation Report,
“Passing the Threshold into the Information Age”
9
By the 1980’s, the Theme was Dealing
with the Crisis of Information Overload
Irony of Information Age is not a problem of scarcity
Science (8/12/83) article shows that information availability is
increasing more rapidly than the ability to absorb it
Due to effectiveness of electronic access systems, too much
“relevant” information is identified
 Need the right information at the right time
With technological advances,
the problem became magnified
Algorithms for quality discriminations
are not yet developed
10
By the 1990’s, the Paradigm
in STI Began to Shift
Basic changes in structure of science itself
Increasing commercialization and
marketability of science
Advances in technology
Growth in volume of information
Lack of coordinated information policy and
federal STI leadership
11
Basic Changes in
Structure of Science Itself (1)
Disciplinary, mission, problem-solving
orientations
Blurring boundaries among science, social
science, politics, marketing, ethics
Specialization but linkages
New tools and new methods -- computational
sciences
12
Basic Changes in
Structure of Science Itself (2)
Big science, international science
Increase in commercialization and
marketability of science
Shift from military to industrial R&D
 Global politics
Changing user expectations
 Computer literacy of next generation
 User community initiatives
13
Question of the Month
on Internet (1993):
In what month will the number of bytes
run through the Internet exceed the
number of atoms in the Universe?
14
Electronic World Was Not a
Linear Extension of Print
Print world talks of “food for thought”
Networked world users will be the prey
and information is the predator
- Paul Peters
3rd NASA Foreign Acquisitions Workshop
1993
15
By 1999, We’re Reinventing our Roles
Agency STI programs redefining missions
Budget allocations are problematic -- again
Publishers are questioning primary/secondary
relationships
Authors are experimenting
User expectations are changing – the digital
native
Madison Avenue/Wall Street are “IN”
16
CENDI STI Definition
STI is information that derives from … research,
development, and deployment (RD&D) results of the efforts
of scientists and engineers and individuals supporting their
work. STI includes new theory and information obtained
from experimentation, observation, instrumentation, or
computation in the form of text, numeric data, or images.
STI may be further transformed, described, evaluated,
synthesized, and recorded in print, micrographic, magnetic,
optical, or other media to enhance its communication and its
usefulness and value to a wide spectrum of users and uses.
17
Rip Van Winkle
opening ones eyes 50 years later
Technological Change
Types and Volume of Content
Interoperability on steroids --everything is a bit
18
Rapidly advancing technologies have
opened new opportunities
“Five years ago, right after the
start of the personal computer
revolution, industry experts
observed that if the automobile
business had developed like the
computer business, a Rolls Royce
would cost $2.75 and go 3 million
miles on a gallon of gasoline.”
-- Fortune Magazine
August 1, 1998, p. 4
19
Information Life Cycle
20
1973
First mobile
phone
1990
1990
Internet
Web 3.0
Social
Software
Web 2.0
Search
2010
iPad
2006
• Twitter
• Cloud Computing
2001
Wikipedia
Commercialization
1980
2004
• Facebook
• Flickr
1995
• Client Servers
• http / www
Decentralization
Digitization
1970
1960
Networking
Online
C
H
A
N
G
E
1960
KWIC/
KWOC
1980
• Personal
Computers
• Desktop
Publishing
Computerization
of Publishing
Technologies Over Time
Change is happening faster!
2000
2010
1999
• Blogs
• Dot com
companies
21
2005
RSS feeds
2003
• Google
• LinkedIn
• Second Life
2012
2011
Watson
2008
Popularization
of Smart phones
Who’s a publisher?
The Information Industry is Transforming
Commercial businesses
Institutional depositories
Individual web sites
Individuals (blogs)
Preprint servers/networks
22
The www: Fastest growing new publishing
media of all time and medium of first resort
Surface Web
 Static, publicly available
Deep Web
 Dynamic, database driven
But the Key To Access Is Not Search, It’s Find
 Not the envelope but the information
23
Next Generation Science,
Government and Information
STI is again in focus for the 21st century
 Open access; Open government; management of
scientific data
 American Competes; Scientific integrity memo; PCAST
report 12/10 "Designing a Digital Future: Federally
Funded Research and Development in Networking and
Information Technology”
“Our Sputnik moment” – President Obama State
of the Union
 “Maintaining our leadership in research and
technology is crucial …we must invest in innovation,
education, and infrastructure….”
24
The Current Landscape
The products of science and the starting point for new
research are increasingly digital and increasingly “borndigital”;
Exploding volumes and rising demand for data use are
driven by the rapid pace of digital technology innovations;
All sectors of society are stakeholders in digital preservation
and access; and
A comprehensive framework for cooperation and
coordination to manage the risks to preservation of digital
data is missing.
Science is DATA INTENSIVE – The Fourth Paradigm – infer
meaning from data
25
Key Background to Where We Are Today*
* There are many reports that cover scientific data. These show a direct lineage to national policy
26
The Drivers: Why Do We Care?
“Empowered by an array of new digital technologies, science in the 21st century will be
conducted in a fully digital world. In this world, the power of digital information to
catalyze progress is limited only by the power of the human mind. Data are not
consumed by the ideas and innovations they spark but are an endless fuel for
creativity. A few bits, well found, can drive a giant leap of creativity. The power of a
data set is amplified by ingenuity through applications unimagined by the authors and
distant from the original field.”
--Harnessing Report
Technology is enabling -- Push
Data Intensive Science is the future -- Pull
27
What’s the Current Challenge?
Complexity: Heterogeneity and Volume
10s
100s
1,000s
iPod Shuffle
(up to 120
songs) = 512
MegaBytes
1 small novel =
1 MegaByte
Reference collections
•managed for long-term use by many users
Printed materials in the
Library of Congress =
10 TeraBytes
Resource or community database
collections,
•intermediate in duration,
standardization, and community of
users;
Research database
collections
Atmospheric Radiation
Measurement Program
(ARM) Data Archive =
41 TeraBytes
•specific to a single
investigator or research
project;
Kilo
103
Mega
106
Giga
109
Tera
1012
Peta
1015
Exa
1018
Yatta
1024
Large Hadon
Collider (LHC) =
15 PetaBytes
annually
All worldwide
information in
one year =
5 ExaBytes
28
Volume: How big is five exabytes?
5 exabytes of new information in 2002
 Print, film, magnetic and optical storage media
 92% on magnetic media
If digitized, the 19M books and other print
collections in the Library of Congress would
contain about 10TB of information
5 exabytes is equivalent in size to ½M new
libraries the size of the LC print collections.
file://///Herald/www/research/projects/how-much-info-2003/execsum.htm (1of14) [10/30/2003 4:11:03 PM]
29
IWGDD Vision and Strategy – Top Down
Vision: We envision a digital scientific data universe in
which data creation, collection, documentation,
analysis, preservation, and dissemination can be
appropriately, reliably, and readily managed.
 This will enhance the return on our nation’s research and
development investment by ensuring that digital data
realize their full potential as catalysts for progress in our
global information society.
Strategy: Create a comprehensive framework of
transparent, evolvable, extensible policies and
management and organizational structures that
provide reliable, effective access to the full spectrum of
public digital scientific data.
30
CENDI Vision – Bottoms Up Grand Challenge:
Information Fuels Discovery And Innovation
Data-intensive science, STEM education, and
workplace innovation will be fueled by a
transformed Federal science and technology
knowledge infrastructure – the nextgeneration Science.gov - to stimulate
economic health, create jobs, and
strengthen U.S. competitiveness in the
global marketplace.
Through the i-Science2Jobs interagency
initiative, the next-generation knowledge
platform will provide data, tools, and
services for scientific and economic progress.
Science.gov is the gateway to Federal science
information and research results from 18
scientific and technical organizations in 14
federal agencies.
31
Related Experiments Are Going On:
A Department of Energy Approach
Data Sets
STTR
Gray Literature Documents
Data Citation
TG
OSTI
Metadata and Data Repository
DOI
32
Interagency Case Study
Oak Ridge National Laboratory
Major Center for Environmental Scientific Data Management
 responsible for archiving, managing, and distributing data
 for enabling the distribution, use, and analysis of this data.
Data Repositories
 Atmospheric Radiation Measurement Archive
 Carbon Dioxide Information and Analysis Center
 ORNL Distributed Active Archive Center
Mercury Metadata Repository
Brazil
33
Project Office
LBA-Hydrometeorology
CDIAC
Interactive Science Publishing:
A Joint OSA-NLM Project
To evaluate the educational value of
ISP used within actual scholarly journal
articles
To explore the problems of archiving
this medium
To develop an interactive software and
curated database infrastructure
“Interactive Science Publishing”
To give authors the ability to submit
their own databases and ISP-enables
figures in actual peer-reviewed journal
articles
To give readers and editors the ability
to view, analyze, and interact with
source data published in conjunction
with an article
34
Making Data More Accessible
Policy, Culture and Management






National Policy -government taxpayers funded projects should be accessible
Enhanced metadata
Journals supporting links to some published data sets
“People getting the message that data has to be accessible.”
Increased involvement of libraries and lifecycle management of data
Younger generation post data as they go – expectation that data should be
shared
Technology Trends and Applications





Digital object management technology
Growth of scientific workflow software
Adaptation of “netcentric” way of doing business
Use of embedded links in publications
Increased number of portals serving data sets
35
Key Policy Initiatives
Agencies should lay foundations for digital scientific
data policy and make it available
Agencies should promote data management
planning including a DM Plan for all proposals
generating data
Create an NSTC subcommittee for
continuing cooperation and
coordination.
36
Workshop Objectives
Address two key objectives of
the IWGDD Harnessing Report
 Agencies should have a data policy
 Projects should have data
management plans
Bring in the Practitioner
perspective
 Policy analysts, operational user,
researcher, science manager + data
manager
 Share experience and best
practices
 87 people from 25 agencies
attended
37
What’s the Future?
Apple’s Knowledge Navigator (1989)
IBM’s Watson (2011)
Star Trek Computer (Stardate 8128)
38
Coffee, tea, or battery?
Or Questions?
Point of Contact:
Bonnie C. Carroll
President, IIa
104 Union Valley Rd.
Oak Ridge, TN 37830
(865) 298-1220
[email protected]
www.iiaweb.com
This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported
License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/ or send a
letter to Creative Commons, 171 Second Street, Suite 300, San Francisco, California, 94105, USA.
39