Transcript Document
Apple’s Knowledge Navigator (1989) If the play button does not display, please click the link below: http://video.google.com/videoplay?docid=-5144094928842683632# 1 Science, Government, and Information: Finding and Communicating the Results of Research Bonnie C. Carroll President, IIa Feb. 25, 2011 College of Communication and Information 33rd Annual Symposium to Explore Science, Risk Communication Research What’s In the Title Science, Government, and Information A Report of the President’s Science Advisory Committee 1964 Chaired by Alvin Weinberg, Director, ORNL Distinguished Panel of the Best and Brightest in Science …”strong science and technology is a national necessity and adequate communication is a prerequisite for strong science…” – President John F. Kennedy “Transfer of information is an inseparable part of research and development.” Challenge of coping with the growing volume and variable quality of information. Information Analysis Centers as a solution New roles for Scientists, documentalists, information scientists 3 Alvin Weinberg, Director, ORNL Why Me On This? SGI launched my interest in STI Worked with Dr. Weinberg and ORNL ICs in 1970s 4 decades experience in STI and data CENDI -- Executive Director for 20 years 14 Federal agency STI managers - 97+% US R&D budget White House Science Office (OSTP), NSTC Interagency Working Groups Biodiversity and Ecosystems Informatics WG -Executive Secretary Interagency Working Group on Digital Data -Executive Secretary Member of NAS/NRC BRDI/CODATA US National Rep to CODATA 4 Objectives of the Talk Provide overview of government context in scientific and technical information (STI) management Briefly describe key factors driving change Discuss key challenges in STI today Describe some experiments going on Start with some historical and local context Make you smile Disclaimer: I never promised you a research garden 5 A Primer on Federal STI Executive President Legislative PCAST OMB OSTP Committees NSTC Mission Agencies Agency A Agency N… Agency NSF NSB Administration • • • • R&D CIO Public Affairs Data DOC STI Web LC National Libraries Centralized Clearinghouses Decentralized / Specialized Data / Info Center Libraries / Digital Libraries 6 Authorization Appropriations Historical Perspective By the end of the 1960’s, we had evolved an effective STI Infrastructure. Published literature, technical reports, reviews, A&I services Weinberg report contributions Foresaw the issues Followed by a series of transitions and reports 7 The Need for the Next Generation System (1) Transition from information systems to decision support Forces are both technological push - user pull Move from passive to active systems - extend intellectual power Needs of our age are in crisis management and decision making Need to integrate STI and societal information (STSI) 1978 National Science Foundation Report, “Passing the Threshold into the Information Age” 8 The Need for the Next Generation System (2) Need for interconnected national information technology network New information infrastructure Stores information in forms accessible Provided means of communication Aids in information manipulation to create new insights and knowledge Provides human and computer assistant; i.e., “user friendly” “Infory” of the Future: Distribute in physical sense Professional career structure to identify and respect Dissemination of reliable, personalized software 1978 National Science Foundation Report, “Passing the Threshold into the Information Age” 9 By the 1980’s, the Theme was Dealing with the Crisis of Information Overload Irony of Information Age is not a problem of scarcity Science (8/12/83) article shows that information availability is increasing more rapidly than the ability to absorb it Due to effectiveness of electronic access systems, too much “relevant” information is identified Need the right information at the right time With technological advances, the problem became magnified Algorithms for quality discriminations are not yet developed 10 By the 1990’s, the Paradigm in STI Began to Shift Basic changes in structure of science itself Increasing commercialization and marketability of science Advances in technology Growth in volume of information Lack of coordinated information policy and federal STI leadership 11 Basic Changes in Structure of Science Itself (1) Disciplinary, mission, problem-solving orientations Blurring boundaries among science, social science, politics, marketing, ethics Specialization but linkages New tools and new methods -- computational sciences 12 Basic Changes in Structure of Science Itself (2) Big science, international science Increase in commercialization and marketability of science Shift from military to industrial R&D Global politics Changing user expectations Computer literacy of next generation User community initiatives 13 Question of the Month on Internet (1993): In what month will the number of bytes run through the Internet exceed the number of atoms in the Universe? 14 Electronic World Was Not a Linear Extension of Print Print world talks of “food for thought” Networked world users will be the prey and information is the predator - Paul Peters 3rd NASA Foreign Acquisitions Workshop 1993 15 By 1999, We’re Reinventing our Roles Agency STI programs redefining missions Budget allocations are problematic -- again Publishers are questioning primary/secondary relationships Authors are experimenting User expectations are changing – the digital native Madison Avenue/Wall Street are “IN” 16 CENDI STI Definition STI is information that derives from … research, development, and deployment (RD&D) results of the efforts of scientists and engineers and individuals supporting their work. STI includes new theory and information obtained from experimentation, observation, instrumentation, or computation in the form of text, numeric data, or images. STI may be further transformed, described, evaluated, synthesized, and recorded in print, micrographic, magnetic, optical, or other media to enhance its communication and its usefulness and value to a wide spectrum of users and uses. 17 Rip Van Winkle opening ones eyes 50 years later Technological Change Types and Volume of Content Interoperability on steroids --everything is a bit 18 Rapidly advancing technologies have opened new opportunities “Five years ago, right after the start of the personal computer revolution, industry experts observed that if the automobile business had developed like the computer business, a Rolls Royce would cost $2.75 and go 3 million miles on a gallon of gasoline.” -- Fortune Magazine August 1, 1998, p. 4 19 Information Life Cycle 20 1973 First mobile phone 1990 1990 Internet Web 3.0 Social Software Web 2.0 Search 2010 iPad 2006 • Twitter • Cloud Computing 2001 Wikipedia Commercialization 1980 2004 • Facebook • Flickr 1995 • Client Servers • http / www Decentralization Digitization 1970 1960 Networking Online C H A N G E 1960 KWIC/ KWOC 1980 • Personal Computers • Desktop Publishing Computerization of Publishing Technologies Over Time Change is happening faster! 2000 2010 1999 • Blogs • Dot com companies 21 2005 RSS feeds 2003 • Google • LinkedIn • Second Life 2012 2011 Watson 2008 Popularization of Smart phones Who’s a publisher? The Information Industry is Transforming Commercial businesses Institutional depositories Individual web sites Individuals (blogs) Preprint servers/networks 22 The www: Fastest growing new publishing media of all time and medium of first resort Surface Web Static, publicly available Deep Web Dynamic, database driven But the Key To Access Is Not Search, It’s Find Not the envelope but the information 23 Next Generation Science, Government and Information STI is again in focus for the 21st century Open access; Open government; management of scientific data American Competes; Scientific integrity memo; PCAST report 12/10 "Designing a Digital Future: Federally Funded Research and Development in Networking and Information Technology” “Our Sputnik moment” – President Obama State of the Union “Maintaining our leadership in research and technology is crucial …we must invest in innovation, education, and infrastructure….” 24 The Current Landscape The products of science and the starting point for new research are increasingly digital and increasingly “borndigital”; Exploding volumes and rising demand for data use are driven by the rapid pace of digital technology innovations; All sectors of society are stakeholders in digital preservation and access; and A comprehensive framework for cooperation and coordination to manage the risks to preservation of digital data is missing. Science is DATA INTENSIVE – The Fourth Paradigm – infer meaning from data 25 Key Background to Where We Are Today* * There are many reports that cover scientific data. These show a direct lineage to national policy 26 The Drivers: Why Do We Care? “Empowered by an array of new digital technologies, science in the 21st century will be conducted in a fully digital world. In this world, the power of digital information to catalyze progress is limited only by the power of the human mind. Data are not consumed by the ideas and innovations they spark but are an endless fuel for creativity. A few bits, well found, can drive a giant leap of creativity. The power of a data set is amplified by ingenuity through applications unimagined by the authors and distant from the original field.” --Harnessing Report Technology is enabling -- Push Data Intensive Science is the future -- Pull 27 What’s the Current Challenge? Complexity: Heterogeneity and Volume 10s 100s 1,000s iPod Shuffle (up to 120 songs) = 512 MegaBytes 1 small novel = 1 MegaByte Reference collections •managed for long-term use by many users Printed materials in the Library of Congress = 10 TeraBytes Resource or community database collections, •intermediate in duration, standardization, and community of users; Research database collections Atmospheric Radiation Measurement Program (ARM) Data Archive = 41 TeraBytes •specific to a single investigator or research project; Kilo 103 Mega 106 Giga 109 Tera 1012 Peta 1015 Exa 1018 Yatta 1024 Large Hadon Collider (LHC) = 15 PetaBytes annually All worldwide information in one year = 5 ExaBytes 28 Volume: How big is five exabytes? 5 exabytes of new information in 2002 Print, film, magnetic and optical storage media 92% on magnetic media If digitized, the 19M books and other print collections in the Library of Congress would contain about 10TB of information 5 exabytes is equivalent in size to ½M new libraries the size of the LC print collections. file://///Herald/www/research/projects/how-much-info-2003/execsum.htm (1of14) [10/30/2003 4:11:03 PM] 29 IWGDD Vision and Strategy – Top Down Vision: We envision a digital scientific data universe in which data creation, collection, documentation, analysis, preservation, and dissemination can be appropriately, reliably, and readily managed. This will enhance the return on our nation’s research and development investment by ensuring that digital data realize their full potential as catalysts for progress in our global information society. Strategy: Create a comprehensive framework of transparent, evolvable, extensible policies and management and organizational structures that provide reliable, effective access to the full spectrum of public digital scientific data. 30 CENDI Vision – Bottoms Up Grand Challenge: Information Fuels Discovery And Innovation Data-intensive science, STEM education, and workplace innovation will be fueled by a transformed Federal science and technology knowledge infrastructure – the nextgeneration Science.gov - to stimulate economic health, create jobs, and strengthen U.S. competitiveness in the global marketplace. Through the i-Science2Jobs interagency initiative, the next-generation knowledge platform will provide data, tools, and services for scientific and economic progress. Science.gov is the gateway to Federal science information and research results from 18 scientific and technical organizations in 14 federal agencies. 31 Related Experiments Are Going On: A Department of Energy Approach Data Sets STTR Gray Literature Documents Data Citation TG OSTI Metadata and Data Repository DOI 32 Interagency Case Study Oak Ridge National Laboratory Major Center for Environmental Scientific Data Management responsible for archiving, managing, and distributing data for enabling the distribution, use, and analysis of this data. Data Repositories Atmospheric Radiation Measurement Archive Carbon Dioxide Information and Analysis Center ORNL Distributed Active Archive Center Mercury Metadata Repository Brazil 33 Project Office LBA-Hydrometeorology CDIAC Interactive Science Publishing: A Joint OSA-NLM Project To evaluate the educational value of ISP used within actual scholarly journal articles To explore the problems of archiving this medium To develop an interactive software and curated database infrastructure “Interactive Science Publishing” To give authors the ability to submit their own databases and ISP-enables figures in actual peer-reviewed journal articles To give readers and editors the ability to view, analyze, and interact with source data published in conjunction with an article 34 Making Data More Accessible Policy, Culture and Management National Policy -government taxpayers funded projects should be accessible Enhanced metadata Journals supporting links to some published data sets “People getting the message that data has to be accessible.” Increased involvement of libraries and lifecycle management of data Younger generation post data as they go – expectation that data should be shared Technology Trends and Applications Digital object management technology Growth of scientific workflow software Adaptation of “netcentric” way of doing business Use of embedded links in publications Increased number of portals serving data sets 35 Key Policy Initiatives Agencies should lay foundations for digital scientific data policy and make it available Agencies should promote data management planning including a DM Plan for all proposals generating data Create an NSTC subcommittee for continuing cooperation and coordination. 36 Workshop Objectives Address two key objectives of the IWGDD Harnessing Report Agencies should have a data policy Projects should have data management plans Bring in the Practitioner perspective Policy analysts, operational user, researcher, science manager + data manager Share experience and best practices 87 people from 25 agencies attended 37 What’s the Future? Apple’s Knowledge Navigator (1989) IBM’s Watson (2011) Star Trek Computer (Stardate 8128) 38 Coffee, tea, or battery? Or Questions? Point of Contact: Bonnie C. Carroll President, IIa 104 Union Valley Rd. Oak Ridge, TN 37830 (865) 298-1220 [email protected] www.iiaweb.com This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/ or send a letter to Creative Commons, 171 Second Street, Suite 300, San Francisco, California, 94105, USA. 39