Why is Digital Curation Important for Workforce and Economic Development? Alan Blatecky Office of Cyberinfrastructure Symposium on Digital Curation in the Era of Big.
Download ReportTranscript Why is Digital Curation Important for Workforce and Economic Development? Alan Blatecky Office of Cyberinfrastructure Symposium on Digital Curation in the Era of Big.
Why is Digital Curation Important for Workforce and Economic Development? Alan Blatecky Office of Cyberinfrastructure Symposium on Digital Curation in the Era of Big Data; Career Opportunities and Education Requirements NRC July 19, 2012 1 1 Framing the Challenge: Science and Society Transformed by Data Modern science Multi-disciplinary Collaborations for Complexity Data- and computeintensive Integrative, multiscale Individuals, groups, teams, communities Sea of Data Age of Observation Distributed, central repositories, sensordriven, diverse, etc 2 Data as a transforming agent Enormous amounts of data are being generated by modern experiments, sensors, observations and social networks New analysis tools including automatic extraction of new knowledge continues to accelerate Infusion of data-intensive computation into science, engineering and education is revolutionizing research Research in science and education is an essential pathway to prosperity and competitiveness and thrives in an environment of shared data Scientific Data Challenges Exa Bytes Square Kilometer Array Climate, Environment Volume/Growth Bytes per day Genomics Peta Bytes TeraGrid, Blue Waters Climate, Environment LHC LHC Tera Bytes LSST Genomics Giga Bytes Distribution Many smaller datasets… 2012 2020 Data Access 4 NSF Data strategy Establish a national data infrastructure to support science, engineering and education Ensure that this infrastructure stays at the most advanced state of sophistication and is sustainable Expand the development of the next generation of compute and data intensive workforce Develop of a suite of policies to support the full data life cycle (data access, curation, object identifiers, etc) 5 Data Web Forum The DWF will facilitate the exchange and interoperability of data across disciplines and national boundaries by producing high quality, relevant technical documents that influence the way people store, use, and manage data Linking top-down governance model with bottom-up IETF model to catalyze this community-based activity Top-down focus on policy, permission . . . Bottom-up focus on operations, services . . . Timeliness an important factor Ability to respond quickly essential DWF Principles Balanced representation of stakeholder communities Community-based; not a government organization, a regulatory body or a commercial organization Products are free and open source Meetings are public, progress through consensus and practice Focus on harmonization across standards, policies, technologies, tools, and other data infrastructure elements Proposed Timeline: 2012 Initial Government Agency funding in process Awards to Non Government Organizations in Aug/Sep Teams flesh out organization and structure – Sep/Dec Initial working groups identified and charged – Sep/Dec groups already sharing data across global boundaries Identify candidates for early deliverables, best practices Secure time-commitments to undertake proposed activities