Transcript Document

(AED, EMAP)
(AED, EMAP)
(AED, EMAP)
(AED, EMAP)
(AED, EMAP)
Data Policy and Information Management Planning
Linda Harwell
US EPA/ORD/NHEERL/GED
“The overall purpose of these
policy statements is to facilitate
full and open access and use
with confidence, both now and in
the future, of the data and
information . . .”
(Hale, et. al., June, 1999)
Piecing Together Your Plan
Objectives
Key Points





What are the policy/plan objectives?
How will the data policy support
program objectives?
What “type” of data will be
contributed?
Who is the most likely audience?
What are the partnership roles and
responsibilities?
EMAP IM Plan Objectives






Providing a data directory so that data of interest can be
identified
Providing access to data and metadata files
Assisting with database development and/or
administration
Providing support to EMAP working groups for planning,
research, monitoring, and analysis efforts so that
differences in information management environments of
the groups are minimized
Ensuring a distributed data structure, allowing
responsibility for the data to reside with the owners
Maximizing interoperability with other environmental
monitoring data systems in accordance with CENR
objectives
Supporting Program Objectives . . .
. . . Is vital to the “health” of your IM program
Data Expectations
 Data sets
 Documentation of data sets
 Statistical methods
 Methodology for evaluating data
 Reports
 Spatial data
 Other derived products
 Data ownership/stewardship
Target Audience
 Identify the primary user
 Use “simple” language
 Plan for training (i.e. workshops)
 Provide guidance
 Understand the user’s IM culture
Role and Responsibilities

Outline
What needs to be done
 Who will do it


Provide reasonable expectations
for all partners
Clear definitions help minimize
misunderstandings and delays
We’re supposed
to enter the field
data!?
What Role Are You
Willing to Play?
Just a little inconsistency here
MD00-0053
NC00-0008
XCM0159
NC00-008
MD-CB-51
NC00008
MDCB-51
NC0008 BNT00
FishT_013
NC-08-C2000
13 BTRW*
NC-08
MDT013
NC-00008
* Two separate data fields
Go Ahead and Spell It Out
Piecing Together Your Plan
Objectives
Technical
Specifications
Key Points




How should the data flow?
What hardware/software
should be used?
What tools should be made
available to partners?
What are resource needs?
EMAP IM Information Flow
Data Relationships
EMAP IM Distributed System Components
The Nuts and Bolts
Other Considerations

Software limitations
Email attachments
 Available software



Internet protocols and firewalls
Network Contraints
T1 vs. frational
 Computers and operating systems


Current and future expertise needs
National Archives
Records Admin
(EPA Directive 2100)
EMAP IM
Data Sources
Archive and Recovery Plan
Redundant Backup
Piecing Together Your Plan
Objectives
Technical
Specifications
Implementation
Recommendations and Guidelines

Quality assurance/control

Preparation and delivery of data

Data distribution for public use
Providing “tools” help ensure you get what
you need -- when you need it
QA Assessment Example
Completeness Check
Water
Survey
Expected # Quality
of Samples Visits
Sediment
Collection
Visits
Trawl
Visits
Group 1
56
54
54
39
Group 2
35
35
32
31
Group 3
37
34
2
35
Intended
Actual
QA Assessment Example
Reasonable Outliers
QA Assessment Example
Assessing Lab QA
“Raw” Vs. “Synthesized” Data
(NCADC)
Data Aggregation
Metadata Aggregation
Data
Analysis
Data Storage
(State)
Data Capture
Günther, 1998
Navigation and Info Retrieval
Metadata Collection
Transactional Vs. Reduced
One Big File Vs. Subsets of Data
Chemistry Table
Or
Sediment Chem
Tissue Chem
Water Chem
Sediment Comp
Quality Assurance
Sediment Toxicity
Results Table
Or
Sediment Toxicity
Quality Assurance
Controls
Data Transfer Format
Purpose
Required
Optional
Standards
Public Data Distribution
Data View Options
Data View Options
Data Download Format
Tab-delimited ASCII Query Result
Data Sharing
Percent Area
Oregon Small Estuaries
Sediment Toxicity Tests - A. abdita
110
100
90
80
70
60
50
40
30
20
10
0
50
60
70
80
90
100
% Control-Corrected Survival
110
Analysis Tips –
Making Friends and Influencing People


What was the primary
study question?
What are the
recommended
methods of analyses?
How else can these
data be used?
Oregon Small Estuaries
Sediment Toxicity Tests - A. abdita
Percent Area

110
100
90
80
70
60
50
40
30
20
10
0
50
60
70
80
90
100
% Control-Corrected Survival
110
Joys of Metadata
Metadata Formats
EML
FGDC
NBII
Metadata Tools
Metadata Consultants
EMAP Example
Data Directory
Data Catalog
EPA’s EIMS
Frithsen and Shepanek; 1998,2000
Piecing Together Your Plan
Objectives
Planning
for the
Future
Technical
Specifications
Implementation
What Does the Future Look Like?
What will it take to:



Continue to meet program needs
 Accommodating program evolution
How might users’ needs change
Keep up with emerging technology
 Resources
 Expertise
Evolution of EMAP
Multi-scaled, Diversified
Coordination Effort
Highly Centralized and
Compartmentalized
Staying Afloat

Be prepared to revise IM policy
and plan

Make connections with other
programs

Proactively work IM plan

Make excerpts a part of QAPPs
Information Management Plan
EaGLe
Data for the Next Generation
Coming Soon: 2003