Transcript Document
Evolution of a Clearinghouse: From FTP to Map Services ESRI International Users Conference, August 10th, 2006 Maurie Kelly, Ryan Baxter, David Walrath Pennsylvania Spatial Data Access www.pasda.psu.edu In this presentation…. • Historical background of clearinghouse •Past and current challenges and initiatives •Related projects and experience FTP & Metadata Relational Database Development Map Services & Customization Tools www.pasda.psu.edu PASDA … • Is the official public geospatial data clearinghouse for Pennsylvania PASDA is the Commonwealth’s node on the NSDI, participates in GOS, the Geography Network, and the NBII Serves as a resource for locating data throughout PA Provides widespread sharing of Geospatial data Eliminates the creation of redundant data sets PASDA … •Established in 1996 to serve PA DEP data (35 data sets) via FTP • Expanded in 1999 to officially serve as the public geospatial data clearinghouse for PA A collaborative effort of: • PA Geospatial Technologies Office •Penn State Institutes of the Environment •State agencies, programs such as PAMAP, and data partners throughout the region PASDA… •Provides access to about 50,000 data files •Celebrates its 10th anniversary this year and is one of the US’ largest and most established clearinghouses • Offices are located on PSU UP Campus PASDA PASDA… • Our services include: Data Storage and Access Users Data & Metad ata PASDA Metadata Outreach Training & Assistance Planar Distance Units:meters Geodetic Model: Horizontal Datum Name:North American Planar Distance Units:meters Datum ofModel: 1927 Geodetic Ellipsoid Name:Clarke Horizontal Datum Name:North American Planar Distance Units:meters Datum ofModel: 1927 Geodetic Ellipsoid Name:Clarke Horizontal Datum Name:North American Planar Distance Units:meters Datum ofModel: 1927 Geodetic Ellipsoid Name:Clarke Horizontal Datum Name:North American Planar Distance Units:meters Datum ofModel: 1927 Geodetic Ellipsoid Name:Clarke Horizontal Datum Name:North American Datum of 1927 Ellipsoid Name:Clarke •PASDA info. •GIS tutorials How PASDA Works Data Partners -Data Collection -Mapping -Documentation PASDA Staff -Storage -Management -Assistance Distribution Via the Web -Retrieval by user Benefit to Providers: -Mechanism for data distribution -Publicity -Provide service to PA Benefit to Users: -Can access & use valuable data -Avoid duplication of effort -FREE PASDA: Data Cataloging Description: Originator:U.S. Environmental Protection Agency/Office of Water/OST Basins Abstract:The U.S. Environmental Protection Agency's (EPA) Reach Files are a series of hydrographic databases of the surface waters of the continental United States … Time Period of Content: Metadata… Time Period Information: Single Date/Time: Calendar Date:1994 Currentness Reference:publication date Status: Progress:Complete Maintenance and Update Frequency:None planned Spatial Domain: Bounding Coordinates: Title:USEPA/OW River Reach File 3 (RF3) Alpha for CONUS, Hawaii, Puerto Rico, and the Indirect Spatial U.S. Reference:English-Salmon Virgin Islands - New West Bounding Coordinate:-74.739 East Bounding Coordinate:-73.652 North Bounding Coordinate:45.156 South Bounding Coordinate:44.496 Keywords: Theme: Theme Keyword Thesaurus:None Info about the data Theme Keyword:RF3 alpha Hydrography River Reach streams stream network rivers hydrography ArcInfo navigation drainage Place: Place Keyword Thesaurus:Geographic Names Information System Place Keyword:New York NY New York English-Salmon Access Constraints:none Use Constraints:none Calendar Date:1994 Point of Contact: York Contact Information: Contact Organization Primary: Contact Organization:U.S. Environmental Protection Agency Contact Person:Dan Parker Direct Spatial Reference Method:Vector Contact Address: Address Type:mailing and physical address Address:401 M Street SW (4503F) City:Washington State or Province:DC Postal Code:20460 Country:United States Contact Voice Telephone:1-800-424-9067 Data Set Credit:McKay, Lucinda; Sue Hanson; Robert Horn; Richard Dulaney; Alan Cahoon; Mark Olsen; and Thomas Dewald, 1994. The U.S. EPA Reach File Version 3.0 Alpha Release (RF3-Alpha) Technical Reference. U.S. Environmental Protection Agency, Washington, DC Spatial Domain: Bounding Coordinates: West Bounding Coordinate:-74.739 East Bounding Coordinate:-73.652 Planar Distance Units:meters Geodetic Model: Horizontal Datum Name:North American Datum of 1927 Ellipsoid Name:Clarke 1866 Types of data on PASDA •Vector and Raster Data •DEMs • DLGs • DRGs • DOQQs • PAMAP •River Conservation Plans •Census Data •Land Use/Land Cover •Wetlands •Historic Markers •Fish Species PASDA Statistics – End of 2005 • Data Storage • 6.5 Terabytes of data • 3 Terabytes in database for interactive mapping/interactive download • Access (during last 12 months) • 11.7 million hits (from 253,000 users) • 631,000 data downloads requests • 3.9 Terabytes of data downloaded • 828,000 online maps generated via web GIS PASDA Statistics PASDA Statistics To whom are we providing assistance? First Challenge—Expand Services and Capability beyond FTP •#1 Challenge for any clearinghouse—Technology •Technology changes rapidly both for clearinghouse and for users •Changes in technology mean changes in hardware, software, and architecture of clearinghouse as well as changes in personnel and training needs. •Requires regular planning, benchmarking, and vision Needs Analysis •1999 PASDA and ATS of Lancaster undertook a needs analysis to define user requirements for role as official public geospatial data clearinghouse for PA •User requirements included: customization options, web mapping applications, tutorials and guides, more data!! Impact on Clearinghouse •Increased training needs and knowledge base for staff •Increased technology, software, and hardware needs •More users Results of Needs Analysis Initiative Strategic Planning effort undertaken and implemented in 2000 facilitated by Avencia Inc. of Philadelphia. By 2001, the following were complete •Development of first relational database using Db2 & SDE •Pennsylvania Atlas ArcIMS application •First iteration of Data Wizard customization reprojector/clipping tool By 2002… •Statewide DOQQ mosaic in database •Clipper/reprojector in regular use •Pennsylvania Atlas ArcIMS application clipping/reprojecting tool developed By 2003 Technology Changes on the horizon….. By 2003 changes in technology were on the horizon… Mid 2003 User requirements session held. Examined existing tools, data, and interface. Acquired input on changes and expectations of users— •Clearinghouse needed integrated web interface •Clearinghouse needed streamlined metadata •Clearinghouse needed integrated option for accessing imagery •Website and html needed to be streamlined •Search mechanisms needed to be integrated into single environment •Post user session—staff held strategic planning session and developed new multiyear plan to implement additional enhancements to clearinghouse. Challenge…. How to make all of the options accessible? Solution 1: Approach had been to create new tools for each need Approach had been to add links and content to site Result 1: Confusing Website Countless links & access points Challenge Part 2…. How to make all of the options accessible? Solution 2: Our approach this past year Complete redesign of the clearinghouse…search interface, metadata architecture, data wizard, integration of applications and map Services into search interface. Result 2: Simplified Website Single, integrated access point to all data types & services Current Architecture… •“Data cart” option added to Data Wizard •Users can search for, select, add data to cart, reproject/clip data, and zip all the data in their cart together into one downloadable file. •Imagery viewer and download utility created for all imagery in database Clip & Reproject WMS MapServices ArcIMS MapServices Google Earth FTP .ZIP Files WebGIS Applications NSDI – GOS Harvesting Current Architecture… How did we accomplish this? One central database table • One record for each ‘dataset’ in the clearinghouse • Basic metadata elements (title, keywords, date…) • Links to FTP files • Links to SDE database feature classes • Links to XML metadata files • Links to WebGIS application URLs • Links to ArcIMS & WMS MapServices Current Architecture… Central table drives the Website and integrates all services Website Direct Download FTP .zip Clip, Reproject & Download SDE Central Table Metadata xml MapServices xml xml WebGIS NSDI -- Geospatial One-Stop ArcIMS -- WMS --Google Earth Data Access Tools Search by keyword, county, quadrangle, etc. Download .zip files FTP View aerial photos; zoom to an address Click on a quarter-quadrangle to download it DOQQs View & Interact with data on a map Clip, reproject & download PA Atlas Map & download census 2000 data All SF1 & SF3 census variables for PA Census 2000 Live Internet Data Data Access Wizard PennCAT - Geography Network Catalog of MapServices Unified interface for all data Add Data to your ‘Data Cart’ Clip, reproject and download multiple datasets at one time View XML metadata Download via FTP Link to separate application Compile datasets for clipping & reprojecting Dynamic Content Specialized Applications Imagery Viewer and Download Utility • 4 Terabytes of imagery including aerial photos DRGs, DOQQs • 5 unique tiling schemes • Multiple years of data—spans period from 1993-2005 • All data can be queried via single interface • Search by address,quadrangle, county • Select image and provide with download options http ftp mr. Sid or tiff Specialized Applications:Imagery Viewer Specialized Applications:Imagery Viewer Specialized Applications:National Weather Service Forecast Map Services Near real time data from NWS including: Apparent Temperature, Dew Point Temperature,Maximum Temperature, Minimum Temperature, Precipitation Amount, Probability of Precipitation, Relative Humidity, Sky Cover, Snow Amount, Temperature, Wave Height, Wind Direction, and Wind Speed. Specialized Applications: Tropical Depression Ivan • The Ivan viewer was developed to provide information about flood damage following the storm to the public, emergency managers, And other stakeholders in the Commonwealth. • Ivan includes pre and post flood aerial photos, pictures taken at ground level of flood damage, and supporting framework data. Ivan Imagery Viewer Interface City Island after the flood… Pre-flood imagery of City Island Flood damage—Photo taken on site and contributed to Ivan viewer by citizen Pennsylvania Initiatives that use the clearinghouse… National Information Infrastructure Initiatives… Conclusion •Clearinghouses have evolved from simple FTP sites to complex information infrastructures •Clearinghouses serve multiple purposes—access to data •Access to services, access to applications, metadata development, and outreach. •PASDA has undergone regular strategic planning to develop new initiatives and to benchmark progress. PASDA has transitioned from a traditional structure to one that encompasses data, services, and tools for its users. •Map services and applications are now the hallmark of the clearinghouse •Free access to data supports businesses, non-profits, and other projects and programs throughout the state and nation. Questions & Comments ESRI International User Conference, August 10th, 2006 Please contact us at [email protected] if you have any questions or comments. More information is available at http://www.pasda.psu.edu/ Maurie Kelly, Ryan Baxter, David Walrath Pennsylvania Spatial Data Access www.pasda.psu.edu