StEPS at EIA—Where We Are Now Paula Weir and Sue Harris Energy Information Administration, U.S.
Download ReportTranscript StEPS at EIA—Where We Are Now Paula Weir and Sue Harris Energy Information Administration, U.S.
StEPS at EIA—Where We Are Now Paula Weir and Sue Harris Energy Information Administration, U.S. Department of Energy ICES3 Topic Contributed Session: Generalized Survey Processing Systems—Part II Presentation Outline • Background and reason for implementing a generalized system • First issue of getting data into StEPS • StePs modules—changes made and how used • Modifications to other systems and creation of new systems needed • Summary and future work Background • In 2002 EIA acquired StEPS from the U.S. Bureau of Census • Low cost solution for replacing legacy systems • Since then 18 surveys have been migrated to StEPS • Differences in survey requirements and business process have resulted in significant customization • Challenges faced in implementing a generalized system designed with other surveys and processes in mind Getting Data into StEPS • • • • • • • • Process for creating batch data into the StEPS input format EIA developed the Data Collection Module (DCM) Originally intended to be a generalized system DCM provides the identification of the respondents expected to report All reported data are keyed or passed through the DCM The DCM performs multiple frontend tasks Process original responses and resubmissions. Generate reports for organizing and tracking information regarding survey processing (case control) StEPs Modules • This processing system includes: a) modules for specifying parameters for the specific users and survey; b) modules for data collection activities including mailing, receipt and check-in; as well as, c) modules for post collection such as editing, imputation, and estimation. Customization of StEPS • Evolutionary process • Original vision was to remain compatible with future versions and upgrades from Census • As a result of differences in survey and business processes, the vision changed • Example: analysts work on multiple surveys and time periods, so EIA created a StEPS Menu Interface screen • EIA Tools Defining STePS Edits • StEPS has 7 edit rules types: 1) required data item test; 2) range test; 3) list directed test; 4) skip pattern validation test; 5) balance test; 6) survey rule test; and 7) negative test. • The use of the edit types varied within the three groups of EIA surveys. • Many of the edit rules are common to many of the data cells, so a “wildcard” feature was developed to apply the edit to cells selected through a drop down. • Performance issues and resolution approaches • Enhanced roster functionality Imputation Definition • Imputation implemented differently within the three survey groups • Simple imputation vs. general imputation • Calculation of impute values within StEPS vs brought in as auxiliary file • Imputation as a rollover function • Major change initiative: Imputation in roster surveys Review and Correction • View reported data, edit flags and resolution, view notes • Override flag created • Failures highlighted in red • Notes functionality enhanced • Roster item view and correction screens created • Mass correction function for control data Control Information • Master Control and Control Information • Latest action • Ghosting and successor ID • Unghosting to preserve ID as of the time data are reported • Relationship to other systems and processes Other Existing Systems and Creation of New Systems • Historical respondent level database--SIS • Master Frame FileIntegrated Frame System • OHUB—temporary holding area of control and survey data, job status, other frame and sample information Overview of Production Processes and Systems NGPS (Natural Gas Publication System) NG Weekly System External Data XLS (191) PC SAS; XLS DQRS Analyst Query Unformatted email FAX EDX (Energy Data Exchange) Data Warehouse CSAF Dissemination StEPS NG -------------------------------StEPS Reserves -------------------------------StEPS PD DCM Master Frame File System SIS PD only CSAF OHUB Cubes Aggregate Data Repository (ADR) Oil and Gas Information Resource System (OGIRS) OHUB jobs PEDRO EDES PC SAS; XLS XLS Graphicr Inteface PD Weekly System XLS System previously in place Systems developed for StEPS migration Summary and Future Work • StEPS: one box in an overall process and flow • Low cost alternative but did not replace all the functionality in the legacy system • New systems and interfaces had to be built and other systems modified • Main problems: lack of knowledge on effective implementation (training and resource issues); differences in processes and work flow—language barrier; integration with other systems; independent treatment of separate but dependent surveys; dimensionality of EIA surveys • Quick fixes and survey specific solutions vs integrated generalized solution for implementation • Short run: 3 more surveys; Longer run: focus on edit/impute/estimate in StEPS, graphical interface and outlier detection, revisit process flow, integration of separate but related surveys, upcoming forms modifications, Red Hat?