FEDERAL STATE STATISTICS SERVICE (ROSSTAT)

Unified information system of statistical data collection, processing, storage and dissemination (Rosstat UIS)

Overview

Elena Priakhina, Director of the Scientific-Research and Project Institute of Rosstat, Moscow

FEDERAL STATE STATISTICS SERVICE STRUCTURE

• ROSSTAT CENTRAL OFFICE
• ROSSTAT MAIN COMPUTING CENTER
• RESEARCH & PROJECT INSTITUTE
• TERRITORIAL STATE STATISTICS OFFICES (TSO)
• DISTRICT STATISTICAL DIVISIONS (DSD)

FEDERAL STATE STATISTICS SERVICE STRUCTURE

Rosstat has a distributed hierarchical structure that includes:

At the federal level:
• Rosstat Central Office, responsible for methodology and for presenting official statistical information to the government, ministries, agencies and other users
• Main Computing Center, responsible for collecting data from the regional level, processing it and presenting it to the Rosstat Central Office
• Scientific-Research and Project Institute, which develops application software for all stages of the technological process

At the regional level:
• Territorial statistical offices (TSO), located in the regions of the Russian Federation, ensure statistical data collection, processing, storage and dissemination at the regional level

At the district level:
• District (town) statistical divisions of the TSO (DSD) are engaged mainly in collecting statistical data from enterprises and organizations and presenting it to the regional level

Regional level: 83 territorial statistical offices (TSO)
District level: more than 2000 district (town) statistical divisions of the TSO (DSD)

[Map: regions of the Russian Federation, with a numbered legend identifying the smaller regions]
Map: Copyright (c) Goskomstat of Russia, 2004

Russian Statistics today

• The official Federal Plan of Statistical Works contains more than 400 statistical works
• More than 20 000 statistical indicators
• Rosstat generates 60% of the total volume of statistical information in Russia
• 250 statistical forms (questionnaires), including some 120 annual, 60 quarterly and 60 monthly
• The number of units under statistical observation is more than 3 million

Unified information system of statistical data collection, processing, storage and dissemination (Rosstat UIS)

Main goals
• Shorter data collection and processing periods
• Lower labor costs
• Implementation of modern information technology

Basic principles
• Unification
• Adaptability

Problems and Solutions

Problem: Various applications for 400 statistical works
Solution: Single unified software for all statistical works

Problem: Lack of metadata that would let the system adjust to changes in questionnaires
Solution: Common metadata concept and rules for describing metadata for collection, processing and dissemination

Problem: Different versions of classifications…
Solution: Unified system of classifications

Problem: Different technologies and user interfaces for statistical works
Solution: Unified technology and user interface for all works

Main information flows of the system of statistical data collection, processing, storage and dissemination

[Diagram. Levels: enterprises and organisations, district level, regional level, federal level. Labels: electronic questionnaires (WEB), paper questionnaires, SDCP of district level, SDCP of regional level, SDCP of federal level, warehouse (twice), CSI, GU, users]

Architecture of the system of statistical data collection, processing, storage and dissemination (Rosstat UIS)

[Diagram. Levels: federal level, regional (district) level, enterprise level. Subsystems: the subsystem of classifications; the subsystem of data collection and processing (SDCP); the subsystem of storage and dissemination. Other labels: ETL, DWFL, DWRL, CSI, GU, SSESQ, CSDB, RSDB, CWS, SDCP (federal, district), unified system of user access, WEB-forms data input, input forms on paper, data input application, on-line, off-line, e-mail, from file, from paper]

• Uploaded data must correspond to indicators of the CSI
• The data warehouse and ETL allow information to be uploaded even when input forms change

Subsystem of classifications

• Database of classifications
• Database of enterprises and organizations
• Database of statistical indicators

The subsystem of data collection and processing

• Unified software application for all statistical works
• Metadata base
• Operational databases for statistical works
____________________________________
• DBMS: MS SQL Server
• Client-server application based on PowerBuilder
• Web application based on ASP.NET

[Diagram: Unified System «STATEC». Users: respondents; operators of the data collection and processing subsystem; economists and analysts. Components: component for electronic data collection from enterprises; SDCP based on «STATEC» (component for reception (input), verification, processing and table generation); unified business logic for all statistical works; operational databases of statistical works (MS SQL Server); output tables and reports]

The subsystem of statistical information storage, presentation and dissemination

• The data warehouse (DWH) of statistical data has the same structure at the federal and regional levels
• The DWH structure is based on the Catalog of statistical indicators (CSI)
• Data marts are generated from the DWH for different purposes (data dissemination, analysis of selected branches and sectors of the economy)
• Data is disseminated via the web interface of the Central statistical data base (CSDB) on the Rosstat web site
• A unified system for internal user access to the DWH of statistical data is currently being developed
_________________________________
• DBMS: MS SQL Server
• Applications are based on PowerBuilder and ASP.NET

SCREENSHOTS OF SUBSYSTEM

Statistical reports collection in electronic mode

The respondent:
• logs in to the system
• selects a statistical form
• fills in the form (makes a report)
• verifies the report
• exports the report to MS Word or Adobe PDF
• signs the report with a digital signature before sending
• saves and sends the report
• gets the stamp of sending

Statistical data processing

[Screenshot labels: Work 1, Work 2, Work 3; workplace for data entry; workplace for verification; workplace for table generation]

Basic stages
• Data input
• Data control
• Data processing (aggregation)
• Printing output tables

Storage and presentation of statistical data

• Select a category of the data: macro or micro data.
• Select indicators from the DW catalogue. The catalogue of indicators corresponds to the CSI. An indicator can be found by name or code.
• Select values of dimensions. The set of dimensions depends on the selected indicators. Values of dimensions have a linear or hierarchical structure. A value of a dimension can be found by name or code in the dictionary.
• Build the model of the output table. Data arrangement in the output table depends on the arrangement of dimensions and measure in the model.
• Get the result: printing from the browser, MS Excel, XML (SDMX).

Results

• The unified system of data collection and processing based on «STATEC» is used for 400 statistical works
• Collection of statistical reports in electronic mode has been implemented for 20% of reporting units
• The statistical data warehouse is being loaded with the respective data

THANK YOU FOR YOUR ATTENTION!
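
Illustration (not part of the original slides): the basic processing stages named above (data input, data control, aggregation, output tables) can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not the «STATEC» implementation, which according to the slides is built on PowerBuilder, ASP.NET and MS SQL Server. The Report structure, the CATALOGUE set standing in for the CSI, the indicator codes and the control rule are all hypothetical.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Report:
    """One respondent report for a single indicator (hypothetical structure)."""
    unit_id: str     # reporting unit (enterprise or organization)
    region: str      # region of the reporting unit
    indicator: str   # indicator code, expected to exist in the catalogue
    value: float


# Hypothetical indicator catalogue standing in for the CSI.
CATALOGUE = {"IND001", "IND002"}


def control(reports):
    """Data control: split reports into accepted and rejected ones."""
    accepted, rejected = [], []
    for r in reports:
        # Assumed rule: known indicator code and non-negative value.
        ok = r.indicator in CATALOGUE and r.value >= 0
        (accepted if ok else rejected).append(r)
    return accepted, rejected


def aggregate(reports):
    """Data processing: sum values per (region, indicator)."""
    totals = defaultdict(float)
    for r in reports:
        totals[(r.region, r.indicator)] += r.value
    return dict(totals)


def output_table(totals):
    """Output table: one tab-separated row per region and indicator."""
    return [f"{region}\t{indicator}\t{total:g}"
            for (region, indicator), total in sorted(totals.items())]


# Data input: made-up sample reports for the sketch.
reports = [
    Report("U1", "Region A", "IND001", 10.0),
    Report("U2", "Region A", "IND001", 5.5),
    Report("U3", "Region B", "IND001", 7.0),
    Report("U4", "Region B", "XXX", 1.0),  # unknown indicator, rejected
]
accepted, rejected = control(reports)
table = output_table(aggregate(accepted))
print("\n".join(table))
```

Keeping control separate from aggregation mirrors the separate workplaces for data entry, verification and table generation: rejected reports can be returned for correction without affecting the totals.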