Remarks on Education and the Grid Geoffrey Fox Professor of Computer Science, Informatics, Physics Pervasive Technology Laboratories Indiana University Bloomington IN 47401 [email protected] http://www.infomall.org http://www.grid2002.org.
Download ReportTranscript Remarks on Education and the Grid Geoffrey Fox Professor of Computer Science, Informatics, Physics Pervasive Technology Laboratories Indiana University Bloomington IN 47401 [email protected] http://www.infomall.org http://www.grid2002.org.
Remarks on Education and the Grid Geoffrey Fox Professor of Computer Science, Informatics, Physics Pervasive Technology Laboratories Indiana University Bloomington IN 47401 [email protected] http://www.infomall.org http://www.grid2002.org Grid Computing: Making The Global Infrastructure a Reality Based on work done in preparing book edited with Fran Berman (director NPACI) and Anthony J.G. Hey (leader of core UK e-Science program), ISBN: 0-470-85319-0 Hardcover 1080 Pages Published March 2003 http://www.grid2002.org Last chapter is on education and the Grid Who is Geoffrey Fox? Undergraduate degree in math, PhD Theoretical Physics at Cambridge University Theory, Experiment, Computation, Phenomenology of particle physics Caltech for 20 years • Worked with Feynman, Hey, Wolfram Dean for educational computing and associate provost for computing Caltech; Professor of Physics; department chair Developed parallel computers for science Computer Science Syracuse, Florida State, Indiana • Main area of research last 20 years Interdisciplinary work in computational science with many fields – Earth Science/Biology at moment Chief technologist Anabas corporation (WebEx done right) • Technology for distance education on the Grid • Will teach class from Indiana to Jackson State State next semester Informatics, Computer Science, Physics at Indiana • Pervasive Technology Lab Information technology initiative at Indiana University funded by Lilly • Director Community Grids Laboratory e-Business e-Science and the Grid e-Business captures an emerging view of corporations as dynamic virtual organizations linking employees, customers and stakeholders across the world. • The growing use of outsourcing is one example e-Science is the similar vision for scientific research with international participation in large accelerators, satellites or distributed gene analyses. The Grid integrates the best of the Web, traditional enterprise software, high performance computing and Peerto-peer systems to provide the information technology infrastructure for e-moreorlessanything. A deluge of data of unprecedented and inevitable size must be managed and understood. People, computers, data and instruments must be linked. On demand assignment of experts, computers, networks and storage resources must be supported So what is a Grid? Supporting human decision making with a network of at least four large computers, perhaps six or eight small computers, and a great assortment of disc files and magnetic tape units not to mention remote consoles and teletype stations - all churning away. (Licklider 1960) Coordinated resource sharing and problem solving in dynamic multi-institutional virtual organizations Infrastructure that will provide us with the ability to dynamically link together resources as an ensemble to support the execution of large-scale, resource-intensive, and distributed applications. Realizing thirty year dream of science fiction writers that have spun yarns featuring worldwide networks of interconnected computers that behave as a single entity. e-Science e-Science is about global collaboration in key areas of science, and the next generation of infrastructure that will enable it. This is a major UK Program e-Science reflects growing importance of international laboratories, satellites and sensors and their integrated analysis by distributed teams CyberInfrastructure is the analogous US initiative Grid Technology supports e-Science and DATA ACQUISITION CyberInfrastructure ADVANCED VISUALIZATION ,ANALYSIS QuickTime™ and a decompressor are needed to see this picture. IMAGING INSTRUMENTS COMPUTATIONAL RESOURCES LARGE-SCALE DATABASES Classic Grid Architecture Resources Database Database Composition Content Access Netsolve Security Collaboration Middle Tier Brokers Service Providers Computing Middle Tier becomes Web Services Clients Users and Devices e-Business and (Virtual) Organizations Enterprise Grid supports information system for an organization; includes “university computer center”, “(digital) library”, sales, marketing, manufacturing … Outsourcing Grid links different parts of an enterprise together (Gridsourcing) • Manufacturing plants with designers • Animators with electronic game or film designers and producers • Coaches with aspiring players (e-NCAA or e-NFL etc.) Customer Grid links businesses and their customers as in many web sites such as amazon.com e-Multimedia can use secure peer-to-peer Grids to link creators, distributors and consumers of digital music, games and films respecting rights Distance education Grid links teacher at one place, students all over the place, mentors and graders; shared curriculum, homework, live classes … Some Important Styles of Grids Computational Grids were origin of concepts and link computers across the globe – high latency stops this from being used as parallel machine Knowledge and Information Grids link sensors and information repositories as in Virtual Observatories or BioInformatics • More detail on next slide Community Grids focus on Grids involving large numbers of peers rather than focusing on linking major resources – links Grid and Peer-to-peer network concepts Semantic Grid links Grid, and AI community with Semantic web (ontology/meta-data enriched resources) and Agent concepts Peers Database Database Service Facing Web Service Interfaces Event/ Message Brokers Event/ Message Brokers Event/ Message Brokers Peer to Peer Grid Peers User Facing Web Service Interfaces A democratic organization Peer to Peer Grid Information/Knowledge Grids Distributed (10’s to 1000’s) of data sources (instruments, file systems, curated databases …) Data Deluge: 1 (now) to 100’s petabytes/year (2012) • Moore’s law for Sensors Possible filters assigned dynamically (on-demand) • Run image processing algorithm on telescope image • Run Gene sequencing algorithm on compiled data Needs decision support front end with “what-if” simulations Metadata (provenance) critical to annotate data Integrate across experiments as in multi-wavelength astronomy Data Deluge comes from pixels/year available SERVOGrid – Solid Earth Research Virtual Observatory will link Australia, Japan, USA …… Repositories Federated Databases Database Sensor Nets Streaming Data Database Grids for Geoscience Analysis and Visualization Loosely Coupled Filters Closely Coupled Compute Nodes SERVOGrid Requirements Seamless Access to Data repositories and large scale computers Integration of multiple data sources including sensors, databases, file systems with analysis system • Including filtered OGSA-DAI (Grid database access) Rich meta-data generation and access with SERVOGrid specific Schema extending openGIS (Geography as a Web service) standards and using Semantic Grid Portals with component model for user interfaces and web control of all capabilities Collaboration to support world-wide work Basic Grid tools: workflow and notification Virtual Observatory Astronomy Grid Integrate Experiments Radio Far-Infrared Visible Dust Map Visible + X-ray Galaxy Density Map Grids in a Nutshell Grids are by definition the best of HPCC, Web Services, Agents, Distributed Objects, Peer-to-peer networks, Collaborative environments Grid applications are typically zero or one very large supercomputers, lots of conventional machines, with unlimited data and/or people supporting an electronic (virtual) community • Data sources and people are latency tolerant … • Multiple supercomputers (or clusters) on same Grid as in TeraGrid/ETF largely for sharing of data and by people Grids are supported by Global Grid Forum, W3C, OASIS … setting standards Grids are a “service oriented architecture” hiding irrelevant details • Services are electronic resources communicating by messages • Message based architecture gives scalable loosely coupled component model A typical Web Service In principle, services can be in any language (Fortran .. Java .. Perl .. Python) and the interfaces can be method calls, Java RMI Messages, CGI Web invocations, totally compiled away (inlining) The simplest implementations involve XML messages (SOAP) and programs written in net friendly languages like Java and Python Web Services WSDL interfaces Portal Service Security WSDL interfaces Web Services Payment Credit Card Catalog Warehouse Shipping control Typical Grid Architecture Re-use Application Customization User Services Application Service Portal Services Application Service Libraries Application Service Middleware Re-use “Core” Grid System Services System Services System Services Raw (HPC) Resources Database What does Community Grids Lab do? New curricula for Grids (next semester) taught using Grid technology Underlying Grid messaging technology supporting reliable communication Grid Portals built with re-usable components (portlets) Application Grids for Earth Science/Biocomplexity/supercomputer users Integration of handheld devices with the Grid Audio-video conferencing and collaboration with a Grid architecture Multimedia on the Grid; http://www.undergroundfilm.org Why Grids for Education Education is a classic distributed organization New multi-disciplinary curricula in emerging fields (information technology, informatics, biocomplexity, nanotechnology) require distributed experts interacting with mentors and students Grids support component model for content allowing re-use of research services in education Education fits the service model for both “process” and “Curriculum” Education wants rich integration of data sources and people and some computing – Grids can do this Grids democratize resources as enable universal (ubiqitous) access • In terms of geographic distribution, level (K-12 through lifelong learning) and disparate clients Grids support the Internet generation WebCT, Blackboard, Placeware, WebEx, Groove, Learning Management systems all have natural Grid implementations SERVOGrid for Education Content Field Trip Data Repositories Federated Databases Database Sensors Streaming Data Database Sharing Grid Services ? Loosely Coupled Filters Discovery Services Analysis and Visualization Coarse grain simulations Grid Learning Model Education and Research Grids share some services both for content and “process” • For example collaboration services are largely identical • Research will use much larger simulation engines to get high resolution results • Maybe a researcher uses a CAVE to visualize; education a Macintosh But both can share data services but run through different filters to select for precision (research) or pedagogical value (education) Education has “digital textbook” frontend to resources of the research Grid Both use same workflow technologies to link services together Grid Services for the Education Process “Learning Object” XML standards already exist Registration Performance (grading) Authoring of Curriculum Online laboratories for real and virtual instruments Homework submission Quizzes of various types (multiple choice, random parameters) Assessment data access and analysis Synchronous Delivery of Curricula including Audio/Video Conferencing and other synchronous collaborative tools as Web Services Scheduling of courses and mentoring sessions Asynchronous access, data-mining and knowledge discovery Learning Plan agents to guide students and teachers Implementing Grids for Education I Need to design a service architecture for education • Build on services from broader fields • Need some specific EducationML specifying services and properties Note IMS (http://www.imsproject.org/) and ADL have a lot of education property metadata but no services • Need more use of standards outside education but much of IMS can be used Use services where-ever possible but only if “coarse-grain” Closely coupled Java/Python … Module B Module A Method Calls .001 to 1 millisecond Coarse Grain Service Model Service B Messages Service A 0.1 to 1000 millisecond latency Implementing Grids for Education II Build a Education Grid prototype addressing content and process • Focus education grid on a curriculum area (using Grids!) such as Geoscience or even e-Science/Information Technology/Science Informatics Re-use Grid services in systems area (portals, security, collaboration ..) and from application domain • What research Grid services can be re-used; what need to be significantly changed or customized • Develop some “Education process” services Supply leadership in use of CyberInfrastructure/Grids in education • Feed Education needs to CyberInfrastructure and vice-versa Perform a requirement analysis analogous to Gap Analysis http://grids.ucs.indiana.edu/ptliupages/publications/GapAnalysis30June03v2.pdf Develop curriculum in Grids, e-Science and CyberInfrastructure Conclusions Grids/CyberInfrastructures are inevitable and pervasive Education can benefit from Grids and vice-versa Can expect Web Services, P2P networks and Grids to merge with a common set of general principles but different implementations with different scaling and functionality trade-offs We will be flooded with data, information and purported knowledge One should be preparing Grid strategies; understanding relevant Web and Grid standards and developing new domain specific standards Note many existing (standards) efforts assume client-server and not a brokered service model; these will need to change! Enough is known that one can start today with prototypes Discussion The following additional points were made in discussion Connect to Citizen Science as at http://www.ebird.org and projects like SETI@Home Role of Science museums on a Grid Role of Grid services to provide knowledge transformations for education or research • Use “resources-on-demand” and so knowledge broadly available if resources supplied as a national facility Does Grid technology enhance or mitigate the digital divide Need to explain role of teachers in an Education Grid and train them to take it Explain difference between Internet and the Grid Describe role of Access Grid type technology Some new textbook’s have embedded URL’s