Education and Grid Services Geoffrey Fox Professor of Computer Science, Informatics, Physics Pervasive Technology Laboratories Indiana University Bloomington IN 47404 [email protected] http://www.infomall.org http://www.grid2002.org.

Download Report

Transcript Education and Grid Services Geoffrey Fox Professor of Computer Science, Informatics, Physics Pervasive Technology Laboratories Indiana University Bloomington IN 47404 [email protected] http://www.infomall.org http://www.grid2002.org.

Education and Grid Services
Geoffrey Fox
Professor of Computer Science, Informatics, Physics
Pervasive Technology Laboratories
Indiana University Bloomington IN 47404
[email protected]
http://www.infomall.org
http://www.grid2002.org
Who is Geoffrey Fox?


Undergraduate degree in math, PhD Theoretical Physics at Cambridge
University
Theory, Experiment, Computation, Phenomenology of particle physics
Caltech for 20 years
• Worked with Feynman, Hey, Wolfram



Dean for educational computing and associate provost for computing Caltech;
Professor of Physics; department chair
Developed parallel computers for science
Computer Science Syracuse, Florida State, Indiana
• Main area of research last 20 years


Interdisciplinary work in computational science with many fields – Earth
Science/Biology at moment
Chief technologist Anabas corporation (WebEx done right)
• Technology for distance education on the Grid
• Teaching class from Indiana to Jackson State this semester

Informatics, Computer Science, Physics at Indiana
• Pervasive Technology Lab Information technology initiative at Indiana University
funded by Lilly
• Director Community Grids Laboratory
What is a MLE Managed Learning Environment



An MLE is the full range of information systems and
processes of an institution that contributes directly or
indirectly to learning and the management of learning
A Virtual Learning Environment VLE is the subset of
MLE components that provide online learning
interactions for learners and teachers
MLE Components include enrollment, security, portal,
digital library functions on learning resources, access to
administrative material, payment, attendance tracking,
authoring curriculum, learning planners, quizzes,
homework, grading, assessment, distance teaching,
computer-aided instruction, collaboration tools
Some Players with Education Grid like Capabilities


IMS and ADL in the USA have set standards for some of the
special learning metadata structures
CHEF (Michigan) and Colloquia (Bangor) are academic
groupware projects aimed at education
• Access Grid from Argonne is Audio-Video conferencing


Sakai and OKI are Mellon Foundation projects implementing
electronic learning capabilities
Blackboard and WebCT are popular (some places) academic elearning support systems
• Several inhouse efforts like OnCourse at Indiana


Docent, Topclass etc. are learning content management systems
LCMS mainly selling to corporate training market
Centra, Interwise, Placeware, WebEx, GrooveNetworks are well
known collaboration systems that might support distance
learning/tutoring and participatory education
Grids in a Nutshell




Grids are by definition the best of HPCC, Web Services, Agents,
Distributed Objects, Peer-to-peer networks, Collaborative
environments
Grid applications are typically zero or one very large
supercomputers, lots of conventional machines, with unlimited
data and/or people supporting an electronic (virtual) community
• Data sources and people are latency tolerant …
• Multiple supercomputers (or clusters) on same Grid as in
TeraGrid/ETF largely for sharing of data and by people
Grids are supported by Global Grid Forum, W3C, OASIS …
setting standards
Grids are a “service oriented architecture” hiding irrelevant
details
• Services are electronic resources communicating by messages
• Message based architecture gives scalable loosely coupled
component model
Information/Knowledge Grids


Distributed (10’s to 1000’s) of data sources (instruments,
file systems, curated databases …)
Data Deluge: 1 (now) to 100’s petabytes/year (2012)
• Moore’s law for Sensors




Possible filters assigned dynamically (on-demand)
• Run image processing algorithm on telescope image
• Run Gene sequencing algorithm on compiled data
Needs decision support front end with “what-if”
simulations
Metadata (provenance)
critical to annotate data
Integrate across experiments
as in multi-wavelength
astronomy
Data Deluge comes from pixels/year available
A typical Web Service


In principle, services can be in any language (Fortran .. Java ..
Perl .. Python) and the interfaces can be method calls, Java RMI
Messages, CGI Web invocations, totally compiled away (inlining)
The simplest implementations involve XML messages (SOAP) and
programs written in net friendly languages like Java and Python
Web Services
WSDL interfaces
Portal
Service
Security
WSDL interfaces
Web Services
Payment
Credit Card
Catalog
Warehouse
Shipping
control
Each service should be
able to run independently
on separate machines
Typical Grid Architecture
Re-use
Application
Customization
User
Services
Application
Service
Portal
Services
Application
Service
Libraries
Application
Service
Middleware
Re-use
“Core”
Grid
System
Services
System
Services
System
Services
Raw (HPC)
Resources
Database
Some Technical Issues

All IT approaches support systems with multiple
capabilities
• They often reveal and/or standardize interfaces
• They could be different method calls, Java classes, or
Web/Grid service interfaces

We will ONLY use the word Service when interface can
be efficiently accessed by messages with service as an
isolated single service
• Grids build systems from message-based services
Capabilities
often called services
Module
Module
Service even if NOT using
Servicea
Messages
B
A
B
A
Service
Oriented Architecture
Method Calls
1 to 10 microseconds
10 to 1000 millisecond latency
Message-based or Method-based


Method-based interfaces are most efficient but can only
be run in that fashion in a single monolithic
implementation
• One service with multiple ports
• i.e. each interface might be accessed via message but
all capabilities need to be co-located
• Technologies like Java RMI allow distributed objects
but requires serialization (often non trivial) and
unclear if application supports performance loss
“Message-based services” support standards and
distributed deployment with easy use of standards
compliant services from different implementers.
Sakai





The University of Michigan, Indiana University, MIT, Stanford,
and the uPortal consortium are joining forces to integrate and
synchronize their considerable educational software into a preintegrated collection of open source tools.
Sakai builds on OKI – Open Knowledge Initiative – interfaces
These Open Service Interface Definitions were developed
outside the Grid process but appear to have overlaps with
many Web service and Grid standards
• Note OGSA-DAI, Security, Workflow, WS-Notification,
Grid monitoring, WebDAV overlaps
Although they are called “services”, I think they are being
developed initially inside a (single) Java container
Does not address real-time collaboration except for chat
Portals

These are used rather inconsistently for
• A general term for the whole user experience with an interface to multiple
capabilities
• Narrow specification of certain capabilities such as customization, server
side support for web page generation, aggregation of document fragments
(one per service), security
• Broad specification to include both user interface and services

Note portals tend to be monolithic frameworks because that’s
how one used to build such things
• Jetspeed and CHEF’s modification of it are both frameworks

Portals need to be broken up into distributed message based
services for security, customization, layout, rendering
• Shouldn’t invest too much in today’s frameworks although they have
some wonderful features

However Portals do encourage “component” model for user
interfaces and so this fits service model so every service can be
packaged with its (document fragment) user interface
• So portlets are good even if containers primitive
The OGCE Computing Grid Portal
• Provides Portlets for
– Management of user proxy
certificates
– Remote file Management via
Grid FTP
– News/Message systems
• for collaborations
–
–
–
–
Grid Event/Logging service
Access to OGSA services
Access to directory services
Specialized Application Factory
access
• Distributed applications
• Workflow
– Access to Metadata Index tools
• User searchable index
– Real Time Collaboration
• Audio/Video Conferencing
OGCE
Consortium
Example Capability: File Management
• Grid FTP portlet– Allow
User to manage remote file
spaces
– Uses stored proxy for
authentication
– Upload and download files
– Third party file transfer
User
Jetspeed Portal Server
1 of many Portlets
• Request that GridFTP server A send aGridFTP
file to GridFTP server B
Server A
• Does not involve traffic through portal
server
GridFTP
Service
GridFTP
Server B
OGCE
Consortium
Education Grids


Education Grids can be considered from at least two
points of view
1) Exploiting e-Science and other relevant research
government or business grids whose resources can be
adapted for use in education
• Opportunity to make education more “real” and to give
students an idea what scientific research is like

2) Support the virtual organization that is the teacher
and learner community
• Actually this community is heterogeneous with teachers,
learners, parents, employers, publishers, informal education,
university staff ….

Build the Education Grid as a Grid of Grids
Typical Science Grid
Service such as Research
Database or simulation
Campus or
Enterprise
Administrative
Grid
Learning Management
or LMS Grid
Digital
Library
Grid
Science Grids
Bioinformatics
Earth Science …….
Transformed by Grid Filter
to form suitable for education
Publisher
Grid
Education Grid
Inservice Teachers
Preservice Teachers
School of Education
Teacher Educator
Grids
Student/Parent …
Community Grid
Informal
Education
(Museum)
Grid
Planning Grid
Education as a Grid of Grids
Education Grid of Grids




Services in an Education Grid fall into three classes
1) Those that special to Education such as quiz (as in
IMS), learning plan or grading services
2) Those that are important but can be taken from
other Grids such as collaboration and security
3) Those that come from other Grids and are refactored
for education
• The simulation is reduced in size
• The bioinformatics database interface is simplified
e-Science
Resource
Filter
Education Grid View
of e-Science Resource
Education Grid
Repositories
Federated Databases
Database
Sensors
Streaming
Data
Field Trip Data
Database
Research
SERVOGrid
Data
Filter
Services
Research
Simulations
Geoscience Research and
Education Grids
Customization
Services
From
Research
to Education
?
Discovery
Services
Education
Analysis and
Visualization
Portal
Education
Grid
Computer
Farm
XGSP Web Service MCU Architecture
Use Multiple Media servers to scale to many codecs and many
versions of audio/video mixing
Session Server
XGSP-based Control
NaradaBrokering
All Messaging
NB Scales as
distributed
Admire
Web
Services
SIP
H323
Media Servers
Filters
High Performance (RTP)
and XML/SOAP and ..
Access Grid
Gateways convert to uniform XGSP Messaging
NaradaBrokering
Native XGSP
Requirements or Issues to be Addressed I
• Interoperability: Several standards – e.g.
H323, T120, SIP, Access Grid – which are
inconsistent with themselves and with modern
Web standards
• Integration: Integrate all forms of collaboration
– instant messenger, audio-video conferencing,
application sharing
• Life-cycle costs: use commodity software
components
• Extensibility: Interfaces defined for adding
new capabilities
•
•
•
•
•
•
Requirements or Issues to be Addressed II
Performance: Allow maximum performance
with given network with no unnecessary client
or server overheads
Fault Tolerance: Fault tolerant session control
Security: Support multiple levels of security for
clients, servers and communication traffic
Scalability: Current systems are often limited
by architecture or implementation (such as a
single server) in number of simultaneous
participants
Pervasive Access: Need to support wide
range of clients from hand-held devices to
sophisticated desktop system.
Ease of Use: Simple web portal interface; no
Collaboration Architecture
• Use Grid and Web Service base architecture
• Define XML-based Collaboration Interface specification
capturing semantics of existing standards
• Define open interfaces allowing both third party services to be
developed and to allow competitive implementation of base
infrastructure
• Use software overlay network to support needed dynamic
routing and message-based architecture
• Use active measurements to find network performance and
network
or server/broker
• Web Service
architecturefaults
with N logN servers to support
• Use
Web Service message based security
N participants
• 1000
simultaneous streams
needsfor
around
50 low-end
• Use
publish/subscribe
paradigm
all messaging
to support
multi-participant
Linux servers sessions and archiving
• Does
not need multi-cast;
supports web-cams
• Use
distributed
scalable fault-tolerant
middleware including
WS-RM
(Web
Service
Messaging)
• Supports
Polycom
andReliable
Access Grid
clients or equivalent
Summary









Grids are inevitably important for Education
Grid of Grids interesting way to build “new Grids” that might be
accepted by skeptical participants and enhance re-use
IMS has set data but not many service standards
• Partial step to interoperability
Sakai is building modern (probably wonderful) open e-learning
capabilities but appears not to be a Grid/WS standards compliant
service architecture
Current academic/commercial systems are successful but
monolithic and perhaps are too education-specific
Opportunity to build service-based Education Grid Infrastructure
interacting with broad community (from Grids to WS to Schools
of Education) exploiting other Grids
Can build collaboration – A/V Conferencing, Shared applications,
groupware – in Grid/WS architecture
Can develop best practice and tools to allow e-Science grids to be
linked to education
Can encourage use of component-based portals