NGS in the future: emerging middleware Mike Mineter

Download Report

Transcript NGS in the future: emerging middleware Mike Mineter

http://www.grid-support.ac.uk
http://www.ngs.ac.uk
NGS in the future: emerging
middleware
Mike Mineter
[email protected]
http://www.nesc.ac.uk/
http://www.pparc.ac.uk/
http://www.eu-egee.org/
NGS middleware evolution
EGEE…
Other software
sources
Prototypes &
specifications
‘Gold’ services
NGS
ETF
Software with proven
capability & realistic
deployment
experience
Operations
UK,
Campus
and other
grids
Feedback & future requirements
Engineering Task Force
Deployment/testing/advice
2
Outline
• Middleware currently being prepared for deployment
– Resource broker
– (NGS portal – yesterday!)
• The context for the next generation of middleware:
seeking convergence with web services
• Under assessment:
– gLite middleware from EGEE
– OMII
– GT4
3
Resource broker
• (This is NOT the SRB!!!)
• Current NGS middleware : Toolkits inviting
development of higher level services
• On the current NGS we have
– GRAM to submit jobs
– Information service to tell us what queues are busy
• The RB takes the work out of deciding where to
run a job
4
Major components
“User
interface”
Input “sandbox”
Output “sandbox”
DataSets info
Replica
Catalogue
Information
Service
Resource
Broker
Publish
Logging &
Book-keeping
Job Query
Job Submit Event
Author.
&Authen.
Storage
Element
Job Status
Computing
Element
Resource broker
• Job Description Language file: describes
resources needed by a job
• Commands analogous to GT2:
– edg-job-submit <jdl filename>
– edg-job-status <dg-job-id>
– edg-job-get-output <dg-job-id>
6
Example
• edg-job-submit myjob.jdl
– Myjob.jdl
• JobType = “Normal”;
• Executable = "$(CMS)/exe/sum.exe";
• InputSandbox = {"/home/user/WP1testC","/home/file*”,
"/home/user/DATA/*"};
• OutputSandbox = {“sim.err”, “test.out”, “sim.log"};
• Requirements = other. GlueHostOperatingSystemName ==
“linux" &&
• other. GlueHostOperatingSystemRelease == "Red Hat 7.3“
&& other.GlueCEPolicyMaxCPUTime > 10000;
• Rank = other.GlueCEStateFreeCPUs;
7
More about the RB
• Developed by the European DataGrid project, EDG then
“hardened” by LCG, and now one of the sources for the
EGEE middleware
• Uses components of Condor
– matchmaker and Condor-G
• Try the GENIUS portal on GILDA
– GILDA is a dissemination grid running the LCG-2 middleware
– Demo site: https://grid-demo.ct.infn.it/
• And look at
http://lcg.web.cern.ch/LCG/
http://www.hep.ph.ic.ac.uk/escience/projects/demo/index.html
8
Implications for the NGS
• Integration with NGS core nodes in progress
• “UI” requirements??:
– LCG user interface + OGSA-DAI + SRB client
– Lighter-weight alternatives?
• To packaging?
• For client software
9
• http://www.hep.ph.ic.ac.uk/escience/projects/demo/index.html
• Monitoring of the current LCG-2/ EGEE-0
system, that includes resource brokers
10
Resource broker summary
• The resource broker receives a job description
in JDL
• It chooses a batch queue for job submission,
using the information services
• Its an example of the higher services that can be
deployed for the NGS, built upon the current
toolkits
11
Outline
• Middleware currently being prepared for deployment
– Resource broker
– (NGS portal – yesterday!)
• The context for the next generation of middleware:
seeking convergence with web services
• Under assessment:
– gLite middleware from EGEE
– OMII
– GT4
12
Convergence of Web
Services and Grids
Web services
Resource
Framework
Web services
World-wide web
OGSI
Grid prototypes
High-end computing
High throughput-computing
INTERNET
Massively parallel
computing
13
Convergence with WS
Why?
•Opens integration of grids and WS worlds!
•Need service orientation for grids!
•Need easier installation!!
Grid services
SOAP
XML
Grid services
http, https
Operating system; TCP/IP
O/S; TCP/IP
14
Moving to WS:
• Leverage WS hosting
environments
• OGSI first attempt
• Building on WS
– Many projects
– EGEE’s gLite
• Now moving towards
“WS-RF”
– Set of emerging standards
– Globus Toolkit 4
Grid
services
OGSI – GT3
Grid services
Now: GT4
SOAP
Axis
XML
http, https
TOMCAT
O/S; TCP/IP
O/S; TCP/IP
15
Outline
• Middleware currently being prepared for deployment
– Resource broker
– (NGS portal – yesterday!)
• The context for the next generation of middleware:
seeking convergence with web services
• Under assessment:
– gLite middleware from EGEE
– OMII
– GT4
• Introduction to EGEE and OMII follows
– You know Globus already!
16
EGEE – towards e-infrastructure
Enabling Grids for E-sciencE
EGEE is building a large-scale
production grid service to:
• Underpin research,
technology and public service
• Link with and build on
national, regional and
international initiatives
• Foster international
cooperation both in the
creation and the use of the einfrastructure
INFSO-RI-508833
Pan-European Grid
Operations, Support and
training
Collaboration
Network
infrastructure
& Resource
centres
17
Project Goals
Enabling Grids for E-sciencE
A four year programme:
• Build, deploy and operate a consistent, robust and
secure grid that attracts new computing resources
• Improve and maintain the middleware in order to
deliver a reliable service to users
• Attract new users from science and industry and
ensure training and support for them
INFSO-RI-508833
18
In the first 2 years EGEE will
Enabling Grids for E-sciencE
• Establish production quality sustained Grid
services
– 3000 users from at least 5 disciplines
– integrate 50 sites into a common
infrastructure
– offer 5 Petabytes (1015) storage
• Demonstrate a viable general process to
bring other scientific communities on board
• Propose a second phase in mid 2005 to take
over EGEE in early 2006
INFSO-RI-508833
Pilot
New
19
EGEE Organisation
Enabling Grids for E-sciencE
• 70 leading institutions in 27
countries, federated in regional Grids
• ~32 M Euros EU funding for first 2
years starting April 2004
(matching funds from partners)
• Leveraging national and regional grid
activities
• Promoting scientific partnership
outside EU
INFSO-RI-508833
20
Activities Definition
Enabling Grids for E-sciencE
•
Network Activities
–
–
–
–
–
•
NA1: Project Management
NA2: Dissemination and Outreach
NA3: User Training and Induction
NA4: Application Identification and Support
NA5: Policy and International Cooperation
Service Activities
– SA1: Grid Support, Operation and Management
– SA2: Network Resource Provision
•
Joint Research Activities
–
–
–
–
JRA1: Middleware Reengineering + Integration
JRA2: Quality Assurance
JRA3: Security
JRA4: Network Services Development
INFSO-RI-508833
Emphasis in EGEE is on
operating a production
grid and supporting the
end-users
21
gLite: Guiding Principles
Enabling Grids for E-sciencE
•
VDT
EDG
...
AliEn
LCG
...
Service oriented approach
– Allow for multiple interoperable
implementations
•
Lightweight (existing) services
– Easily and quickly deployable
– Use existing services where
possible
 Condor, EDG, Globus, LCG, …
•
Portable
– Being built on Scientific Linux and
Windows
•
•
Security
– Co-existence with LCG-2 and OSG
(US) are essential for the EGEE
Grid services
– Sites and Applications
•
Performance/Scalability &
Resilience/Fault Tolerance
•
Site autonomy
– Reduce dependence on ‘global,
central’ services
– Comparable to deployed
infrastructure
•
INFSO-RI-508833
Co-existence with deployed
infrastructure
Open source license
22
gLite Services for Release 1
Enabling Grids for E-sciencE
JRA3
Grid Access
Service
CERN
API
Access Services
Authorization
Information &
Monitoring
Auditing
Authentication
Security Services
Metadata
Catalog
File & Replica
Catalog
Storage
Element
Data
Management
IT/CZ
Application
Monitoring
Information &
Monitoring Services
Accounting
Job
Provenance
Package
Manager
Site Proxy
Computing
Element
Workload
Management
Data Services
INFSO-RI-508833
UK
Job Management Services
23
Grid Operations
Enabling Grids for E-sciencE
•
•
RC
RC
ROC
RC
– Essential to scale the operation
RC
RC
RC
RC
RC
•
ROC
CIC
CIC
RC
CIC
CIC
OMC
RC
CIC
RC
RC
RC
RC
ROC
RC
RC = Resource Centre
INFSO-RI-508833
ROC
CICs act as a single Operations
Centre
– Operational oversight (grid
operator) responsibility
– rotates weekly between CICs
– Report problems to ROC/RC
– ROC is responsible for ensuring
problem is resolved
– ROC oversees regional RCs
RC
RC
CIC
The grid is flat, but
Hierarchy of responsibility
RC
RC
•
RC
ROCs responsible for organising
the operations in a region
– Coordinate deployment of
middleware, etc
•
CERN coordinates sites not
associated with a ROC
24
Open Middleware
Infrastructure Institute

The slides that follow were selected and (in a
few cases) modified by Mike Mineter (NeSC)
from those presented in January 2005 at an
OMII training day
Steven Newhouse, Peter Henderson
Stephen Crouch & Karen Ng

Goal of this presentation: to rase awareness
of the OMII and its OMII_1 release
MM
25
Open Middleware
Infrastructure Institute
OMII goal: to be the source of open source grid
software





Institute of the University of Southampton
Utilise existing software and standards
Production focused software development
Integrate, test & document ‘a product’
Focus on the user experience


Easy to install & use
Utilise existing software and standards

Provide a solid web service base for others to build on
26
Where does our software come
from?

Open Source Community


Software Repository



Tomcat, Axis, etc.,
Accept software contributions
Software deployed, tested & graded to provide
feedback
Managed Programme


Fill gaps to build a solid enabling infrastructure
Projects to bring research software to production
quality
27
OMII_1 release:
A basic File-Compute Grid




Enables a generic computational task
Move input data from the client to the service
provider
Process the data using an application on the
service provider
Retrieve the output data from the service
provider
28
OMII Server Infrastructure
PBAC
ExampleService
TestService
Job
Data
Allocation
Account
Resource Acct
Mgmt
Mgmt
Servlet Servlet
Happy
Axis
WS-Security
AXIS
Static Webpage
TOMCAT
29
Try out the OMII_1 client !



Register at www.omii.ac.uk & login
Goto the downloads page
Download the client distribution



SuSE 9.0
 Client may work on other Linuxs but no exhaustive testing
Windows XP (SP 1 & 2)
Distribution requires JDK 1.4.2_04



Does not work with ‘just’ a JRE
Will not work with JDK 1.4.2_05/06 & JDK 1.5.0
No testing with earlier JDKs.
30
Summary
• Middleware currently being prepared for deployment
– Resource broker
– (NGS portal – yesterday!)
• The context for the next generation of middleware:
service orientation, based on WS and the emerging
standards
• Under assessment by the Engineering Task Force:
– gLite middleware from EGEE
– OMII
– GT4
31
Additional slides - background
32
Managed Programme









GridSAM (Job Submission & Monitoring service)
BPEL (Workflow service)
Grimoires (Registry service based on UDDI)
FIRMS (Reliable messaging)
FINS (Notification)
GeodiseLab (Matlab toolbox)
WSRF::Lite integration
OGSA-DAI (Database service)
WSeSS (Using SSH to tunnel requests to resources)
33
Grid Architecture Today

The best way of designing Grids…



The best way of running Grids…



Loosely coupled services
Message based exchange
Interoperability between versions & grids
Standards for infrastructure & services
The best way of building Grids…


Leverage existing infrastructure & standards
Use Web Services…
34
Some Web Service Definitions



A service is the logical manifestation of some
physical or logical resources (databases, programs,
devices, humans, etc) and/or some application logic
that is exposed to the network
Service interaction is facilitated by message
exchanges
A service is an abstract resource that represents a
capability of performing tasks that represents a
coherent functionality from the point of view of
provider entities and requester entities. To be used,
a service must be realised by a concrete provider
agent
35
Web Services (WS)


XML: Platform neutral mechanism to describe
data
SOAP: Mechanism to describe message
exchange

Simple Object Access Protocol


Service Oriented Access Protocol


Not simple and nothing to do with Objects!
Re-engineering of acronym to fit current use!
WSDL: Defines the service interface
36
More WS concepts…

Services have to reside in a supporting environment:





Called: hosting environment or container
Marshals requests into and response out of the service
Service can discover local configuration parameters
Provides a standard infrastructure for service developers
Processing incoming requests & outgoing responses



Called: Message handlers
Manipulates elements of the message header
 Primarily the SOAP header
Handlers can be applied to message traffic into or out of
the whole container or a specific service
37
Putting it all together…

Architecturally web services provide…







Process of independent loosely coupling services
Defining service interfaces (or contract)
Defining the format of the messages interchange
Platform neutral
Flexible granularity
Clearly defined boundaries
Need an implementation…
38
OMII_1 as a Service Provider


Goal: I want others to access my resources &
applications
I want to provide secure controlled access to:




My applications:
 Specify who can access which applications
My computational resources:
 I can limit external usage of my resources
Provides an interface that allows remote users to
access my resources
Enable collaboration with other partners
39
OMII_1 as a User (or Client)


Goal: I want to use other resources & applications
Through a network of service providers I can…:



Gain access to applications that I do not have installed locally
Use remote machines with more CPU, memory or storage
 Process larger problems sizes
Transparently switch between different service providers
 No exposure to underlying OS, queuing policy, disk layout etc.
40
OMII 1:Basic File-Compute Grid

Consists of:





Base (Tomcat 5.0.25 & Axis 1.2b)
Extensions (Axis Handlers)
 WS-Security
 Process Based Access Control
Basic Services
Sample application
Plus installers, README’s & documentation
41
OMII 1:Basic Services


Based on a group of four services
Functional: Data & Application execution



Running jobs using pre-installed applications
Movement of input and output data files
Management: Account and Resources


Must have an account with a service provider
Or delegated access to someone else’s account
42
Process Based Access Control:
A model for implementing AAA


Authentication: CA issued X.509 certificates
Authorisation: Interaction dependent authorisation
process



Access control lists tied to process context and state
 i.e. impose server side workflow requirements
Supports “delegation” and “subordination” actions
Accounting: Activity matched against allocated quota


Clients control who can access “their” allocated quota
Collaboration with minimal overhead for service providers
43