gLite Architecture

Enabling Grids for E-sciencE
The gLite middleware distribution
OSG Consortium Meeting
Seattle, 21-23 August 2006
www.eu-egee.org
www.glite.org
EGEE-II INFSO-RI-031688
Outline
• Background and approach adopted
• Architecture
• Software process
• Status
• Summary
Background & Approach
• gLite
– Exploit experience and existing components from VDT (Condor, Globus), EDG/LCG, AliEn, and others
– Develop a lightweight stack of generic middleware useful to EGEE applications (HEP and biomedicine are the pilot applications)
▪ Pluggable components: cater for different implementations
▪ Follow an SOA approach, WS-I compliant where possible
– Focus is on re-engineering and hardening
– Business-friendly open source license
▪ Plan to switch to Apache-2
Service Oriented Architecture
• gLite follows a Service Oriented Architecture
– Facilitate interoperability among Grid services
– Allow easier compliance with upcoming standards
– The services work together in a concerted way but can also be deployed and used independently, allowing their exploitation in different contexts
• Services communicate through the exchange of messages
– Slowly moving to WS-* interfaces
– Still missing a real standard; there are many WS-* specifications
– Activity inside GGF-GIN
Middleware structure
[Layer diagram, top to bottom]
Applications
Higher-Level Grid Services: Workload Management, Replica Management, Visualization, Workflow, Grid Economies, ...
Foundation Grid Middleware: Security model and infrastructure, Computing (CE) and Storage Elements (SE), Accounting, Information and Monitoring
• Applications have access both to Higher-Level Grid Services and to Foundation Grid Middleware
• Higher-Level Grid Services are meant to help users build their computing infrastructure but should not be mandatory
• Foundation Grid Middleware will be deployed on the EGEE infrastructure
– Must be complete and robust
– Should allow interoperation with other major grid infrastructures
– Should not assume the use of Higher-Level Grid Services
gLite Grid Middleware Services
[Service diagram, grouped by box]
Access: CLI, API
Security: Authorization, Authentication, Auditing
Information & Monitoring: Information & Monitoring, Application Monitoring
Data Management: Metadata Catalog, File & Replica Catalog, Storage Element, Data Movement
Workload Management: Accounting, Job Provenance, Package Manager, Computing Element, Workload Management
Site Proxy
Overview paper http://doc.cern.ch//archive/electronic/egee/tr/egee-tr-2006-001.pdf
Grid foundation: Accounting
• Resource usage by VO, group or single user
– Resource metering: sensors running on resources to determine usage
– Pricing policies: associate a cost with resource usage
▪ if enabled, allows market-based resource brokering
– Privacy: access to accounting data granted only to authorized people (user, provider, VO manager)
• Basic functionality in APEL, full functionality in DGAS
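
The metering and pricing ideas above can be illustrated with a minimal sketch. It is not the APEL or DGAS data model: the record fields, the VO names and the per-VO prices are all hypothetical, chosen only to show usage being aggregated at VO, group or user level and a cost being attached to it.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class UsageRecord:
    """One metering record, as a sensor on a resource might report it (hypothetical fields)."""
    vo: str
    group: str
    user: str
    cpu_hours: float


# Hypothetical pricing policy: cost units per CPU hour, keyed by VO.
PRICE_PER_CPU_HOUR = {"atlas": 1.0, "biomed": 0.5}


def cost(record):
    """Associate a cost with the usage in a single record."""
    return record.cpu_hours * PRICE_PER_CPU_HOUR[record.vo]


def usage_by(records, level):
    """Aggregate CPU hours at 'vo', 'group' or 'user' level."""
    totals = defaultdict(float)
    for r in records:
        totals[getattr(r, level)] += r.cpu_hours
    return dict(totals)


records = [
    UsageRecord("atlas", "prod", "alice", 10.0),
    UsageRecord("atlas", "prod", "bob", 4.0),
    UsageRecord("biomed", "imaging", "carol", 6.0),
]
```

Calling `usage_by(records, "vo")` groups the three records by VO, while `usage_by(records, "user")` gives the per-user view that a privacy policy would restrict to the user, the provider and the VO manager.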
Grid foundation: Computing Element
The CE software
• accepts batch jobs (and job control requests) through a gatekeeper
• performs the necessary AAA operations and maps the job to a local user
▪ through LCAS/LCMAPS and the GRAM or glexec
• passes the job to a layer that interacts with the local resource manager
▪ BLAH
• monitors the status of the jobs and reports it to the client
▪ Condor
▪ CEMon (in CREAM)
▪ Web service interface to the CE info
• CE implementations
▪ LCG-CE (GT2 GRAM + GSI-enabled Condor)
▪ gLite-CE (GSI-enabled Condor-C)
▪ CREAM (WS-I based interface)
[Diagram labels: Grid, Site, Client, Computing Element (Monitoring, AAA and local mapping, Job Controller), LRMS, WN]
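
The four CE responsibilities listed on this slide can be sketched as a toy class. This is an illustration only, not the interface of LCG-CE, gLite-CE or CREAM: the `ComputingElement` class, the `GRID_MAP` table, the example certificate DN and the state names are all hypothetical, and the local resource manager is reduced to an in-memory queue.

```python
from dataclasses import dataclass, field

# Hypothetical mapping from a grid identity (certificate DN) to a local account.
# On the slide this role is played by LCAS/LCMAPS.
GRID_MAP = {"/DC=org/DC=example/CN=Alice": "atlas001"}


@dataclass
class ComputingElement:
    queue: list = field(default_factory=list)    # stands in for the LRMS
    status: dict = field(default_factory=dict)   # job id -> state

    def submit(self, job_id, user_dn, executable):
        # AAA step: reject unknown identities, map known ones to a local user.
        local_user = GRID_MAP.get(user_dn)
        if local_user is None:
            self.status[job_id] = "REJECTED"
            return False
        # Hand the job to the layer that talks to the local resource manager.
        self.queue.append((job_id, local_user, executable))
        self.status[job_id] = "QUEUED"
        return True

    def run_next(self):
        # The LRMS would dispatch the job to a worker node; here it just completes.
        job_id, _local_user, _executable = self.queue.pop(0)
        self.status[job_id] = "DONE"
        return job_id

    def query(self, job_id):
        # Monitoring step: report the job status back to the client.
        return self.status.get(job_id, "UNKNOWN")
```

A client would call `submit`, then poll `query` until the job leaves the `QUEUED` state; a submission from an identity missing in `GRID_MAP` never reaches the queue.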
Grid foundation: Storage Element
• Site File Name (SFN): identifies a Storage Element and the logical name of the file inside it
• Physical File Name (PFN): argument of file open
• Storage Resource Manager (SRM)
– hides the storage system implementation (disk or active tape)
– checks the access rights to the storage system and the files
– translates SFNs to PFNs
– disk-based: DPM, dCache; tape-based: Castor, dCache
• File I/O: POSIX-like access from local nodes or the grid
– GFAL
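
The SRM roles described above (access check, then SFN to PFN translation) can be sketched as follows. This is not the SRM protocol or any real SRM client API: the class, the `host:/logical/name` SFN layout, the host name and the paths are assumptions made for illustration.

```python
class StorageResourceManager:
    """Toy stand-in for an SRM front end (hypothetical, not the SRM interface)."""

    def __init__(self, host, namespace, acl):
        self.host = host
        self.namespace = namespace   # logical name -> physical file name
        self.acl = acl               # logical name -> set of allowed users

    def resolve(self, sfn, user):
        """Translate an SFN written as 'host:/logical/name' into a PFN."""
        host, _, logical = sfn.partition(":")
        if host != self.host:
            raise ValueError(f"SFN is for {host}, not {self.host}")
        # Check access rights before revealing where the file physically lives.
        if user not in self.acl.get(logical, set()):
            raise PermissionError(f"{user} may not access {logical}")
        return self.namespace[logical]


srm = StorageResourceManager(
    host="se.example.org",
    namespace={"/grid/data/run1.root": "/pool3/0007/run1.root"},
    acl={"/grid/data/run1.root": {"alice"}},
)
```

The returned PFN is what would be handed to a file open call, so the client never needs to know whether the backend is disk or tape.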
High Level Services: EDS
• Encrypted Data Storage
– encrypt and decrypt data on-the-fly
– Key-store: Hydra
▪ N instances: at least M (<N) need to be available for decryption
▪ fault tolerance and security
• Demonstrated with the SRM-DICOM demo at the EGEE Pisa conference (Oct '05)
[Diagram annotations: "Will be DPM (now d-Cache)", "Will be LFC", "Will be GFAL"]
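
The "at least M of N instances" property can be illustrated with a threshold secret-sharing sketch. The slide does not say how Hydra splits keys, so the use of Shamir's secret sharing here is an assumption, and the function names and the choice of prime field are illustrative only.

```python
import secrets

# Mersenne prime 2**127 - 1; the field must be larger than the secret being split.
PRIME = 2**127 - 1


def split(secret, n, m):
    """Split `secret` into n shares so that any m of them recover it."""
    # Random polynomial of degree m - 1 whose constant term is the secret.
    coeffs = [secret] + [secrets.randbelow(PRIME) for _ in range(m - 1)]
    shares = []
    for x in range(1, n + 1):
        y = 0
        for c in reversed(coeffs):       # Horner evaluation of the polynomial at x
            y = (y * x + c) % PRIME
        shares.append((x, y))
    return shares


def combine(shares):
    """Recover the secret by Lagrange interpolation at x = 0 over the prime field."""
    secret = 0
    for i, (xi, yi) in enumerate(shares):
        num, den = 1, 1
        for j, (xj, _) in enumerate(shares):
            if i != j:
                num = (num * -xj) % PRIME
                den = (den * (xi - xj)) % PRIME
        # Modular inverse via Fermat's little theorem, valid because PRIME is prime.
        secret = (secret + yi * num * pow(den, PRIME - 2, PRIME)) % PRIME
    return secret
```

With `shares = split(key, 5, 3)`, each of five key-store instances would hold one share; any three of them passed to `combine` return `key`, so two instances can be offline without blocking decryption, and fewer than three shares do not determine the key.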
High Level Services: Workload Management
• Resource brokering, workflow management, I/O data management
– Web Service interface: WMProxy
– Task Queue: keeps non-matched jobs
– Information SuperMarket: optimized cache of the information system
– Match Maker: assigns jobs to resources according to user requirements
– Job submission & monitoring
▪ Condor-G
▪ ICE (to CREAM)
– External interactions:
▪ Information System
▪ Data Catalogs
▪ Logging & Bookkeeping
▪ Policy Management system (G-PBox)
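
How the Match Maker, the Information SuperMarket and the Task Queue fit together can be sketched in a few lines. This is not the WMS matchmaking algorithm or its JDL requirement language: the resource attributes, the ranking rule (most free CPUs) and the job fields are hypothetical.

```python
from collections import deque

# Hypothetical cached snapshot of the information system
# (the role the slide gives to the Information SuperMarket).
resources = [
    {"name": "ce01", "free_cpus": 4, "os": "SL4"},
    {"name": "ce02", "free_cpus": 0, "os": "SL3"},
]


def match(job, resources):
    """Return the matching resource with the most free CPUs, or None."""
    candidates = [
        r for r in resources
        if r["free_cpus"] >= job["cpus"] and r["os"] == job["os"]
    ]
    return max(candidates, key=lambda r: r["free_cpus"], default=None)


def schedule(jobs, resources):
    """Assign each job to a resource; unmatched jobs are kept in a task queue."""
    task_queue = deque()
    assignments = {}
    for job in jobs:
        target = match(job, resources)
        if target is None:
            task_queue.append(job["id"])        # retried when resources change
        else:
            target["free_cpus"] -= job["cpus"]  # update the cached view
            assignments[job["id"]] = target["name"]
    return assignments, task_queue
```

Jobs left in `task_queue` are not rejected; they simply wait until a later snapshot of the information system offers a resource that satisfies their requirements.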
High Level Services: Job Information
• Logging and Bookkeeping service
– Tracks jobs during their lifetime (in terms of events)
• Job Provenance stores long term job information
– Supports job rerun
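
Tracking a job "in terms of events" can be pictured as an append-only log from which the current state is derived. The sketch below is not the Logging and Bookkeeping API: the class, the event names and the state names are hypothetical.

```python
from collections import defaultdict

# Hypothetical event names and the job state each one leads to.
EVENT_TO_STATE = {
    "Accepted": "SUBMITTED",
    "Matched": "SCHEDULED",
    "Running": "RUNNING",
    "Done": "DONE",
}


class Bookkeeping:
    def __init__(self):
        self.events = defaultdict(list)   # job id -> ordered event log

    def log(self, job_id, event):
        """Append an event reported by some service that handled the job."""
        if event not in EVENT_TO_STATE:
            raise ValueError(f"unknown event: {event}")
        self.events[job_id].append(event)

    def state(self, job_id):
        """The current state is derived from the most recent event."""
        log = self.events.get(job_id)
        return EVENT_TO_STATE[log[-1]] if log else "UNKNOWN"

    def history(self, job_id):
        """Full event history, the kind of record a long-term store would keep."""
        return list(self.events.get(job_id, []))
```

Keeping the full history rather than only the latest state is what makes it possible to archive a job's record for later inspection or rerun.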
gLite Software Process
[Process flow diagram]
Stages: JRA1 Development → SA3 Integration → SA3 Testing & Certification → SA1 Pre-Production → SA1 Production Infrastructure
Stage labels: Software, Error Fixing, Deployment Packages, Integration Tests, Testbed Deployment, Functional Tests, Pre-Production Deployment, Scalability Tests, Release (Installation Guide, Release Notes, etc.)
Transition labels: Pass, Fail, Directives, Problem, Serious problem
gLite Software Process
• Technical Coordination Group (TCG)
– gathers & prioritizes user requirements
from HEP, biomed, (industry), sites
– gLite development is client-driven!
• Software from EGEE-JRA1 and other projects
– JRA1 preview testbed (currently being set up)
▪ early exposure to users of “uncertified” components
• SA3 Integration Team
– Ensures components are deployable and work
– Deployment modules implemented for high-level gLite node types (WMS, CE, R-GMA Server, VOMS Server, FTS, etc.)
– Build system now spun off into the ETICS project (Jan 2006)
• SA3 Certification Team
– Merge of the JRA1 testing and SA1 certification teams
– Dedicated testbed; test release candidates and patches
– Develop test suites
• SA1 Pre-Production System
– Scale tests by users
gLite status
• Convergence of LCG 2.7.0 and gLite 1.5.0 in spring 2006
– continuity on the production infrastructure ensured usability by experiments
– new features from gLite 1.5.0
• Current activities
– Improve usability, efficiency and performance
– Migration to VDT 1.3.11 (GT4 pre-WS)
– Support for Scientific Linux 4 and 64-bit
▪ Support for other platforms will follow
– New data management components for Biomed applications on the production infrastructure
– Certify new components (CREAM, Job Provenance, G-PBox, ...)
• Interoperation with other projects and adherence to standards
• Open source (Apache) license
[Timeline figure: LCG-2 and gLite tracks across 2004, 2005 and 2006, labelled "prototyping" and "product", ending with gLite 3.0 in 2006]
Summary
• gLite 3 is an important milestone in the EGEE program
– New components from gLite 1.5 are being deployed for the first time on the Production Infrastructure
▪ Address requirements in terms of functionality and scalability
▪ Components deployed for the first time need extensive testing!
– New organization in EGEE-II
▪ more controlled software process
▪ development is client-driven (TCG)
• Development is continuing to provide increased robustness, usability and functionality
• Collaboration with other projects for interoperability and definition/adoption of international standards