Enabling Grids for E-sciencE The EGEE Project Owen Appleton EGEE Dissemination Officer CERN, Switzerland Danish Grid Forum visit, CERN, 13 September 2005 www.eu-egee.org INFSO-RI-508833

Download Report

Transcript Enabling Grids for E-sciencE The EGEE Project Owen Appleton EGEE Dissemination Officer CERN, Switzerland Danish Grid Forum visit, CERN, 13 September 2005 www.eu-egee.org INFSO-RI-508833

Enabling Grids for E-sciencE
The EGEE Project
Owen Appleton
EGEE Dissemination Officer
CERN, Switzerland
Danish Grid Forum visit, CERN, 13 September 2005
www.eu-egee.org
INFSO-RI-508833
Presentation Overview
Enabling Grids for E-sciencE
• What is EGEE
– Structure
– Rationale
• State of the project
– Applications running
– Industrial involvement
• Toward EGEE-II
– Changes foreseen
– A new model for Industrial involvement
INFSO-RI-508833
The EGEE Project - Owen Appleton - Sept 05
2
What is EGEE
Enabling Grids for E-sciencE
• EGEE is the Enabling Grid for E-SciencE project
– Funded by the EU through the Information Society
Infrastructures programme
 Conceived as first 24 months of a 48 month programme
 ~32M€ funding
• EGEE Objectives:
–
–
–
–
Consistent, robust and secure service grid infrastructure
Improving and maintaining the middleware
Attracting new resources and users from industry as well as science
Spread knowledge of Grid Technology across the European
Research Area and beyond
INFSO-RI-508833
The EGEE Project - Owen Appleton - Sept 05
3
Project structure
Enabling Grids for E-sciencE
• EGEE is an international
consortium
– More than 70 partners in 27 countries
organised into 12 federations
 Includes links to US and Asian
countries (to be partners in EGEE II)
– Infrastructure almost all shared with
LCG (as well as some key staff)
– Supports and benefits from a large
range of related projects and National
Grid initiatives
INFSO-RI-508833
The EGEE Project - Owen Appleton - Sept 05
4
Project Rationale
Enabling Grids for E-sciencE
• The LCG project is being built due to need from High
Energy Physics for large scale computing power
– Processing 15 petabytes of data per year from LHC experiments
• EGEE will make the LCG/EGEE infrastructure available
to researchers from all scientific fields
– Two pilot applications (HEP & Biomed), several other
applications
• Provide end-to-end service for users
– Help them find out about the grid, port their applications and
teach them how to submit jobs to the infrastructure
INFSO-RI-508833
The EGEE Project - Owen Appleton - Sept 05
5
State of the Project
Enabling Grids for E-sciencE
• We are 18 months into a 24 month project (started 1st April 2004)
• Have already exceeded some targets for the end of year
two!
– 180 sites (50 expected)
– 5PB storage (500TB expected)
INFSO-RI-508833
The EGEE Project - Owen Appleton - Sept 05
6
EGEE Sites
Enabling Grids for E-sciencE
In collaboration with LCG
Site Map
NorduGrid
Grid3/OSG
Status 25 July 2005
INFSO-RI-508833
The EGEE Project - Owen Appleton - Sept 05
7
EGEE Activities
Enabling Grids for E-sciencE
• 48 % service activities
– Grid Operations, Support and
Management, Network Resource
Provision)
• 24 % joint research activities
– Middleware re-engineering, Quality
Assurance, Security, Network
Services Development
• 28 % networking activities
– Management, Dissemination and
Outreach, User Training and
Education, Application Identification
and Support, Policy and
International Cooperation
INFSO-RI-508833
Emphasis in EGEE is on
operating a production
grid and supporting the
end-users
The EGEE Project - Owen Appleton - Sept 05
8
Applications Running
Enabling Grids for E-sciencE
• Two pilot application
domains
– High Energy Physics
– Biomedicine
• Four other domains joined
during EGEE
–
–
–
–
ESR (Earth Sciences)
Computational chemistry
Magic (Astronomy)
EGEODE (Geo-Physics)
Industrial Application!
INFSO-RI-508833
The EGEE Project - Owen Appleton - Sept 05
9
Pilot Application Domains
Enabling Grids for E-sciencE
Mont Blanc
(4810 m)
• High Energy Physics
– The 4 LHC experiments via LCG
 ALICE, ATLAS, CMS, LHCb
 LHC data challenges have used ~1000
CPU years
Downtown Geneva
– Other international physics experiments
 D0, CDF, Zeus and Babar
• Biomedicine
– 12 applications in 18 research institutes
 Protein sequence analysis, medical
imaging, Malaria Drug Discovery and
many others
– Recent Drug Discovery Data challenge
used ~40 CPU years in under 2 months
INFSO-RI-508833
The EGEE Project - Owen Appleton - Sept 05
10
Industrial Involvement
Enabling Grids for E-sciencE
• Industrial involvement at all levels
– Contracted partners, such as CS-SI
(France) and Datamat SpA (Italy)
– Resources Providers, such as Hewlett
Packard (USA)
– Users such as Compagnie General de
Geophysique (France)
• Active Industry Forum
– Give advice on direction of project and
learn about project results
• Commitment to transferring
knowledge and results to European
Industry
INFSO-RI-508833
The EGEE Project - Owen Appleton - Sept 05
11
Toward EGEE-II
Enabling Grids for E-sciencE
• EGEE-II proposal submitted to the EU
on 8th September, proposed start 1st
April 2006
• Continuation of EGEE, second two
years of a four year programme
• Expanded consortium, more than 90
partners in 32 countries with further
expansion through related projects
• Non-European partners in USA, Korea
and Taiwan
INFSO-RI-508833
The EGEE Project - Owen Appleton - Sept 05
12
Changes from EGEE
Enabling Grids for E-sciencE
• Changed focus of work
– Reduced efforts in Middleware
re-engineering (Joint Research Activities)
 Plans to integrate software from other projects and sources
– Increased funding for operations (Service Activities)
 Commitment to working, production-quality infrastructure
– Increased funding For Dissemination, Training and Application
Support (Networking Activities)
 Further spread knowledge of Grids and provide expanded support for
new users
INFSO-RI-508833
The EGEE Project - Owen Appleton - Sept 05
13
Incubator for Related Projects
Enabling Grids for E-sciencE
INFSO-RI-508833
The EGEE Project - Owen Appleton - Sept 05
14
New model for industrial involvement
Enabling Grids for E-sciencE
• Provide end-to-end model for
involvement
– Engage industry through targeted
dissemination activities
 Special events, targeted
dissemination material
– Further Involvement through Industry
Forum
 Open Up two way exchange of
information on Grid computing
– Option of technical involvement
through CERN openlab
 Proven model for industrialacademic collaboration
INFSO-RI-508833
The EGEE Project - Owen Appleton - Sept 05
15
Looking beyond EGEE-II
Enabling Grids for E-sciencE
• Provide an Infrastructure capable of being used on a
daily basis by a large range of disciplines
• Develop SLAs and support emerging standards to
allow all groups to easily join the European Grid
infrastructure
• Pave the way for sustainable use of the Infrastructure
beyond two-year time-scale of EGEE-II
• Prepare ground for a European Grid Organisation
INFSO-RI-508833
The EGEE Project - Owen Appleton - Sept 05
16
Conclusions
Enabling Grids for E-sciencE
• Grid computing has been chosen by CERN and HEP as the most
cost effective computing model
• Several other applications are already benefiting from Grid
technologies through EGEE (biomedicine is a good example)
• Collaboration across national and international programmes is
very important:
– Grids are above all about collaboration at a large scale
– Science is international and therefore requires an international
computing infrastructure
• Europe is strong in the development of Grids thanks to the
success of EGEE and related projects
• EGEE is interested in discussing possible future new
collaborations from academia or industry
INFSO-RI-508833
The EGEE Project - Owen Appleton - Sept 05
17