Light weight Disk Pool Manager: Status Update Jean-Philippe Baud, IT-GD-CERN Gilbert Grosdidier, LAL-IN2P3-CNRS & IT-GD-CERN October 2005 HEPiX Fall ‘05 @ SLAC.

Light weight Disk Pool Manager:
Status Update
Jean-Philippe Baud, IT-GD-CERN
Gilbert Grosdidier, LAL-IN2P3-CNRS
& IT-GD-CERN
October 2005
HEPiX Fall ‘05 @ SLAC
DPM Goals
• Provide a solution for the small Tier-2s in LCG-2
• This implies a few tens of Terabytes in 2005
• Focus on manageability
• Easy to install
• Easy to configure
• Low effort for ongoing maintenance
• Easy to add/remove resources
• Integrated security (authentication/authorization)
• Authentication a la Globus
• Meant as a replacement for a ‘Classic SE’
13/10/05
HEPiX Fall '05 - SLAC - GG
2
Architecture
• The Light Weight Disk Pool Manager consists of
• The Disk Pool Manager with its configuration and request DB
• The Disk Pool Name server with its NS/ACL/replica DB
• The stateless SRM v1 & v2 servers
• The MySQL server (if any)
• all of the above usually run on the same node
– this is not a requirement
• The GSI- & DPM-aware RFIOD and GsiFTP servers
• They have to be
– duplicated on each disk server
– installed on the master node as well if there are local data areas
Current available functionalities
• Management of disk space
• Overall physical space management
• automated garbage collection (removal of expired volatile files)
• disk space balancing (when several filesystems)
• Management of name space (including ACLs)
• Control interfaces:
• socket
• SRM v1.0 (no srmCopy method yet)
• SRM v2.1 (w/o any user/VO global space reservation yet)
• Data access protocols: secure RFIO, GsiFTP (Globus 2.4)
• Support for multiple disk servers
• support for multiple physical disk partitions for each server
• support for several pools for each server (load balancing)
• support for several space types (volatile and permanent pools)
• support for multiple replicas of a file within the disk pools
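The garbage collection and disk space balancing listed on this slide can be illustrated with a minimal sketch. It is not the DPM implementation: the `StoredFile` and `Filesystem` classes, the expiry field, and the "most free space wins" selection policy are all assumptions made for illustration.

```python
import time
from dataclasses import dataclass, field


@dataclass
class StoredFile:
    name: str
    size: int          # bytes
    volatile: bool     # True for a file held in a volatile pool
    expires_at: float  # epoch seconds; ignored for permanent files


@dataclass
class Filesystem:
    mount: str
    capacity: int      # bytes
    files: list = field(default_factory=list)

    @property
    def free(self):
        return self.capacity - sum(f.size for f in self.files)


def collect_garbage(fs, now=None):
    """Remove expired volatile files; return the number of bytes reclaimed."""
    now = time.time() if now is None else now
    kept, reclaimed = [], 0
    for f in fs.files:
        if f.volatile and f.expires_at <= now:
            reclaimed += f.size   # expired volatile file: drop it
        else:
            kept.append(f)        # permanent or still valid: keep it
    fs.files = kept
    return reclaimed


def pick_filesystem(filesystems, size):
    """Assumed balancing policy: the fitting filesystem with the most free space."""
    candidates = [fs for fs in filesystems if fs.free >= size]
    if not candidates:
        raise OSError("no filesystem has enough free space")
    return max(candidates, key=lambda fs: fs.free)
```

Permanent files are never touched by `collect_garbage`, which matches the split between volatile and permanent pools described above.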
First experience with DPM (Tier2s)
• 18 sites in early Sept. ‘05
• 3 ways to install the DPM: manual, Yaim, Quattor (LAL)
• Sites using Yaim had almost no problems installing
• all servers on the same node
• or disk servers segregated from DPM servers
• Sites doing a manual installation (for example gLite sites) had quite a few problems in the security area
• the DPM install must come on top of the Globus security step
• The overall feeling is that the DPM is very stable
Service Challenge 3
• NDGF (Denmark/Norway/Sweden) used a distributed Disk Pool with a single “head” node
• Very stable
• Constant data rate (150 MB/s)
• Good resiliency observed after an accidental network fiber cut
• Required several DPM restarts because of wrong free space reported (no reboot -> short disruption)
• Bad disk nodes not blacklisted for all error conditions (needs integration with monitoring), but FTS could also disable filesystems in case of errors
Missing functionalities
(SC3 feedback)
• Sysadmin level requests
• Support for filesystems larger than 2TB (UK)
• Support for disk servers spread over multiple subnets
• Support for TCP port range used by RFIO v3 (firewall issue)
• Needs VOMS integration
• User level requests
• Not compatible with dCache srmcp (Phedex)
• available space reported was wrong
• Insufficient documentation about ACLs
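The firewall request above amounts to confining a server's listening ports to a configured range. The sketch below shows the general idea with plain sockets; it is not RFIO code, and the function name, the loopback host, and the port numbers are illustrative assumptions.

```python
import socket


def bind_in_range(low, high, host="127.0.0.1"):
    """Bind a listening TCP socket to the first free port in [low, high]."""
    for port in range(low, high + 1):
        sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
        try:
            sock.bind((host, port))
        except OSError:
            sock.close()      # port already taken: try the next one
            continue
        sock.listen()
        return sock, port
    raise OSError(f"no free port between {low} and {high}")
```

A firewall then only needs to open the configured range rather than all ephemeral ports.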
Current Status
• DPNS, DPM, SRM v1 & v2 (w/o Copy nor global space reservation) have been heavily tested
• The secure versions of RFIO and GsiFTP interfaced to the DPM are now available
• they have been thoroughly tested as well
• DPM first public release was part of LCG 2.5.0 in May
• mainly for testing, since not all functionalities were available
• DPM 1.3.8 RPMs are already available
• 4 flavors are now built
• SL3 with MySQL or Oracle backend
• same with RH73
• DPM 1.4.x is just next door
• to be distributed along with LCG 2.7.0 very soon
• will include : VOMS, virtual IDs, srmCopy
Documentation/Packaging/Installation
• Reference man pages are provided for each method and server
• Installation guide is also ready
• more use cases merged when needs arise
• see the Twiki page:
• https://uimon.cern.ch/twiki/bin/view/LCG/DpmAdminGuide
• The DPM is available as a tarball or as a set of RPMs (e.g., for the MySQL backend):
• DPM-server-mysql-1.3.8-1sec_sl3
• DPM-name-server-mysql-1.3.8-1sec_sl3
• DPM-srm-server-mysql-1.3.8-1sec_sl3
• DPM-rfio-server-1.3.8-2sec_sl3
• DPM-gsiftp-server-1.3.8-1sec_sl3
• DPM-client-1.3.8-1sec_sl3
• lcg-dm-common-1.3.8-1_sl3
• Installation scripts (Yaim, Quattor) are also available
• A support discussion list available as well
• [email protected]
Disk Pool Manager short term plan
• Bugfix release for incorrect free space reported (1.3.8)
• Support for large filesystems (> 2 TB) (1.3.8)
• Procedure to convert a Classic SE into a DPM (1.3.8)
• Integration with VOMS (in progress)
• DNs will be mapped to virtual UIDs: the virtual uid is created on the fly the first time the system receives a request for this DN (no pool account)
• VOMS roles will be mapped to virtual GIDs
• A given user may have one UID and several GIDs
• Integration with CSEC and CGSI
• Administrative tools to update the DB mapping table
• Propagation of permissions to Storage Elements
• We need a “Consistency Server” or RRS
• srmCopy (in progress)
• “du” command (in progress)
• RFIO_PORT_RANGE (in progress)
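The virtual ID scheme described in this plan can be sketched as a small mapping table. This is only an illustration under stated assumptions: the `VirtualIdMapper` class is a hypothetical in-memory stand-in for the DB mapping table, the starting ID is arbitrary, and creating GIDs on first use is assumed by analogy with UIDs (the slide only states it for UIDs).

```python
class VirtualIdMapper:
    """Hypothetical in-memory stand-in for the DB mapping table."""

    def __init__(self, first_id=101):
        self._uids = {}            # DN -> virtual UID
        self._gids = {}            # VOMS group/role -> virtual GID
        self._next_uid = first_id
        self._next_gid = first_id

    def uid_for(self, dn):
        """Return the virtual UID for a DN, creating it on first request."""
        if dn not in self._uids:
            self._uids[dn] = self._next_uid
            self._next_uid += 1
        return self._uids[dn]

    def gids_for(self, roles):
        """Return one virtual GID per VOMS role (assumed created on first use)."""
        gids = []
        for role in roles:
            if role not in self._gids:
                self._gids[role] = self._next_gid
                self._next_gid += 1
            gids.append(self._gids[role])
        return gids

    def identify(self, dn, roles):
        """A user ends up with one UID and possibly several GIDs."""
        return self.uid_for(dn), self.gids_for(roles)
```

Because the UID is allocated when a DN is first seen, no pool account has to be provisioned in advance, which is the point made on the slide.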
DPM medium term plan (end ‘05)
• DPM with disk servers on several network domains (distributed Tier2)
• Integration with Information System - BDII (DESY)
• already partly available in the simplest case
• MySQL backups
• DPM DSI plugin for Globus 4 gridFTP server
• Drain of a pool, a server or a filesystem
• Integration with fabric monitoring
• Limit number of streams per disk server (may be useful for some applications like bulk replication, so pool dependent)
• Support for ROOTD/XROOTD transfer protocol
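The planned limit on streams per disk server could be approached with one bounded semaphore per server. The sketch below is an assumption about how such a cap might work, not a description of the DPM design; `StreamLimiter` and the server names are hypothetical.

```python
import threading


class StreamLimiter:
    """Caps concurrent transfer streams per disk server (hypothetical helper)."""

    def __init__(self, max_streams_per_server):
        self._max = max_streams_per_server
        self._lock = threading.Lock()
        self._semaphores = {}   # server name -> BoundedSemaphore

    def _semaphore(self, server):
        # Create the per-server semaphore lazily, under a lock.
        with self._lock:
            if server not in self._semaphores:
                self._semaphores[server] = threading.BoundedSemaphore(self._max)
            return self._semaphores[server]

    def try_acquire(self, server):
        """Return True if a new stream may start on this server."""
        return self._semaphore(server).acquire(blocking=False)

    def release(self, server):
        """Signal that a stream on this server has finished."""
        self._semaphore(server).release()
```

Making the limit pool dependent, as the slide suggests, would mean keeping one `StreamLimiter` per pool with its own maximum.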
DPM longer term plan (first half ‘06)
• RFIO client library compatible with CASTOR
• very urgent in fact
• Quotas (INFN/DESY)
• (automatic) replication inside of a pool
• Global space reservation with max lifetime
• required by experiments
• Streaming mode (SRM v3)
• implementation of a migrator/recaller to
• either recall/migrate files automatically between Tier1 and
Tier2s
• or interface to a tape/DVD backend
DPM Insiders
• Many ways to manage a given DPM user file
• to transfer a file (back and forth) from local to SE
• thru the socket interface
– involves NS, DPM and RFIO/GsiFTP servers
• thru the SRM v1/v2 interfaces
– involves only the relevant SRM server on top of the 4 above
• thru direct RFIO/GsiFTP commands
– involves only the relevant server on top of NS & DPM ones
– implies special syntax for these commands
• thru lcg-utils commands (e.g. lcg-cr, lcg-gt, …)
– involves SRM v1 and GsiFTP servers only, on top of NS & DPM
• to remotely access a DPM SE file
• thru GFAL C API (POSIX interface, e.g. gfal_open, gfal_read, …)
– involves SRM v1 and RFIO servers only, on top of NS & DPM
• when running on a WN, I strongly advise this direct access type
since then the file does not require any local space
DPM Insiders - more details
• All these modes are interoperable
• DPNS/DPM commands allow for direct access to the NS
catalog
• Once again, all servers are authenticated
• RFIO & GsiFTP need to be installed only on disk servers
• increased security, increased stability
– physical files are accessible (RW) only to the dpmmgr account
• same RFIO & GsiFTP client commands are still able to
manage ordinary files on other servers
• with authentication
• Castor & DPM access is still possible from the same client node through the LCG_RFIO_TYPE environment variable (from within GFAL only)
Test Cycle
• Tests were run from very early in the development cycle and were improved as soon as new functionalities were merged into the servers
• Test results were immediately taken into account for code development
• Test suites for SRM v2 & DPM socket interface and RFIO written by Gilbert Grosdidier (Orsay)
• Suite for SRM v1 written initially by Jiri Kosina (Prague)
• Each suite includes up to 120 sub-tests
• A global suite merging all of the above now available (GG)
– includes now lcg-util and Gfal tests as well
• Stress testing (50-100 such suites running in parallel) to assess robustness and performance
• Everything was tested on RH73 and SLC3 for both Oracle and MySQL backends
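The stress-testing approach above, many suite instances running in parallel, can be sketched with a thread pool. This is not the actual test harness: `placeholder_suite` is a hypothetical stand-in for one suite run, and a real run would invoke the SRM, socket, and RFIO tests instead.

```python
from concurrent.futures import ThreadPoolExecutor


def placeholder_suite(instance):
    """Hypothetical stand-in for one suite run; returns (instance, passed)."""
    return instance, True


def stress(run_suite, instances=50):
    """Run `instances` copies of a suite in parallel; return the failing ones."""
    with ThreadPoolExecutor(max_workers=instances) as pool:
        # map() yields results in submission order, one per instance.
        results = list(pool.map(run_suite, range(instances)))
    return [instance for instance, passed in results if not passed]
```

Calling `stress(placeholder_suite, instances=100)` corresponds to the upper end of the 50-100 range quoted on the slide.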
Testing (2)
• Took about 50% of the global effort
• together with install phase setup
• This was required
• to assess robustness to failure conditions
• many failure use cases are inserted in the suites
• to review carefully error message handling
• but nothing is perfect
• These suites are now routinely used to check new installs
• esp. access permission & authentication cross-checks
• specifically, the global suite merging all possible access modes is very well suited to this
• Many ill-coded client requests leading to SRM server
crashes have been fixed in the last release
• the gSOAP interface is weakly protected from these
Plans for gLite production service
• Provide a plausible solution for small Tier-2s
• Migration of a ‘Classic SE’ towards a DPM SE
• Only metadata operations needed (the data does not need to be copied)
• Satisfies gLite requirement for SRM interface at small sites
• Several test installs already achieved at CERN
Current sites using DPM
• NDGF (Norway-Denmark-Sweden)
• Glasgow (UK)
• Edinburgh (UK)
• QMUL (UK)
• RAL (UK)
• INFN-Catania (Italy)
• INFN-Legnaro (Italy)
• INFN-Bari (Italy)
• INFN-Padova (Italy)
• CERN (Switzerland)
• CSCS (Switzerland)
• LAL-DAPNIA-LPNHE (France)
• NIKHEF (Netherlands)
• IFCA (Spain)
• BNL (USA)
• HIP (Finland)
• TW-NCUHEP (Taiwan)
• ASCC (Taiwan)