Forschungszentrum Karlsruhe in der Helmholtz-Gemeinschaft Installing dCache into an existing Storage environment at GridKa Forschungszentrum Karlsruhe GmbH Institute for Scientific Computing P.O.

Download Report

Transcript Forschungszentrum Karlsruhe in der Helmholtz-Gemeinschaft Installing dCache into an existing Storage environment at GridKa Forschungszentrum Karlsruhe GmbH Institute for Scientific Computing P.O.

Forschungszentrum Karlsruhe
in der Helmholtz-Gemeinschaft
Installing dCache into an existing Storage
environment at GridKa
Forschungszentrum Karlsruhe GmbH
Institute for Scientific Computing
P.O. Box 3640
D-76021 Karlsruhe, Germany
Dr. Doris Ressmann
[email protected]
http://www.gridka.de
GridKa May 2004
Forschungszentrum Karlsruhe
Forschungszentrum Karlsruhe
in der Helmholtz-Gemeinschaft
•Grid Computing Centre Karlsruhe
•GridKa
GridKa May 2004
Forschungszentrum Karlsruhe
in der Helmholtz-Gemeinschaft
GridKa planned hardware resources
8000
4000
CPU
Disk
Tape
780 CPUs
160 TB disk
300 TB tape
3000
2000
2000
1000
0
0
2002 2003 2004 2005 2006 2007 2008 2009
GridKa May 2004
kSI95
Tbyte
6000
4000
Forschungszentrum Karlsruhe
in der Helmholtz-Gemeinschaft
Tivoli Storage Manager (TSM)
• TSM library management
• TSM is not developed for archive
Interruption of TSM archive
No control what has been archived
• dCache (DESY, FNAL)
creates a separate session for every file
Transparent access
Allows transparent maintenance at TSM
GridKa May 2004
Forschungszentrum Karlsruhe
in der Helmholtz-Gemeinschaft
dCache main components
compute nodes
mountpoint
gridftp
file transfer
head node
TSM
with
tapes
pools
file transfer
GridKa May 2004
Forschungszentrum Karlsruhe
in der Helmholtz-Gemeinschaft
PNFS
Perfectly Normal File System
pool and tape
pnfs
real data
0000000000000000000014F0
000000000000000000001510
0000000000000000000015A0
0000000000000000000017E8
000000000000000000001858
GridKa May 2004
database for
filenames
metadata
Forschungszentrum Karlsruhe
in der Helmholtz-Gemeinschaft
dCache interface
•
dCache Access Protocol (dcap)
●
compute node: dccp <source file> <pnfs mountpoint>
●
connection to head node
●
return available pool node
●
copy direct into available pool node
●
dc_open(...);
●
dc_read(...);
●
pool: data is precious (can't be deleted)
●
flush into tsm
●
data is cached (can be deleted from pool)
●
compute node: dccp <pnfs mountpoint> <destination file>
●
if not in pool the data will be taken from tsm
GridKa May 2004
Forschungszentrum Karlsruhe
in der Helmholtz-Gemeinschaft
Tivoli Storage Manager (tsm)
GridKa May 2004
Forschungszentrum Karlsruhe
in der Helmholtz-Gemeinschaft
dCache pool node
20 GB
1h
800 GB
GridKa May 2004
Forschungszentrum Karlsruhe
in der Helmholtz-Gemeinschaft
Tivoli Storage Manager (tsm)
after dCache tuning
GridKa May 2004
Forschungszentrum Karlsruhe
in der Helmholtz-Gemeinschaft
Test Environment
Problematic Hardware
• RAID controller 3WARE with 1.6 TB
– Always Degraded mode
– Rebuilding
70 kB/s or 10 MB/s
– Lost data
GridKa May 2004
Forschungszentrum Karlsruhe
in der Helmholtz-Gemeinschaft
TSM properties
• TSM disk cache overflow
Allocation of tape drives (max 2)
Adapt server properties for specific
dCache requirements
Management Class (retention time)
Copy groups
GridKa May 2004
Forschungszentrum Karlsruhe
in der Helmholtz-Gemeinschaft
Conclusion and Future Work
• More reliable hardware
especially for write pools
• Several TSM server
• SRM and LCG connection
• Pools on parallel File system
GPFS
GridKa May 2004
Forschungszentrum Karlsruhe
in der Helmholtz-Gemeinschaft
GridKa May 2004