Forschungszentrum Karlsruhe
in der Helmholtz-Gemeinschaft
Installing dCache into an existing Storage
environment at GridKa
Forschungszentrum Karlsruhe GmbH
Institute for Scientific Computing
P.O. Box 3640
D-76021 Karlsruhe, Germany
Dr. Doris Ressmann
[email protected]
http://www.gridka.de
GridKa May 2004
• Grid Computing Centre Karlsruhe
• GridKa
GridKa planned hardware resources
[Chart: planned CPU (kSI95), disk and tape (Tbyte) capacity per year, 2002 to 2009. Values annotated on the chart: 780 CPUs, 160 TB disk, 300 TB tape.]
Tivoli Storage Manager (TSM)
• TSM library management
• TSM was not developed for archiving
  – Interruption of TSM archive
  – No control over what has been archived
• dCache (DESY, FNAL)
  – Creates a separate session for every file
  – Transparent access
  – Allows transparent maintenance of TSM
dCache main components
[Diagram: compute nodes (mountpoint, gridftp), head node, pools, and TSM with tapes, connected by file transfers.]
PNFS
Perfectly Normal File System
• pnfs: database for filenames and metadata
• pool and tape: real data
• IDs shown in the diagram:
  0000000000000000000014F0
  000000000000000000001510
  0000000000000000000015A0
  0000000000000000000017E8
  000000000000000000001858
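The split between namespace and data can be illustrated with a small Python sketch. It is a toy model, not dCache or pnfs code: the class `Namespace`, the path `/pnfs/example/run1.root`, and the counter-based ID generator are all hypothetical. It also assumes that the 24-digit hexadecimal IDs in the diagram are the keys under which the real data is stored on pool and tape.

```python
import itertools


class Namespace:
    """Toy stand-in for pnfs: holds file names and metadata, never the data."""

    def __init__(self, first_id=0x14F0):
        # Hypothetical ID generator; real pnfs assigns IDs internally.
        self._next_id = itertools.count(first_id)
        self._ids = {}       # path -> ID
        self._metadata = {}  # ID -> metadata dict

    def create(self, path, size):
        # Format the counter as 24 upper-case hex digits, like the slide's IDs.
        file_id = format(next(self._next_id), "024X")
        self._ids[path] = file_id
        self._metadata[file_id] = {"size": size}
        return file_id

    def lookup(self, path):
        return self._ids[path]


namespace = Namespace()
pool = {}  # ID -> file contents; the real data lives here, not in the namespace

payload = b"event data"
file_id = namespace.create("/pnfs/example/run1.root", len(payload))
pool[file_id] = payload

print(file_id)
print(pool[namespace.lookup("/pnfs/example/run1.root")])
```

In this model a client only ever sees the path; the ID is what ties the namespace entry to the bytes held elsewhere.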
dCache interface
• dCache Access Protocol (dcap)
  ● compute node: dccp <source file> <pnfs mountpoint>
  ● connection to head node
  ● head node returns an available pool node
  ● data is copied directly into the available pool node
  ● dc_open(...);
  ● dc_read(...);
  ● pool: data is precious (cannot be deleted)
  ● flush into TSM
  ● data is cached (can be deleted from pool)
  ● compute node: dccp <pnfs mountpoint> <destination file>
  ● if the data is not in a pool, it is fetched from TSM
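The file states listed above (precious, flushed, cached, restored from TSM) can be sketched as a small state model in Python. This is an illustration under stated assumptions, not dCache code: the names `State`, `Pool`, `write`, `flush`, `evict`, and `read` are hypothetical, a plain dict stands in for TSM, and pool selection by the head node is left out.

```python
from enum import Enum


class State(Enum):
    PRECIOUS = "precious"  # only copy is on the pool; must not be deleted
    CACHED = "cached"      # a copy exists on tape; the pool copy may be deleted


class Pool:
    """Toy pool node; `tape` is a dict standing in for TSM."""

    def __init__(self, tape):
        self.tape = tape
        self.files = {}  # name -> (State, data)

    def write(self, name, data):
        # Newly written data exists only on the pool, so it is precious.
        self.files[name] = (State.PRECIOUS, data)

    def flush(self):
        # Copy every precious file to tape, then mark it cached.
        for name, (state, data) in list(self.files.items()):
            if state is State.PRECIOUS:
                self.tape[name] = data
                self.files[name] = (State.CACHED, data)

    def evict(self, name):
        state, _ = self.files[name]
        if state is State.PRECIOUS:
            raise RuntimeError(f"{name} is precious and cannot be deleted")
        del self.files[name]

    def read(self, name):
        if name not in self.files:
            # Not on the pool: restore from tape; the restored copy is cached.
            self.files[name] = (State.CACHED, self.tape[name])
        return self.files[name][1]


tape = {}
pool = Pool(tape)
pool.write("run1.root", b"event data")

try:
    pool.evict("run1.root")  # refused: the file has not been flushed yet
    evicted_early = True
except RuntimeError:
    evicted_early = False

pool.flush()                 # precious -> cached, copy now on tape
pool.evict("run1.root")      # allowed once the file is cached
restored = pool.read("run1.root")  # pool miss, data comes back from tape
```

The point of the two states is that pool space can be reclaimed freely for cached files, while precious files pin their space until the flush to tape has happened.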
Tivoli Storage Manager (TSM)
dCache pool node
[Diagram values: 20 GB, 1 h, 800 GB]
Tivoli Storage Manager (TSM) after dCache tuning
Test Environment
Problematic Hardware
• RAID controller 3WARE with 1.6 TB
  – Always in degraded mode
  – Rebuilding: 70 kB/s or 10 MB/s
  – Lost data
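To see what the two rebuild rates mean in practice, the following Python sketch estimates the rebuild time. It assumes, for illustration only, that a rebuild has to process the full 1.6 TB at a constant rate and that the units are decimal (1 TB = 10^12 bytes); the function name `rebuild_seconds` is hypothetical.

```python
ARRAY_BYTES = 1.6e12  # 1.6 TB in decimal units (assumption)


def rebuild_seconds(rate_bytes_per_s, size_bytes=ARRAY_BYTES):
    """Time to process the whole array at a constant rebuild rate."""
    return size_bytes / rate_bytes_per_s


slow = rebuild_seconds(70e3)  # 70 kB/s
fast = rebuild_seconds(10e6)  # 10 MB/s

print(f"70 kB/s: {slow / 86400:.0f} days")
print(f"10 MB/s: {fast / 3600:.1f} hours")
```

Under these assumptions the slow rate corresponds to roughly 265 days and the fast rate to about 44 hours, which illustrates why an array rebuilding at 70 kB/s is effectively always degraded.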
TSM properties
• TSM disk cache overflow
• Allocation of tape drives (max 2)
• Adapt server properties for specific dCache requirements
  – Management Class (retention time)
  – Copy groups
Conclusion and Future Work
• More reliable hardware, especially for write pools
• Several TSM servers
• SRM and LCG connection
• Pools on a parallel file system (GPFS)