Forschungszentrum Karlsruhe in der Helmholtz-Gemeinschaft

Installing dCache into an existing Storage environment at GridKa

Forschungszentrum Karlsruhe GmbH
Institute for Scientific Computing
P.O. Box 3640
D-76021 Karlsruhe, Germany

Dr. Doris Ressmann
[email protected]
http://www.gridka.de

GridKa May 2004


Forschungszentrum Karlsruhe

• Grid Computing Centre Karlsruhe
• GridKa


GridKa planned hardware resources

[Figure: planned CPU (kSI95), disk and tape (Tbyte) resources, 2002 to 2009. Annotated values: 780 CPUs, 160 TB disk, 300 TB tape]


Tivoli Storage Manager (TSM)

• TSM library management
• TSM was not developed for archiving
  – Interruption of TSM archive
  – No control over what has been archived
• dCache (DESY, FNAL)
  – creates a separate session for every file
  – Transparent access
  – Allows transparent maintenance at TSM


dCache main components

[Diagram labels: compute nodes, mountpoint, gridftp, file transfer, head node, pools, TSM with tapes]


PNFS
Perfectly Normal File System

[Diagram: pnfs holds the database for filenames and metadata; the real data is on pool and tape, stored under IDs such as:]

0000000000000000000014F0
000000000000000000001510
0000000000000000000015A0
0000000000000000000017E8
000000000000000000001858


dCache interface

• dCache Access Protocol (dcap)
  ● compute node: dccp <source file> <pnfs mountpoint>
  ● connection to head node
  ● return available pool node
  ● copy directly into available pool node
  ● dc_open(...);
  ● dc_read(...);
  ● pool: data is precious (can't be deleted)
  ● flush into TSM
  ● data is cached (can be deleted from pool)
  ● compute node: dccp <pnfs mountpoint> <destination file>
  ● if not in pool, the data will be taken from TSM


Tivoli Storage Manager (TSM)

[Figure]


dCache pool node

[Figure; labels: 20 GB, 1h, 800 GB]


Tivoli Storage Manager (TSM) after dCache tuning

[Figure]


Test Environment: Problematic Hardware

• RAID controller 3WARE with 1.6 TB
  – Always Degraded mode
  – Rebuilding 70 kB/s or 10 MB/s
  – Lost data


TSM properties

• TSM disk cache overflow
  – Allocation of tape drives (max 2)
  – Adapt server properties for specific dCache requirements
  – Management Class (retention time)
  – Copy groups


Conclusion and Future Work

• More reliable hardware, especially for write pools
• Several TSM servers
• SRM and LCG connection
• Pools on parallel file system GPFS
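
The file life cycle listed on the "dCache interface" slide (write to a pool, precious until flushed to TSM, cached afterwards, restored from TSM when missing from the pool) can be illustrated with a small toy model. This is only a sketch under stated assumptions: `ToyStore`, `State` and all method names are hypothetical and are not part of dCache, dcap or TSM, and a single in-memory pool stands in for the head node, the pool nodes and the tape backend.

```python
from dataclasses import dataclass, field
from enum import Enum


class State(Enum):
    PRECIOUS = "precious"  # only copy is on the pool; must not be deleted
    CACHED = "cached"      # a copy exists on tape; pool copy may be deleted


@dataclass
class ToyStore:
    """Toy model of one pool in front of a tape backend (not dCache code)."""
    pool: dict = field(default_factory=dict)  # name -> (data, State)
    tape: dict = field(default_factory=dict)  # name -> data

    def write(self, name, data):
        # A new file lands on the pool first and is precious until flushed.
        self.pool[name] = (data, State.PRECIOUS)

    def flush(self, name):
        # Copy the file to tape; the pool copy then becomes cached.
        data, _ = self.pool[name]
        self.tape[name] = data
        self.pool[name] = (data, State.CACHED)

    def evict(self, name):
        # Only cached files may be removed from the pool.
        _, state = self.pool[name]
        if state is State.PRECIOUS:
            raise PermissionError(f"{name} is precious and not yet on tape")
        del self.pool[name]

    def read(self, name):
        # Serve from the pool if present, otherwise restore from tape first.
        if name not in self.pool:
            self.pool[name] = (self.tape[name], State.CACHED)
        return self.pool[name][0]


if __name__ == "__main__":
    store = ToyStore()
    store.write("run1.dat", b"events")  # file is now precious
    store.flush("run1.dat")             # file is now cached
    store.evict("run1.dat")             # allowed, a tape copy exists
    print(store.read("run1.dat"))       # restored from tape into the pool
```

In this model, calling `evict` before `flush` raises an error, which mirrors the slide's statement that precious data cannot be deleted from the pool.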