Transcript cochran
Analysis Model, Resources, and Commissioning J. Cochran, ISU Caveat: for the purpose of estimating the needed resources, an analysis model is assumed; the AMFY* report will supercede and may alter results Given time constraints, some details are in the back-up slides *AMFY = Analysis Model First Year July 16-17, 2009 SLUO LHC Workshop, SLAC 1 ATLAS Computing Model D1PD analysis focus Note that user analysis on T1 not part of Computing Model D1PD D1PD D1PD may be different for early data July 16-17, 2009 SLUO@LHC (i.e. ESDs T2)Workshop, SLAC 2 ATLAS Analysis/Computing model – simpleton view claim all analyses (data reduction) can be broken down into a series of transforms [input output] (i.e. AOD D1PD, D1PD D3PD, D3PD plots) Contents defined by physics group(s) made in official production at T0; remake as needed on T1 Streamed ESD/AOD thin/ skim/ slim D1PD Produced outside official production on T2 and/or T3 1st stage anal DnPD root histo/ plots Produced on T3 ESD/AOD, D1PD, D2PD - POOL based D3PD - flat ntuple July 16-17, 2009 SLUO LHC Workshop, SLAC 3 Expected analysis patterns for early data Assume bulk of group/user activity will happen on T2s/T3s (define user accessible area of T1 as a T3af) Assume final stage of analysis (plots) happens on T3s (T2s are not interactive) [except for SLAC] Two primary modes: (1) Physics group/(user ?) runs jobs on T2s to make tailored dataset (usually D3PD) (potential inputs: ESD,AOD,D1PD) resultant dataset is then transferred to user’s T3 for further analysis (2) group/user copies input files to specified T3 (potential inputs: ESD,AOD,D1PD) On T3 group/user either generates reduced dataset for further analysis or performs final analysis on input data set Choice depends strongly on capabilities of T3, size of input data sets, etc. July 16-17, 2009 SLUO LHC Workshop, SLAC 4 Resources Tier2 simulation for 1 year (~2012, not 2010) horizontal axis: fraction fully simulated vertical axis: fraction fast-simulated (ATLFAST II) from T3 report (Amir Farbin) Assume only 20% of T2 resources available for analysis Is this sufficient ? What resources are needed for analysis ? Number and type of US-based analyses estimated (based on institutional polling) Using known benchmarks (+other assumptions) compute needed storage and cpu-s July 16-17, 2009 SLUO LHC Workshop, SLAC 5 Tier2 CPU Estimation: Results compare needed cpu-s with available cpu-s: kSI2k-s needed/1010 2010 all analyses independent 17 minimal cooperation 8.5 maximal cooperation 4.3 supermax cooperation 1.0 kSI2k-s available/1010 US Tier2s 13 Note that having every analysis make its own D3PDs is not our model! We have always known that we will need to cooperate Available Tier2 cpu should be sufficient for 2010 analyses July 16-17, 2009 SLUO LHC Workshop, SLAC 6 Tier2 Storage Estimation: results US Plan Included in LCG pledge: T1: All AOD, 20% ESD, 25% RAW each T2: 20% AOD (and/or 20% D1PD ?) 2 copies of AODs/D1PDs (data+MC) are distributed over US T2s 1 copy of ESD (data only) distributed over US T2s (expect only for 2009-2010) (may be able to use perfDPDs in some cases) US currently behind on LCG pledge for storage T2 pledge is somewhat beyond what’s needed for 20% AOD TB needed 2010 all analyses independent 3717 minimal cooperation 2379 maximal cooperation 613 supermax cooperation 143 We have July 16-17, 2009 Available for user analysis: 0 TB 17 TB if we assume only 20% ESD no level of cooperation is sufficient here insufficient analysis storage until Tier2 SLUO LHC Workshop, SLACdisk deficiency is resolved 7 Analysis Commissioning/Testing T2 Analysis queues in existence and use since Fall 2008 Queues tested with robotic submission (by experts) of example user jobs Robotic stress tests have been running on some queues since fall08 running on all queues since April 09 Robotic tests reached peak as major component of STEP09 exercise (early June 2009) User tests more difficult to organize US held a 4-day 3-site Jamboree/Stress Test in Sep08 - useful but not nearly extensive enough US expert user testing included as part of STEP09 (and ongoing) - see Andy’s talk Expanded US tests next week (including T2 T3 data transfer test) single-day “all hands” US test in mid-August (?) ATLAS-wide user tests now in planning stages July 16-17, 2009 SLUO LHC Workshop, SLAC 8 Backup July 16-17, 2009 SLUO LHC Workshop, SLAC 9 Data Formats Format RAW – data output from DAQ (streamed on trigger bits) Size(kB)/evt 1600 ESD – event summary data: reco info + most RAW 500 AOD – analysis object data: summary of ESD data 150 TAG – event level metadata with pointers to data files 1 Derived Physics Data (DPDs): D1PD – subset, refined, little brother of AOD ~25 D2PD – specific to physics (sub)group, augmented, undefined ~30 D3PD – flat roottuple ~5 perfDPD – performance DPD, calibrations, etc. (early data) claim that all analyses can be broken down into a series of transforms [input output] (i.e. AOD D1PD, D1PD D3PD, D3PD plots) July 16-17, 2009 SLUO LHC Workshop, SLAC 10 Starting point: The Transforms claim that all analyses can be broken down into a series of transforms [input output] (i.e. AOD D1PD, D1PD D3PD, D3PD plots) Input Transform Output Skimming – removing entire events ESD AOD D1PD D2PD D3PD ESD AOD D1PD D2PD D3PD plots Slimming – removing parts of objects Thinning – removing objects Augmenting – costs cpu, may increase output size Merging – concatenating files of same type Assume (for 2009 & 2010 user analysis): T2 activity will be ESD/PerfDPD D3PD and AOD/D1PD D3PD T3 activity will be D3PD plots July 16-17, 2009 SLUO LHC Workshop, SLAC basic model most people are using now 11 Transform rates Obtained from PanDA on FDR data – stable over expected range of file sizes (number of events) Rates correspond to kSpecInt2k ESD D3PD 13 Hz ESD D3PD-small 30 Hz ESD 82 Hz D3PD-verysmall AOD D3PD 14 Hz AOD D3PD-small 35 Hz AOD 91 Hz D3PD-verysmall D3PD plots Don’t yet have enough info to know which analyses will use standard, small, or verysmall Assume all standard for 2009-2010 HammerCloud tests on AOD find ~10 Hz Large variation (see ATL-COM-SOFT-002.pdf) Choose 10 Hz for both ESD/pDPD D3PD AOD/D1PD D3PD Choose 10 kHz July 16-17, 2009 SLUO LHCof Workshop, Depending on input/output file size and choice analysisSLAC software, rate varied from 1 - 60 kHz 12