Transcript cochran

Analysis Model, Resources, and Commissioning
J. Cochran, ISU
Caveat: for the purpose of estimating the needed resources, an analysis model
is assumed; the AMFY* report will supercede and may alter results
Given time constraints, some details are in the back-up slides
*AMFY = Analysis Model First Year
July 16-17, 2009
SLUO LHC Workshop, SLAC
1
ATLAS Computing Model
D1PD
analysis focus
Note that user analysis on T1
not part of Computing Model
D1PD
D1PD
D1PD
may be different
for early data
July 16-17, 2009
SLUO@LHC
(i.e. ESDs
T2)Workshop, SLAC
2
ATLAS Analysis/Computing model – simpleton view
claim all analyses (data reduction) can be broken down into a series of transforms [input  output]
(i.e. AOD  D1PD, D1PD  D3PD, D3PD  plots)
Contents defined by physics group(s)
made in official production at T0;
remake as needed on T1
Streamed
ESD/AOD
thin/
skim/
slim
D1PD
Produced outside official
production on T2 and/or T3
1st
stage
anal
DnPD
root
histo/
plots
Produced on T3
ESD/AOD, D1PD, D2PD - POOL based
D3PD - flat ntuple
July 16-17, 2009
SLUO LHC Workshop, SLAC
3
Expected analysis patterns for early data
Assume bulk of group/user activity will happen on T2s/T3s
(define user accessible area of T1 as a T3af)
Assume final stage of analysis (plots) happens on T3s (T2s are not interactive)
[except for SLAC]
Two primary modes:
(1) Physics group/(user ?) runs jobs on T2s to make tailored dataset (usually D3PD)
(potential inputs: ESD,AOD,D1PD)
resultant dataset is then transferred to user’s T3 for further analysis
(2) group/user copies input files to specified T3 (potential inputs: ESD,AOD,D1PD)
On T3 group/user either generates reduced dataset for further analysis or
performs final analysis on input data set
Choice depends strongly on capabilities of T3, size of input data sets, etc.
July 16-17, 2009
SLUO LHC Workshop, SLAC
4
Resources
Tier2 simulation for 1 year
(~2012, not 2010)
horizontal axis:
fraction fully simulated
vertical axis:
fraction fast-simulated (ATLFAST II)
from T3 report
(Amir Farbin)
Assume only 20% of T2 resources available for analysis
Is this sufficient ? What resources are needed for analysis ?
Number and type of US-based analyses estimated (based on institutional polling)
Using known benchmarks (+other assumptions) compute needed storage and cpu-s
July 16-17, 2009
SLUO LHC Workshop, SLAC
5
Tier2 CPU Estimation: Results
compare needed cpu-s with available cpu-s:
kSI2k-s needed/1010
2010
all analyses independent
17
minimal cooperation
8.5
maximal cooperation
4.3
supermax cooperation
1.0
kSI2k-s available/1010
US Tier2s
13
Note that having every analysis make its own D3PDs is not our model!
We have always known that we will need to cooperate
Available Tier2 cpu should be sufficient for 2010 analyses
July 16-17, 2009
SLUO LHC Workshop, SLAC
6
Tier2 Storage Estimation: results
US Plan
Included in LCG pledge:
T1: All AOD, 20% ESD, 25% RAW
each T2: 20% AOD (and/or 20% D1PD ?)
2 copies of AODs/D1PDs (data+MC) are distributed over US T2s
1 copy of ESD (data only) distributed over US T2s (expect only for 2009-2010)
(may be able to use perfDPDs in some cases)
US currently behind on LCG pledge for storage
T2 pledge is somewhat beyond
what’s needed for 20% AOD
TB needed
2010
all analyses independent
3717
minimal cooperation
2379
maximal cooperation
613
supermax cooperation
143
We
have
July 16-17,
2009
Available for user analysis:
0 TB
17 TB if we assume only 20% ESD
no level of cooperation is sufficient here
insufficient analysis
storage
until Tier2
SLUO
LHC Workshop,
SLACdisk deficiency is resolved
7
Analysis Commissioning/Testing
T2 Analysis queues in existence and use since Fall 2008
Queues tested with robotic submission (by experts) of example user jobs
Robotic stress tests have been running on some queues since fall08
running on all queues since April 09
Robotic tests reached peak as major component of STEP09 exercise (early June 2009)
User tests more difficult to organize
US held a 4-day 3-site Jamboree/Stress Test in Sep08
- useful but not nearly extensive enough
US expert user testing included as part of STEP09 (and ongoing) - see Andy’s talk
Expanded US tests next week (including T2  T3 data transfer test)
single-day “all hands” US test in mid-August (?)
ATLAS-wide user tests now in planning stages
July 16-17, 2009
SLUO LHC Workshop, SLAC
8
Backup
July 16-17, 2009
SLUO LHC Workshop, SLAC
9
Data Formats
Format
RAW – data output from DAQ (streamed on trigger bits)
Size(kB)/evt
1600
ESD – event summary data: reco info + most RAW
500
AOD – analysis object data: summary of ESD data
150
TAG – event level metadata with pointers to data files
1
Derived Physics Data (DPDs):
D1PD – subset, refined, little brother of AOD
~25
D2PD – specific to physics (sub)group, augmented, undefined
~30
D3PD – flat roottuple
~5
perfDPD – performance DPD, calibrations, etc. (early data)
claim that all analyses can be broken down into a series of transforms [input  output]
(i.e. AOD  D1PD, D1PD  D3PD, D3PD  plots)
July 16-17, 2009
SLUO LHC Workshop, SLAC
10
Starting point: The Transforms
claim that all analyses can be broken down into a series of transforms [input  output]
(i.e. AOD  D1PD, D1PD  D3PD, D3PD  plots)
Input
Transform
Output
Skimming – removing entire events
ESD
AOD
D1PD
D2PD
D3PD
ESD
AOD
D1PD
D2PD
D3PD
plots
Slimming – removing parts of objects
Thinning – removing objects
Augmenting – costs cpu, may increase output size
Merging – concatenating files of same type
Assume (for 2009 & 2010 user analysis):
T2 activity will be ESD/PerfDPD  D3PD and AOD/D1PD  D3PD
T3 activity will be D3PD  plots
July 16-17, 2009
SLUO LHC Workshop, SLAC
basic model most
people are using now
11
Transform rates
Obtained from PanDA on FDR data – stable over expected range of file sizes (number of events)
Rates correspond to kSpecInt2k
ESD  D3PD
13 Hz
ESD  D3PD-small
30 Hz
ESD 
82 Hz
D3PD-verysmall
AOD  D3PD
14 Hz
AOD  D3PD-small
35 Hz
AOD 
91 Hz
D3PD-verysmall
D3PD  plots
Don’t yet have enough info
to know which analyses will
use standard, small, or verysmall


Assume all standard
for 2009-2010
HammerCloud tests on
AOD find ~10 Hz
Large variation
(see ATL-COM-SOFT-002.pdf)
Choose 10 Hz for both
ESD/pDPD  D3PD
AOD/D1PD  D3PD
Choose 10 kHz
July
16-17, 2009
SLUO
LHCof
Workshop,
Depending
on input/output file size and
choice
analysisSLAC
software, rate varied from 1 - 60 kHz
12