Transcript John Miller
Pitfalls of Project Estimating: An Applied Lesson from Dr. Deming’s Funnel Experiment John Miller, PMP, CLSSBB 4/28/10 Contents • • • • • • • • • • • Introduction Dr. Deming’s Funnel Experiment What is a Process? Statistical Measures Estimating Methods Concepts Averages and Standard Deviation Data Points to Data Trends Control Charts Lessons from the Funnel Experiment Using Excel to Avoid Common Mistakes J. Miller, 4/28/10 Introduction Parametric and evidenced-based estimating tools have become very sophisticated. Many essential benefits of these tools can inexpensively be provided using Excel, but project managers need to be aware of risks associated with improperly adjusting estimating metrics. Dr. Deming’s Funnel experiment serve’s as a model for improperly adjusting estimating metrics. A simple example will demonstrate how statistically valid estimating methods can be duplicated in Excel. Dr. Deming’s Funnel Experiment “A experiment that demonstrates the effects of tampering [with a process]. Marbles are dropped through a funnel in an attempt to hit a flat-surfaced target below. The experiment shows that adjusting a stable process to compensate for an unstable result or an extraordinarily good result will produce output that is worse than if the process had been left alone.” Donna C. Summers, Quality Management 2nd Edition, p 546 J. Miller, 4/28/10 DR. Deming’s Funnel Experiment Funnel Comparison to Project Estimating Funnel = Estimating process Target = Goal -- project time goal or cost goal or resource goal Marble = Actual project or estimate that creates measurable output – cost, schedule, etc. Each marble drop = actual project or part of project Moving the Funnel = Adjusting the project resources after each project to try to meet the target on the next project. (Ex. Adding people or tools, if schedules are being missed, removing resources if budgets are being missed) Moving the Target = Adjusting the time or cost estimate estimating process after each project based on the results from the last project. J. Miller, 4/28/10 DR. Deming’s Funnel Experiment Funnel Comparison to Project Estimating Each of the four rules is discussed below. Rule 1: Leave the funnel fixed over the target. Why don't we simply adjust the funnel after each drop so the next drop will be closer to the target? Don’t change the estimating process, accept the variation, the difference between the estimate and actual projects. Rule 2: For every drop, the marble will come to rest a distance "z" from the target. Rule 2 is to move the funnel a distance -z from its last position. Move the funnel based on the funnels last position. Chang the estimating process (metrics) after each project or difference between estimate and actual. Rule 3: Move the funnel a distance -z from the target after each drop of the marble that ends up a distance z from the target. Note that Rule 2 moves the funnel based on the funnel's last position. Rule 3 moves the funnel a distance from the target. Rule 4: Rule 4 is simply to set the funnel over where the last drop came to rest. J. Miller, 4/28/10 What is a Process? • A series of steps or actions that convert input to output. • Project estimating is a process. • Processes are described by statistical measures – numbers that describe “groups”. • Groups are made of data points. • What do we need to better understand this group of data points? Centering – Mean Spread - Stand Deviation Shape – Normality, data distribution (assume all distributions here are Normal) • We need data. J. Miller, 4/28/10 Statistical Measures • Dimensional attributes • Product or Process metrics such as: # or % defects % on time deliveries Efficiency (productivity, ratios) # units of output / unit of time (may also include inputs) Time / # units of output xx units / hr Turn Around Time (TAT), xx Points / unit Cost/Unit Which can be averages? J. Miller, 4/28/10 Estimating Methods PMBOK 3rd 6.4.2 Activity Duration Estimating: Tools and Techniques .1 Expert Judgment Activity durations are often difficult to estimate because of the number of factors that can influence them, such as resource levels or resource productivity. Expert judgment, guided by historical information, can be used whenever possible. The individual project team members may also provide duration estimate information or recommended maximum activity durations from prior similar projects. If such expertise is not available, the duration estimates are more uncertain and risky. .2 Analogous Estimating Analogous duration estimating means using the actual duration of a previous, similar schedule activity as the basis for estimating the duration of a future schedule activity. It is frequently used to estimate project duration when there is a limited amount of detailed information about the project for example, in the early phases of a project. Analogous estimating uses historical information (Section 4.1.1.4) and expert judgment. Analogous duration estimating is most reliable when the previous activities are similar in fact and not just in appearance, and the project team members preparing the estimates have the needed expertise. J. Miller, 4/28/10 Estimating Methods PMBOK 3rd 6.4.2 Activity Duration Estimating: Tools and Techniques .3 Parametric Estimating Estimating the basis for activity durations can be quantitatively determined by multiplying the quantity of work to be performed by the productivity rate. For example, productivity rates can be estimated on a design project by the number of drawings times labor hours per drawing, or a cable installation in meters of cable times labor hours per meter. The total resource quantities are multiplied by the labor hours per work period or the production capability per work period, and divided by the number of those resources being applied to determine activity duration in work periods. .4 Three-Point Estimates The accuracy of the activity duration estimate can be improved by considering the amount of risk in the original estimate. Three-point estimates are based on determining three types of estimates: • Most likely. The duration of the schedule activity, given the resources likely to be assigned, their productivity, realistic expectations of availability for the schedule activity, dependencies on other participants, and interruptions. • Optimistic. The activity duration is based on a best-case scenario of what is described in the most likely estimate. • Pessimistic. The activity duration is based on a worst-case scenario of what is described in the most likely estimate. An activity duration estimate can be constructed by using an average of the three estimated durations. That average will often provide a more accurate activity duration estimate than the single point, most-likely estimate. J. Miller, 4/28/10 Estimating Methods .1 Expert Judgment Expert judgment, guided by historical information, can be used whenever possible. .2 Analogous Estimating Analogous estimating uses historical information and expert judgment. .3 Parametric Estimating Estimating the basis for activity durations can be quantitatively determined by multiplying the quantity of work to be performed by the productivity rate. .4 Three-Point Estimates An activity duration estimate can be constructed by using an average of the three estimated durations. • Mean The mean is the average data point value within a data set. To calculate the mean, add all of the individual data points then divide that figure by the total number of data points. • Published estimating methods overlook critical statistical attributes. What’s missing? J. Miller, 4/28/10 Estimating Methods – Averages What’s missing? Standard Deviation Mean If I use the “average” of past projects to estimate future projects, what is virtually guaranteed? J. Miller, 4/28/10 Concepts • Statistical measure – numbers that describe groups. A numerical value, such as standard deviation or average, that characterizes the sample or population from which it was derived. • What do we need to better understand this group of data points? Centering – Mean Spread - Stand Deviation Shape – Normality, data distribution (assume all distributions here are Normal) J. Miller, 4/28/10 Averages and Standard Deviation • Mean The mean is the average data point value within a data set. To calculate the mean, add all of the individual data points then divide that figure by the total number of data points. • Standard Deviation A statistic used to measure the variation in a distribution. Standard deviation is a measure of the spread of data in relation to the mean. It is the most common measure of the variability of a set of data. J. Miller, 4/28/10 Averages and Standard Deviation Standard Deviation Example Suppose we wished to find the standard deviation of the data set consisting of the values 3, 7, 7, and 19. Step 1: find the arithmetic mean (average) of 3, 7, 7, and 19 (3 + 7 + 7 + 19) / 4 = 9. Step 2: find the deviation of each number from the mean, 3−9=−6 7−9=−2 7−9=−2 19 − 9 = 10. Step 3: square each of the deviations, which amplifies large deviations and makes negative values positive, ( − 6)2 = 36 ( − 2)2 = 4 ( − 2)2 = 4 (10)2 = 100. Step 4: find the mean of those squared deviations, (36 + 4 + 4 + 100) / 4 = 36. Step 5: take the non-negative square root of the quotient (converting squared units back to regular units), so, the standard deviation of the set is 6. J. Miller, 4/28/10 Standard Deviation Discovered Assume: Mean = 30, Std Dev = 6 -1s=-6 - 2 s = 2(-6) = -12 - 3 s = 3(-6) = -18 - 4 s = 4(-6) = -24 - 5 s = 5(-6) = -30 - 6 s = 6(-6) = -36 s -6 1s=6 2 s = 2(6) = 12 3 s = 3(6) = 18 4 s = 4(6) = 24 5 s = 5(6) = 30 6 s = 6(6) = 36 s -5 s -4 s -3 s -2 s -1 -36 -30 -24 -18 -12 -6 -6 0 6 12 18 24 30 s 1 s 2 s 3 s 4 s 5 s 6 +6 +12 +18 +24 +30 +36 36 42 48 54 60 66 J. Miller, 4/28/10 Standard Deviation Discovered Assume: Mean = 30, Std Dev = 6 30 -6 0 6 12 18 24 36 42 48 54 60 66 J. Miller, 4/28/10 Standard Deviation is Critical Assume: There are two projects, each using the Three Point Estimate Method Project A Optimistic = 1 Most likely = 50 Pessimistic = 99 Project B Optimistic = 49 Most likely = 50 Pessimistic = 51 0 10 1 20 30 40 50 60 49 51 70 80 90 100 99 Project A Project B Data points are 1, 50 ,99 Mean = 50 StdDev = 49 Data points are 49, 50, 51 Mean = 50 StdDev = 1 J. Miller, 4/28/10 Normal Distribution with Std Dev Estimating Methods PMBOK 3rd 6.4.3 Activity Duration Estimating: Outputs .1 Activity Duration Estimates Activity duration estimates are quantitative assessments of the likely number of work periods that will be required to complete a schedule activity. Activity duration estimates include some indication of the range of possible results. For example: • 2 weeks ± 2 Points to indicate that the schedule activity will take at least eight Points and no more than twelve (assuming a five-Point workweek). • 15 percent probability of exceeding three weeks to indicate a high probability—85 percent—that the schedule activity will take three weeks or less. Let’s take a closer look at making a valid estimate… J. Miller, 4/28/10 Data Points to Data Trends J. Miller, 4/28/10 Point 7 Point 6 Point 5 Point 4 Point 3 Point 2 Point 1 Data Points to Data Trends J. Miller, 4/28/10 Data Points to Data Trends J. Miller, 4/28/10 Data Points to Data Trends J. Miller, 4/28/10 P1 P2 P3 P4 P5 P6 P7 Data Points to Data Trends J. Miller, 4/28/10 Data Points to Data Trends J. Miller, 4/28/10 Point 7 Point 6 Point 5 Point 4 Point 3 Point 2 Point 1 Data Points to Data Trends J. Miller, 4/28/10 Data Points to Data Trends J. Miller, 4/28/10 Data Points to Data Trends J. Miller, 4/28/10 P1 P2 P3 P4 P5 P6 P7 Data Points to Data Trends J. Miller, 4/28/10 P1 P2 P3 P4 P5 P6 P7 Data Points to Data Trends J. Miller, 4/28/10 UCL X s? LCL P1 P2 P3 P4 P5 P6 P7 Data Points to Data Trends J. Miller, 4/28/10 Data Points to Data Trends J. Miller, 4/28/10 Process Exercise What about when we don’t have data for all process steps? What if we had a way to estimate the data? Creating a Control Chart 1. Define the metric: time/unit of output, delta of actual vs. planned time or cost, etc 2. Must be homogeneous 3. Data collection planning 1. Who will collect data? 2. What aspect of the process will be measured? 3. Where, or at what point in the process will the measurement be taken? 4. When or how frequently will the data be collected? 5. Why is this sample being taken? 6. How will the data be collected? 7. How many samples will be taken? 4. Collect the data Step Step Step Step Step Step Step Step Step Step # 1 2 3 4 5 6 7 8 9 10 Steps Project 1 45 32 78 12 67 54 87 43 54 87 10 Project 2 35 67 9 64 55 75 33 58 8 Project 3 56 45 29 54 39 66 63 70 8 What isn’t collected every66 project Projectif4data43 43 56 for31 40 or period? 69 45 75 9 Project 5 41 67 9 59 56 71 43 67 78 9 Project 6 60 45 14 54 67 76 65 74 8 Project 7 54 39 63 61 59 44 60 7 Project 8 41 74 13 63 73 49 59 71 8 Total Mean System Step Time Duration 559 55.9 396 49.5 422 52.8 468 52.0 491 54.6 455 56.9 380 54.3 443 55.4 Creating a Control Chart Step Step Step Step Step Step Step Step Step Step 1 2 3 4 5 6 7 8 9 10 Project 1 45 32 78 12 67 54 87 43 54 87 Project 2 35 67 9 64 55 75 33 58 Project 3 56 45 29 54 39 66 63 70 Project 4 43 43 56 31 66 40 69 45 75 Project 5 41 67 9 59 56 71 43 67 78 Project 6 60 45 14 54 OR 67 76 65 74 Project 7 54 39 63 61 59 44 60 Project 8 Step Step41 Step74 Step13 Step Step63 Step73 Step49 Step59 Step71 Project 9 1 51 2 45 3 72 4 12 5 65 6 7 87 8 46 9 56 1085 Project 49 34 60 67 56 75 Project10 1 45 32 78 128 67 54 87 43 54 87 Project 2 35 67 9 64 55 75 33 58 Project 3 56 45 29 54 39 66 63 70 Project 4 43 43 56 31 66 40 69 45 75 Project 5 41 67 9 59 56 71 43 67 78 Project 6 60 45 14 54 67 76 65 74 AsProject long 7as the 54 units 39 are the 63 same, the 61 model 59 works! 44 60 Project 8 41 74 13 63 73 49 59 71 Project 9 51 45 72 12 65 87 46 56 85 Project 10 49 34 60 8 67 56 75 Total Mean # System Step Steps Time Duration 10 559 55.9 8 396 49.5 8 422 52.8 9 468 52.0 9 491 54.6 8 455 56.9 7 Total 380 Mean 54.3 443 Step 55.4 # 8 System 519 Duration 57.7 Steps9 Time 349 49.9 107 559 55.9 8 396 49.5 8 422 52.8 9 468 52.0 9 491 54.6 8 455 56.9 7 380 54.3 8 443 55.4 9 519 57.7 7 349 49.9 Creating a Control Chart Step Step Step Step Step Step Step Step Step Step # 1 2 3 4 5 6 7 8 9 10 Steps Project 1 45 32 78 12 67 54 87 43 54 87 10 Project 2 35 67 9 64 55 75 33 58 8 Project 3 56 45 29 54 39 66 63 70 8 Project 4 43 43 56 31 66 40 69 45 75 9 Project 5 41 67 9 59 56 71 43 67 78 9 Project 6 60 45 14 54 67 76 65 74 8 Project 7 54 39 63 61 59 44 60 7 Project 8 41 74 13 63 73 49 59 71 8 Project 9 51 45 72 12 65 87 46 56 85 9 Project 10 49 34 60 8 67 56 75 7 Total Mean System Step Time Duration 559 55.9 396 49.5 422 52.8 468 52.0 491 54.6 455 56.9 380 54.3 443 55.4 519 57.7 349 49.9 Total # Projects 10 10 10 10 10 10 10 10 10 10 Recorded # Projects 8 8 9 9 9 9 9 7 8 7 % of Projects Reporting 80.0% 80.0% 90.0% 90.0% 90.0% 90.0% 90.0% 70.0% 80.0% 70.0% Mean 49.9 39.3 64.7 15.2 61.9 54.3 75.4 43.3 60.3 77.1 Standard Deviation 6.6 5.1 10.1 8.6 5.2 9.4 7.3 5.0 4.5 6.6 Variance 44.1 25.9 102.0 74.4 27.1 88.0 53.0 24.9 19.9 43.8 J. Miller, 4/28/10 Creating a Control Chart Total # Projects 10 10 10 10 10 10 10 10 10 10 Recorded # Projects 8 8 9 9 9 9 9 7 8 7 % of Projects Reporting 80.0% 80.0% 90.0% 90.0% 90.0% 90.0% 90.0% 70.0% 80.0% 70.0% Mean 49.9 39.3 64.7 15.2 61.9 54.3 75.4 43.3 60.3 77.1 Standard Deviation 6.6 5.1 10.1 8.6 5.2 9.4 7.3 5.0 4.5 6.6 Variance 44.1 25.9 102.0 74.4 27.1 88.0 53.0 24.9 19.9 43.8 5. Calculate the centerline – Sum of Means 6. Calculate the Upper and Lower Control Limits. UCL = Sum of Means + 3 σ LCL = Sum of Means - 3 σ Sum of Means Sum of Variances 541.4 503.3 Sq Rt of Variances 22.4 σ = Sq Rt of Sum of Variances -3 StdDev -3* 22.4 -67.2 474.2 -2 StdDev -2* 22.4 -44.8 496.6 -1 StdDev -1* 22.4 -22.4 519.0 Mean 541.4 541.4 +1 StdDev 1* 22.4 22.4 563.8 +2 StdDev 2* 22.4 44.8 586.2 +3 StdDev 3* 22.4 67.2 608.6 Creating a Control Chart -3 StdDev -3* 22.4 -67.2 474.2 -2 StdDev -2* 22.4 -44.8 496.6 -1 StdDev -1* 22.4 -22.4 519.0 Mean 541.4 541.4 +1 StdDev 1* 22.4 22.4 563.8 +2 StdDev 2* 22.4 44.8 586.2 +3 StdDev 3* 22.4 67.2 608.6 After calculating the control limits, place the center line and control limits on the chart 608.6 541.4 474.2 Creating a Control Chart A Guide to the Project Management Body of Knowledge (PMBOK® Guide) Third Edition 2004 Project Management Institute, Four Campus Boulevard, Newtown Square, PA 19073-3299 USA J. Miller, 4/28/10 Determine if the Process is Stable Over Time • Check the date for trends • Check the date for patterns J. Miller, 4/28/10 Process Stable Over Time? Learn how to decide if the process is stable and if the data can be used for further analysis. Data is analyzed using a control chart. (1) Point beyond UCL and LCL (beyond 3 sigma) J. Miller, 4/28/10 Process Stable Over Time? (2) Seven consecutive points on the same side of center line. J. Miller, 4/28/10 Process Stable Over Time? (3) Trend: 6 consecutive points steadily increasing or decreasing: J. Miller, 4/28/10 Process Stable Over Time? (4) Repeating pattern and cycles. J. Miller, 4/28/10 Process Stable Over Time? I-MR Chart of Northeast Individual Value 1400 U C L=1367.5 1200 _ X=1000.0 1000 800 LC L=632.4 600 1 3 5 7 9 11 O bser v ation 600 13 15 17 19 Can the above data be used for any further analysis and decision making? 1 Moving Range 1 450 300 Observations: 150 No point is beyond UCL and LCL 0 no pattern repetition There is 1 3 5 7 9 11 13 15 17 There is no continuous increase or decrease of 6 data points O bser v ation There is NO 7 consecutive points on the same side of center line U C L=451.6 __ M R=138.2 LC L=0 19 So, this data can be used for future predictions J. Miller, 4/28/10 Process Stable Over Time? I-MR Chart of Northwest Individual Value 1400 1 U C L=1298.0 1200 _ X=1025.9 1000 800 LC L=753.8 1 600 1 3 5 7 9 11 O bser vation 13 15 17 Can the above data be used for any further analysis and decision making 19 U C L=334.3 M oving Range 300 Observation : There are points beyond UCL and LCL 200 There is no pattern repetition There is no continuous increase or decrease of 6 data points 100 There is 7 consecutive points on the same side of center line 0 __ M R=102.3 LC L=0 This1tells, there is a special cause influencer, which is causing this Before making 3 5 7 9 11 13 15 to happen. 17 19 bser vation any predictions for the future, need toOanalyze the special cause J. Miller, 4/28/10 Process Stable Over Time? I-MR Chart of Southwest Individual Value 1600 U C L=1534.5 1400 _ X=1195.9 1200 1000 LC L=857.3 1 3 5 7 9 11 O bser vation 13 15 Moving Range Can400 the above data be used for any further analysis and decision making : 300 Observation : No point is beyond UCL and LCL 200 There is no pattern repetition There 100 is no continuous increase or decrease of 6 data points There is 7 consecutive points on the same side of center line 0 17 19 U C L=416.0 __ M R=127.3 LC L=0 1 using 3 7 predictions, 9 11 13 15 cause 17 So, Before the data5for future need to analyze the for data 19 points to be on O bser vation same side (during the period) and if required separate those points from the rest, in making the decision J. Miller, 4/28/10 Determine if the Process is Stable Over Time • Check the date for trends • Check the date for patterns Apply Lessons from the Funnel Experiment Rule 1: Leave the funnel fixed over the target. Why don't we simply adjust the funnel after each drop so the next drop will be closer to the target? Don’t change the estimating process, accept the variation, the difference between the estimate and actual projects. Rule 2: For every drop, the marble will come to rest a distance "z" from the target. Rule 2 is to move the funnel a distance -z from its last position. Move the funnel based on the funnels last position. Chang the estimating process (metrics) after each project or difference between estimate and actual. Rule 3: Move the funnel a distance -z from the target after each drop of the marble that ends up a distance z from the target. Note that Rule 2 moves the funnel based on the funnel's last position. Rule 3 moves the funnel a distance from the target. Rule 4: Rule 4 is simply to set the funnel over where the last drop came to rest. J. Miller, 4/28/10 Using Excel to Avoid Common Mistakes Questions? Contact John H. Miller [email protected] J. Miller, 4/28/10