summary - vscht.cz

Download Report

Transcript summary - vscht.cz

SUMMARY
β€’ Z-distribution
β€’ Central limit theorem
Statistical inference
If we can’t conduct a census, we
collect data from the sample of a
population.
Goal: make conclusions about
that population
Confidence interval
𝑠
for 𝑛 β‰₯ 30: π‘₯ ± 𝑍 ×
𝑛
critical value
kritická hodnota
margin of error
moΕΎná odchylka
𝑠
for 𝑛 < 30: π‘₯ ± π‘‘π‘›βˆ’1 ×
𝑛
What a confidence interval does tell us?
β€’ When we say "we are 95% confident that the true value of
the parameter is in our confidence interval", we express
that 95% of the observed confidence intervals will hold the
true value of the parameter.
β€’ After a sample is taken, the population parameter is either
in the interval made or not. It is not a matter of chance!
β€’ A confidence interval does not predict that the true value
of the parameter has a particular probability of being in
the confidence interval given the data actually obtained.
neco ×
β€’
Just to summarize, the margin of error depends on
the confidence level (common is 95%)
2. the sample size 𝑛
1.
β€’
β€’
3.
the variability of the data (i.e. on Οƒ)
β€’
β€’
β€’
as the sample size increases, the margin of error decreases
For the bigger sample we have a smaller interval for which
we’re pretty sure the true population lies.
more variability increases the margin of error
Margin of error does not measure anything else
than chance variation.
It doesn’t measure any bias or errors that happen
during the proces.
β€’
It does not tell anything about the correctness of your
data!!!
𝑠
𝑛
HYPOTHESIS TESTING
Aim of hypothesis testing
β€’ decision making
Engagement ratio
β€’ Hopefully, you like this course so far.
β€’ How to measure this?
number of minutes awake
Engagement ratio =
total minutes avilable
Engagement distribution
𝑁 = 100
πœ‡ = 0.077
Οƒ = 0.107
𝒏 = πŸ‘πŸŽ
𝒙 = 𝟎. πŸπŸ‘
0.0
0.5
1.0
𝑁 = 100, πœ‡ = 0.077, Οƒ = 0.107
β€’ If all students attended to my lesson with singing, what is our point estimate for the
engagement ratio?
β€’ 0.13, number of minutes awake = 100 × 0.13 = 13 min
β€’ Interval estimate
β€’ What is the standard error of the mean that we would to use to compare this sample
mean (0.13) with the means of other samples of the same size?
β€’
𝑆𝐸 =
0.107
30
= 0.019
95%
0.077
0.13
95%
95%
M
0.13
95%
M
95%
0.13
M
𝑛 = 30, π‘₯ = 0.13, Οƒ = 0.107
Confidence interval
𝑠
π‘₯±π‘×
𝑛
β€’ But in our case we know the population standard
deviation 𝜎 = 0.107. So instead of the sample 𝑠 we can
use the population 𝜎.
β€’ What is a 95% confidence interval in our case?
0.13 ± 1.96 ×
0.107
30
⟹ 0.090 … 0.170
Confidence inteval
β€’ Our interval estimate for 95% confidence interval has a lower
bound of 0.090 and an upper bound of 0.170.
β€’ Remember, what these numbers mean. This is the ratio of the
β€œminutes awake” during a lesson to the β€œtotal minutes available
in a lesson” (which is 100 minutes).
Engagement ratio ER =
number of minutes awake
100
β€’ So we’re predicting if I incorporate this musical lesson then the
entire population of 100 students will be awaken between 9
minutes and 17 minutes.
β€’ Without my song the population of 100 students was awaken
7.7 minutes.
β€’ Thus we’re pretty sure a singing works, it will keep you awaken
at least 9 min, but possibly up to 17 min.
Self-assessment
β€’ The engagement ratio may not be perfect, it may have
few flaws.
β€’ For example a fact you’re not sleeping does not
necessarily mean you’re engaged.
β€’ Another option – you can self-report how engaged you
think you’re (at the scale between 1 and 10).
β€’ And you can also self-assess how much you think you
learnt (at the scale between 1 and 10).
Results
Measure of β€œEngagement”
Measure of β€œLearning”
β€’ πœ‡πΈ = 7.5, 𝜎𝐸 = 0.64
β€’ πœ‡πΏ = 8.2, 𝜎𝐿 = 0.73
β€’ π‘₯𝐸 = 8.94
β€’ π‘₯𝐿 = 8.35
Quiz
β€’ Ultimately we want to know if incorporating a song about
the concepts in the lesson will lead to higher engagement
and learning. What statistics should we calculate to
determine this?
1.
2.
3.
4.
Note if the sample means are less than or greater than a
population mean.
Calculate the actual difference between each sample mean and
population mean.
Find where each sample mean falls on the distribution of sample
means for their respective populations.
Find how many 𝜎s each sample mean is from the population
mean.
Measure of β€œEngagement”
Measure of β€œLearning”
β€’ πœ‡πΈ = 7.5, 𝜎𝐸 = 0.64
β€’ πœ‡πΏ = 8.2, 𝜎𝐿 = 0.73
β€’ 𝑛 = 30
β€’ 𝑛 = 30
β€’ 𝑀𝐸 = ?
β€’ 𝑀𝐿 = ?
β€’ 𝑆𝐸𝐸 = ?
β€’ 𝑆𝐸𝐿 = ?
𝒁𝑬 =
πŸ–. πŸ—πŸ’ βˆ’ πŸ•. πŸ“
𝟎. πŸ”πŸ’
πŸ‘πŸŽ
= 𝟏𝟐. πŸ‘πŸ
Measure of β€œEngagement”
𝒁𝑳 =
πŸ–. πŸ‘πŸ“ βˆ’ πŸ–. 𝟐
𝟎. πŸ•πŸ‘
πŸ‘πŸŽ
= 𝟏. 𝟏𝟐
Measure of β€œLearning”
β€’ πœ‡πΈ = 7.5, 𝜎𝐸 = 0.64
β€’ πœ‡πΏ = 8.2, 𝜎𝐿 = 0.73
β€’ 𝑛 = 30
β€’ 𝑛 = 30
β€’ 𝑀𝐸 = 7.5
β€’ 𝑆𝐸𝐸 =
0.64
30
𝒙𝑬 = πŸ–. πŸ—πŸ’
= 0.12
β€’ 𝑀𝐿 = 8.2
β€’ 𝑆𝐸𝐿 =
0.73
30
𝒙𝑬 = πŸ–. πŸ‘πŸ“
= 0.13
Probability of getting a given mean
β€’ 𝑍𝐸 = 12.32, 𝑍𝐿 = 1.12
β€’ What is the probability of randomly selecting a sample of
size 30 and getting a mean at least 8.94 for an
engagement and 8.35 for a learning?
β€’ For 𝑍𝐸 the probability is really low.
β€’ For 𝑍𝐿 the probabilty is 1.0 βˆ’ 0.8686 ~ 0.13.
β€’ So what does this mean, what can we conclude? Check
all what applies.
Conclusions?
1. The song seems to have had an effect on learning, but
not engagement.
2. The song seems to have had an effect on engagement,
but not learning.
3. The song caused and increase in bot engagement and
learning.
4. The song caused an increase in engagement, but not in
learning.
Summary of our findings
Dependent variable
(scale from 1 to 10)
Sample mean 𝒙
(n=30)
engagement
8.94
𝑝 β‰ͺ 0.01
learning
8.35
𝑝 β‰ˆ 0.13
Probability
Likely or unlikely?
Summary of our findings
Dependent variable
(scale from 1 to 10)
Sample mean 𝒙
(n=30)
engagement
???
𝑝 = 0.05
learning
???
𝑝 = 0.10
Probability
Likely or unlikely?
Levels of likelihood - 𝛼 levels
0.05 (5%)
0.01 (1%)
0.001 (0.1%)
β€’ Three conventional levels of (un)likelihood
β€’ If the probability of getting a sample mean is less than
0.05 – 0.01 – 0.001 then it is usually considered unlikely.
β€’ These are called the
𝜢 levels. Or significance levels
(hladiny významnosti).
β€’ 𝛼 level is our criteria for deciding if something is likely or
unlikely.
Quiz
β€’ Focus on 𝛼 = 0.05
Sampling
distribution
Z*
β€’ Which of the following are true?
1. If the probability of getting a particular sample mean is less than
𝛼, it is unlikely to occur.
2. If a sample mean has a Z-score greater than Z*, it is β€œunlikely” to
occur.
3. If the probability of getting a particular sample mean is β€œunlikely”,
the sample mean is in he orange region.
4. The alpha level corresponds to the orange region.
Z-critical value
If the probability of obtaining a
particular sample mean is less than
alpha level then it will fall in this tail
which is called the critical region.
Z*
Z-critical value
If the Z-score of the sample mean is greater than the Zcritical value we have an evidence that this mean is
different from the regular population (the population that
had not watched the musical lesson).
Critical regions
β€’ What is the Z-critical value for 𝛼 = 0.05?
β€’ Using Z-table you find Z-value for 0.95 probability. Which is 1.65.
β€’ What is the Z-critical value for 𝛼 = 0.01?
β€’ 2.33
β€’ What is the Z-critical value for 𝛼 = 0.001?
β€’ 3.08
β€’ We take a sample mean from a sample size 𝑛.
β€’ Then we calculate its Z-score
π‘₯βˆ’πœ‡
𝑍=𝜎
𝑛
β€’ And we get a Z-score of 1.82.
β€’ We say that this is significant at 𝑝 < 0.05.
β€’ 1.82 is somewhere in the red region at the previous picture. It is
less than 0.05, but not less than 0.01.
β€’ It means that a probability of obtaining this sample mean is less
than 5%, but is not less than 1%.
β€’ And remember, 0.05 is the alpha level.
Significance quiz
Z-score
3.14
2.07
2.57
14.31
Significant at
p<
p<
p<
p<
𝛼 level
Z-critical value
0.05
1.65
0.01
2.32
0.001
3.08
Significance quiz
Z-score
3.14
2.07
2.57
14.31
Significant at
p < 0.001
p < 0.05
p < 0.01
p < 0.001
𝛼 level
Z-critical value
0.05
1.65
0.01
2.32
0.001
3.08