Pershing, Validity and Reliability in Questionnaire Development

Download Report

Transcript Pershing, Validity and Reliability in Questionnaire Development

Questionnaire Development Measuring Validity & Reliability

James A. Pershing, Ph.D.

Indiana University

Definition of Validity

  Instrument measures what it is intended to measure:    Appropriate Meaningful Useful Enables a performance analyst or evaluator to draw correct conclusions

Types of Validity

    Face Content Criterion   Concurrent Predictive Construct

Face Validity

Looks Good To Me     It looks OK Looks to measure what it is supposed to measure Look at items for appropriateness  Client  Sample respondents Least scientific validity measure

Content-Related Validity

Balance Definition Sample Content Format   Organized review of format and content of instrument   Comprehensiveness Adequate number of questions per objective  No voids in content By subject matter experts

Criterion-Related Validity

Subject Instrument A Instrument B

Task Observation Inventory Checklist John Mary Lee yes no yes no no no Pat Jim Scott Jill no yes yes no no yes yes yes Usually expressed as a correlation coefficient (0.70 or higher is generally accepted as representing good validity)    How one measure stacks-up against another  Concurrent = at same time  Predictive = now and future Independent sources that measure same phenomena Seeking a high correlation

T H E O R Y

Construct-Related Validity

Prediction 1 - Confirmed Prediction 2 - Confirmed Prediction 3 - Confirmed Prediction n - Confirmed

   A theory exists explaining how the concept being measured relates to other concepts Look for positive or negative correlation Often over time and in multiple settings Usually expressed as a correlation coefficient (  0.70 or higher is generally accepted as representing good validity)

Definition of Reliability

  The degree to which measures obtained with an instrument are consistent measures of what the instrument is intended to measure Sources of error   Random error = unpredictable error which is primarily affected by sampling techniques  Select more representative samples  Select larger samples Measurement error = performance of instrument

Types of Reliability

   Test-Retest Equivalent Forms Internal Consistency   Split-Half Approach Kuder-Richardson Approach  Cronbach Alpha Approach

Test-Retest Reliability

  Administer the same instrument twice to the same exact group after a time interval has elapsed.

Calculate a reliability coefficient ( r ) to indicate the relationship between the two sets of scores.

  r r of+.51 to +.75 moderate to good over +.75 = very good to excellent

T I M E

Equivalent Forms Reliability

   Also called alternate or parallel forms Instruments administered to same group at same time Vary: Response Set: -- Order -- Wording Stem: -- Order -- Wording  Calculate a reliability coefficient ( r ) to indicate the relationship between the two sets of scores.

  r r of+.51 to +.75 moderate to good over +.75 = very good to excellent

Internal Consistency Reliability

  Split-Half Break instrument or sub parts in ½ -- like two instruments Correlate scores on the two halves Best to consult statistics book and consultant and use computer software to do the calculations for these tests  Kuder-Richardson (KR) Treats instrument as whole  Compares variance of total scores and sum of item variances   Cronbach Alpha Like KR approach Data scaled or ranked

Reliability and Validity

So unreliable as to be invalid Fair reliability and fair validity Fair reliability but invalid Good reliability but invalid Good reliability and good validity The bulls-eye in each target represents the information that is desired. Each dot represents a separate score obtained with the instrument. A dot in the bulls-eye indicates that the information obtained (the score) is the information the analyst or evaluator desires.

Comments and Questions