Usability evaluation - IDC
Download
Report
Transcript Usability evaluation - IDC
1
© IDC, IIT Bombay
BadDesigns.com
2
© IDC, IIT Bombay
3
© IDC, IIT Bombay
4
© IDC, IIT Bombay
5
© IDC, IIT Bombay
6
© IDC, IIT Bombay
7
© IDC, IIT Bombay
8
© IDC, IIT Bombay
9
© IDC, IIT Bombay
10
© IDC, IIT Bombay
11
© IDC, IIT Bombay
12
© IDC, IIT Bombay
13
© IDC, IIT Bombay
14
© IDC, IIT Bombay
15
© IDC, IIT Bombay
16
© IDC, IIT Bombay
17
© IDC, IIT Bombay
18
© IDC, IIT Bombay
World Is Full of Unusable Objects
Home assignment
– Find 3 unique usability problems in the day-to-day objects that you have
observed or encountered
– Describe them as 3 slides in a presentation
19
Usability Evaluation
Objective
To identify usability problems in an interface
– To improve the design – formative evaluation
– To compare competitive products or as acceptance test – summative
evaluation
21
© IDC, IIT Bombay
Many Methods
Heuristic evaluation
User Testing
Thinking Aloud Protocol
Card sort
Feedback from real life usage
– Observational field studies
– Logging actual use
– User Feedback
– Questionnaires and Interviews
– Focus groups
22
Usability Evaluation Methods
Heuristic evaluation
– Jakob Nielsen, Usability Engineering, Morgan Kaufman, 1990
23
Task
For use by a shopkeeper in a kirana
shop to make a bill
– Find as many usability problems as you
can in 5 minutes in the Standard
Windows calculator
For each problem
– State it succinctly (if you thought of a
design idea, tell us what problem do you
solve)
– Identify a (potential) user experience
goal that was not met
– Rate it for severity
– Identify layer of user experience
24
© IDC, IIT Bombay
Task
Analyse the alarm app shown in this video for use by a patient
educated till class VII as reminder tool for his medication
– Find as many usability problems as you can in 5 minutes
25
© IDC, IIT Bombay
Analyse the app shown in this video
http://www.youtube.com/watch?v=s94JxnVwJtI
26
Task
Analyse the alarm app shown in this video for use by a patient
educated till class VII as reminder tool for his medication
– Find as many usability problems as you can in 5 minutes
For each problem
– State it succinctly (if you thought of a design idea, tell us what
problem do you solve)
– Identify a (potential) user experience goal that was not met
– Rate it for severity
– Identify layer of user experience where the problem occurs
27
© IDC, IIT Bombay
Heuristic Evaluation Technique
5-10 evaluators examine the product and identify usability
problems
It helps to have a mix of usability experts, domain experts and
users
– Single / double experts
– Same evaluators provide consistency across versions and save time
– New evaluators provide fresh eyes
28
© IDC, IIT Bombay
Heuristic Evaluation Technique
Give assistance regarding the domain knowledge
– Brief about users, personas, scenarios, primary goals, and usability
goals
– Evaluators may be asked to focus on parts of the product / specific
personas
Give a checklist of
– HCI heuristics and principles
– Platform standards, common practices
– Domain standards, common practices
29
© IDC, IIT Bombay
Heuristic Evaluation Technique …
30
Each evaluator has the freedom to choose his/her method of
evaluation
– Each evaluator evaluates the product independently
– Evaluators should go through product 2-3 times
First iteration
– Look for problems novice might face
– Analyze Surface and Skeleton issues
Second iteration
– Use primary goals, usability goals, heuristics, domain standards,
platform standards to identify problems
– Look for conceptual problems, analyze Structure
Third iteration
– Look for what is being missed out
– Question Scope and Strategy issues
© IDC, IIT Bombay
Heuristic Evaluation Technique …
Sometimes, an observer notes the findings
– Each of us will note for ourselves here
Observer may give hints about product use
– These are evaluators, not users
– Save time, capture the problem, and move on
– Help only when evaluators ask, after noting problems
31
© IDC, IIT Bombay
Heuristic Evaluation Technique …
After the evaluations, have a debriefing meeting to compile
findings
The debriefing meeting also produces design suggestions
– Make them sound like suggestions rather than decisions
– Do not express only a design idea, point to the problem first
Typical debriefing sessions last 1-4 hours
– Produce preliminary report during the debriefing meeting
– Fine-tune later
32
© IDC, IIT Bombay
Heuristic Evaluation Output
Output of heuristic evaluation is a list of usability problems
Report formats
– Introduction, product evaluated, method used
– Scenario based problems
~ Enter the cubicle, swipe the card, enter PIN, check balance,
withdraw cash, collect cash, card and receipt, exit the cubicle
– Screen based problems in callouts
~ Idle screen, test keyboard screen, login screen, password screen,
main menu
Common Industry Format (CIF)
~ ISO/IEC 25062:2006
33
© IDC, IIT Bombay
Heuristic Evaluation Some Numbers
5 evaluators find about 75% of the problems
10 evaluators find about 90% of the problems
– HE is prone to reporting false alarms
– Don’t negotiate for too long unless it is a very important issue
– If someone thinks it is a problem, capture it
34
© IDC, IIT Bombay
Usability Evaluation Methods
Heuristic evaluation
User Testing
35
User Testing
A representative set of tasks
Representative users try to perform these tasks
An experimenter conducts the experiment
– Observes users
– Captures, interprets and prioritizes problems
36
© IDC, IIT Bombay
Planning a User Test
Determine the key question that the UT should answer
~ Should we design an IVR? Can low-tech savvy users use IVRs?
~ Is the design OK? Should we start the development?
~ Can all voters use the new EVM? Can we use it in the next election?
– Subset of usability goals and persona goals
37
© IDC, IIT Bombay
Planning a User Test
Determine the key question that the UT should answer
Determine test tasks
– Determine training to be given prior to tasks
– Provide a scenario description
– Give users a goal to achieve, not instructions to execute
– Avoid leading instructions / hints / questions
– Set ‘error traps’ and planned disturbances
– Try to provide early success experiences
38
© IDC, IIT Bombay
Planning a User Test
Determine the key question that the UT should answer
Determine test tasks
>> Output of a user test
Determine evaluation criteria
– Determine the output of the UT
– Identify performance measures
~ Quantitative
~ Success, time, errors, efficiency, satisfaction
~ Qualitative
~ Ability to handle new situations
~ Conceptual understandability of the product
~ Questions asked by users
~ Training procedure
39
© IDC, IIT Bombay
User Testing Output
Qualitative output
~ What is the first reaction?
~ What is confusing? What do users ask? Why?
~ What do the users like / hate most?
~ What features are users likely to use?
~ What are user priorities?
– Good for formative evaluation
40
© IDC, IIT Bombay
User Testing Output
Qualitative output
Task success
~ Find the share price of Reliance
~ Find a good insurance policy
– Levels of success
~ Yes / no
~ Successful in first attempt / second attempt / with help
41
© IDC, IIT Bombay
User Testing Output
Qualitative output
Task success
Task time
– Tell the participant to be as fast as possible
– But avoid declaring
~ “You took 6.3 seconds for that task. Great show.”
– Decide when to stop the timer
– To probe or not to probe?
– Include unsuccessful tasks in timing?
42
© IDC, IIT Bombay
User Testing Output
Qualitative output
Task success
Task time
Errors
– Error “an action of the user that does not accomplish the desired
goal”
– Severity of errors “the more difficult it is to recover from the error,
more sever it is”
Few users face the
Many users face the
problem / problem occurs problem / problem occurs
infrequently
frequently
Easy to recover
Difficult to recover
Low
Medium
Medium
High
43
© IDC, IIT Bombay
User Testing Output
Qualitative output
Task success
Task time
Errors
Efficiency
– Efficiency = success + time
– Number of mouse clicks
– “Lostness” – Smith
44
© IDC, IIT Bombay
User Testing Output
45
Qualitative output
Task success
Task time
Errors
Efficiency
Satisfaction scores
– What to measure?
~ Expectation, ease, satisfaction, time (no watch), smiles / frowns
– How to measure?
~ Likert scales, semantic differential scales
– When to measure?
~ Pre-test, post-test, in the field
– Try to make it look anonymous – online reduces bias
© IDC, IIT Bombay
Planning a User Test
Determine the key question that the UT should answer
Determine test tasks
Determine evaluation criteria
– This will be the output of the UT
~ Qualitative
~ Quantitative: success, time errors, efficiency, satisfaction
– Correctness criteria for each task
– Things to measure
– Things to watch out for
– When to help users
46
© IDC, IIT Bombay
Planning a User Test
Determine the key question that the UT should answer
Determine test tasks
Determine evaluation criteria
Invite / recruit appropriate users
– Matching personas, target markets
– Covering independent variables
– Professional recruiters
– Screener
– Telephonic screening
47
© IDC, IIT Bombay
User Testing Screener
48
Make it short
– Less than 20 questions that take about 5 minutes to run through.
Be clear and specific
– Avoid questions that can lead to ambiguous answers
Avoid unnecessary questions
– A screener is not an interview
– Each question should eliminate some users
Order questions from general to specific
– Try to eliminate more people sooner
Communicate the format of the UT and the expected time
commitments
– Clearly communicate if the user needs to be prepared in some way
Plan on drop outs and no-shows
– Recruit 20% more users than you need
© IDC, IIT Bombay
Planning a User Test
Determine the key question that the UT should answer
Determine test tasks
Determine evaluation criteria
Invite / recruit appropriate users
Prepare the infrastructure
– Carry out all test tasks to ensure readiness
– Carry out pilot tests for practice
49
© IDC, IIT Bombay
User Testing Process
Introduce the user to the topic and put him/her at ease
–
–
–
–
Explicitly say that it is the product that is being tested
Explain that the product is new and untested
Promise confidentiality, inform about any recordings
Avoid large crowds, conspicuous cameras, interruptions other than the
planned ones
Give test tasks one at time
Give hints to only if the user is stuck for a long time
– Or if they run into a known usability problem
After the tests, debrief the user clarify any doubts
Review test plan before repeating with another user
50
© IDC, IIT Bombay
Interview Structure
51
Start
– Break ice – factual questions
– Ask 1-2 open ended questions
Middle
– Give tasks, one at a time
– Contextualize findings – concrete, detailed data
– Keep track of test goals
End
– Debrief, answer questions, discuss insights
– Ask for suggestions
Let the user talk
– Avoid interruptions
– Try to build a partnership
© IDC, IIT Bombay
Questions for Non-directed Interviewing
Don’t imply the answer
~ “We put in a lot of effort to design this product. Do you like it?”
52
© IDC, IIT Bombay
Questions for Non-directed Interviewing
Don’t imply the answer
Questions should be non-judgmental
~ “Don’t you think that calorie advice will be more useful as a
graph?”
~ “What do you learn from this graph?”
or
“With this tool select an appropriate dessert for the family.” then
observe and probe as appropriate
53
© IDC, IIT Bombay
Questions for Non-directed Interviewing
Don’t imply the answer
Questions should be non-judgmental
Keep questions “open-ended”
~ “Which of these features is most important to you?”
implies “At least some features should be important to you.”
~ “Rate the importance of these features on a scale 1-5”
54
© IDC, IIT Bombay
Questions for Non-directed Interviewing
Don’t imply the answer
Questions should be non-judgmental
Keep questions open-ended
Focus on experience, not extrapolation
~ “Do you think car pooling is a useful feature?”
Implies “In the universe of all things, do you think that someone
somewhere will find carpooling useful?”
~ “Given your lifestyle and the amount of commuting you do, do you
think carpooling valuable to you right now?”
55
© IDC, IIT Bombay
Questions for Non-directed Interviewing
Don’t imply the answer
Questions should be non-judgmental
Keep questions open-ended
Focus on experience, not extrapolation
Further, focus on immediate experience
~ “Is a in-car GPS interesting to you?”
Implies “Will a in-car GPS facility be of interest to you at some
time in your life?”
~ “If your car is equipped with this GPS facility, would you use it?”
Then, if yes “How will you use it?”
56
© IDC, IIT Bombay
Questions for Non-directed Interviewing
Don’t imply the answer
Questions should be non-judgmental
Keep questions open-ended
Focus on experience, not extrapolation
Further, focus on immediate experience
Focus a question on a single topic
~ “How will you use the MP3 option at a party and while driving?”
57
© IDC, IIT Bombay
Running a Non-directed Interview
Define ambiguous terms
Restate answers to verify (= share interpretation)
– Listen to user’s questions carefully
Use artifacts to ground discussions
Don’t force opinions
– Never say that the user is wrong – you are here to find out what the
user thinks, not to convince him / her about your opinions
Follow up with examples to probe
– But wait for an undirected answer first
Review interview tapes
– You will find your own ways of improvement
58
© IDC, IIT Bombay
What can go wrong?
Reliability
– “Can we get the result if we repeat the test”
– 10x difference in individual computer usage speeds
– Background of users
~ Access, age, familiarity with UTs
59
© IDC, IIT Bombay
What can go wrong?
Reliability
Validity
– “Does the test reflect the usability issues of interest”
– Choice of users
~ Are they a representative sample of real users?
– Choice of tasks
~ Are these tasks important? To whom?
– Conclusions and their relation with the experiment
~ User took time in first attempt, but this is a frequent task
~ We tested a prototype, then built the whole system, but the prototype
was too small, and the system is far too complex
~ We tested the new widget, but the real problem was finding it
~ Probing confusions and evaluating task time at the same time
60
© IDC, IIT Bombay
What can go wrong?
Reliability
Validity
Experimental design (mostly more validity issues)
– Testing too many things
~ The UT took 4 hours, but the user was exhausted
~ You cannot text everything, you have to rely on the abilities and the
experience of the designers
– Confusing independent & dependent variables
~ Measuring tech-savvyness from task success
61
© IDC, IIT Bombay
What can go wrong?
Reliability
Validity
Experimental design
Experimenter bias
– Experimenter likes one design
– The way the task is given
– What the experimenter chooses to see, probe
62
© IDC, IIT Bombay
What can go wrong?
Reliability
Validity
Experimental design
Experimenter bias
Intervening variables
– Unforeseen circumstances that might interfere with the testing or
performance
~ Interruptions in task time
~ User took three attempts to enter amount (Rs. 500) – this gave him
extra practice with the PIN
63
© IDC, IIT Bombay
User Testing
Gives ‘better’ results than heuristic evaluation
– More confidence about findings
Needs more time and money
64
© IDC, IIT Bombay
Task: Find 2-3 UTs of Each Type
Qualitative output
Task success
Task time
Errors
Efficiency
Satisfaction scores
65
© IDC, IIT Bombay
Usability Evaluation Methods
Heuristic evaluation
User Testing
Think Aloud Protocol
66
Think Aloud Protocol
A set of users use the product while continuously thinking out
aloud
– An evaluator notes all comments
Users need practice before real tasks
– Use a short, familiar but unrelated task
Result is a summary of qualitative data
– Excellent for formative evaluation
– Can’t get performance data, particularly tasks times
Videotaping sessions helps capture nuances
67
© IDC, IIT Bombay
Thinking Aloud Protocol Variants
Constructive Interaction
– Two users use the system together
Retrospective Testing
– Let users think aloud in a video-tape playback
Coaching method
– Let an expert teach a novice
68
© IDC, IIT Bombay
Usability Evaluation Methods
Heuristic evaluation
User Testing
Think Aloud Protocol
Card sort
– Louis Rosenfield, Peter Morville, Information Architecture for the World
Wide Web
69
Categorise...
70
Brinjal
Food
Rice
Brush
Mango
Scale
Camera
Milk
Shampoo
Cauliflower
Mobile phone
Stationery
Chicken
Non-veg
Sweet potatoes
DVDs
Notebook
Tomatoes
Electronics
Pen
Toiletries
Easel
Pencil
Toothpaste
Eraser
Potatoes
Vegetables
Fruits
Raw Banana
Water colours
© IDC, IIT Bombay
What did you use as categories?
Did you make new categories?
71
Brinjal
Food
Rice
Brush
Mango
Scale
Camera
Milk
Shampoo
Cauliflower
Mobile phone
Stationery
Chicken
Non-veg
Sweet potatoes
DVDs
Notebook
Tomatoes
Electronics
Pen
Toiletries
Easel
Pencil
Toothpaste
Eraser
Potatoes
Vegetables
Fruits
Raw Banana
Water colours
© IDC, IIT Bombay
Where did you put these?
72
Brinjal
Food
Rice
Brush
Mango
Scale
Camera
Milk
Shampoo
Cauliflower
Mobile phone
Stationery
Chicken
Non-veg
Sweet potatoes
DVDs
Notebook
Tomatoes
Electronics
Pen
Toiletries
Easel
Pencil
Toothpaste
Eraser
Potatoes
Vegetables
Fruits
Raw Banana
Water colours
© IDC, IIT Bombay
Card Sort Technique
Make index cards
– Categories
– Sub-categories
– Content
– Number the cards (for later analysis)
73
© IDC, IIT Bombay
Card Sort Technique
Make index cards
Give a practice task
– People we know
– Celebrities
– Fruits and vegetables
74
© IDC, IIT Bombay
Card Sort Technique
Make index cards
Give a practice task
Ask users to
– Make piles of cards
– Label piles
– One pile could be ‘I don’t care’ pile
– Think aloud
Take notes while users work
75
© IDC, IIT Bombay
Card Sort Variables
Open / Closed
– Totally open – users write their own cards and categories
– In between – cards are given, users write category and sub-category
labels
– Totally closed – pre-labelled cards and categories
– Early on – use open (discovery)
– Later on – closed (validation)
76
© IDC, IIT Bombay
Card Sort Variables
Open / Closed
Phrasing
– Word, phrase, sentence
~ Equipment, Having fun, Honesty is the best policy
– Category with sample sub-categories
~ Equipment (Computers, Lathe machines, ACs)
– Picture, question, answer, topic, task
77
© IDC, IIT Bombay
Card Sort Variables
Open / Closed
Phrasing
Granularity
– High-level or detailed
– Paragraphs, pages, main pages, sub-sites
– Focussed or generic
– Focussed or random
78
© IDC, IIT Bombay
Card Sort Variables
Open / Closed
Phrasing
Granularity
Heterogeneity
– Early on – mix it up
– Later on – high consistency
79
© IDC, IIT Bombay
Card Sort Variables
Open / Closed
Phrasing
Granularity
Heterogeneity
Cross-listing
– Early on – avoid copies (structure)
– Later on – allow copies (navigation)
80
© IDC, IIT Bombay
Card Sort Variables
Open / Closed
Phrasing
Granularity
Heterogeneity
Cross-listing
Qualitative / Quantitative
– Qualitative suitable for CI / mind-mapping / evaluating IA
– Quantitative suitable for summative evaluation, to do cluster analysis
81
© IDC, IIT Bombay
Card Sort Analysis
Users’ terminologies
Mind-maps
– Users’ conceptual models
% times two cards go together
% times a card goes in a category
82
© IDC, IIT Bombay
Questions?
Heuristic evaluation
User Testing
Thinking Aloud Protocol
Card sort
Feedback from real life usage
– Observational field studies
– Logging actual use
– User Feedback
– Questionnaires and Interviews
– Focus groups
83
Assignment 4
Pick a topic of your interest
– Accessibility, HCI and robotics, e-learning,
Read 5 recent papers (2 years) on this topic
Prepare a 10-minute presentation on this topic
– Submit a 2-page abstract with references
Inform your topics to the Aakash and Abhishek by next class
– Presentations start from the class of 19-3
– The presentation slots will be drawn randomly and announced by 12-3
– This assignment will be evaluated by your peers
84