Audibility & Speech Recognition
Download
Report
Transcript Audibility & Speech Recognition
Audibility
&
Speech
Recognition
Chapter 6
Perry C. Hanavan, Au.D.
Speech Test Applications
obtain information for counseling
illustrate benefits of visual cues
determine hearing aid candidacy
– or determine candidacy for CI or ALDs
predict hearing aid benefit
determine when binaural aids might NOT be appropriate
– 80%, AD and 20% AS
determine amplification characteristics and features
– whether high frequency beneficial
demonstrate advantage of special hearing aid features
demonstrate aided performance is better than undaided
determine whether cognitive or APD exists
– older person may have cognitive problems, anyone may have APD
demonstrate that understanding of speech is impaired
Audiologic Evaluation
Information helpful to providing AR
Audibility
Dynamic Range
Frequency Resolution
Temporal Resolution
Audiologic Evaluation
Critical consideration in helping to plan AR:
Time of Onset of loss
Degree of Loss
Etiology
Type of Loss
And wealth of other factors
Audiologic Evaluation
Observations
Interviews
Questionnaires
Otoscopic examination
Pure tone results
Speech recognition
Immittance
OAE
Electrophysiologic (ECochG, ABR, MLR, LR, etc.)
Pure Tone Results
Degree of loss
– normal, minimal, mild, moderate, moderately
severe, severe, profound
Type of loss
– conductive
– sensorineural
– mixed
Tactile
Only
SNR
Many
Items
Stimulus
Familiarity
Equivalent
Lists
Audition
Plus
Vision
Stimulus
Units
Distance/
Intensity
Learning
Effects
Stimulus
Context
Stimulus
Mode
Stimulus
Elements
WithinSubject
Statistical
Procedure
Audition
Only
Audition
Plus
Tactile
Vision
Only
Open
Set
Speech
Test
Factors
Clinical
Significance
Auditory
Skill
Response
Format
Closed
Set
Paired
t-statistic
Stimulus
Format
Listener
Variables
Cognitive
Ability
Communication
Mode
Test-Restest
Variability
Linguistic
Ability
Hearing
Loss
Live
Synthesized
Speech
Test
Reliability
Test
Conditions
Altered
Speech
Recorded
Stimulus Mode
Tactile
Only
auditory alone
vision alone
Audition
Plus
Vision
tactile alone
auditory and vision
auditory and tactile
tactile and vision
auditory, vision and tactile
Audition
Only
Audition
Plus
Tactile
Vision
Only
Auditory Alone
phoneme, syllable, word, phrase, sentence
open, closed set
high, low context cues
quiet, noise – signal to noise ratio (SNR)
recorded, live
Auditory Plus Vision
Speechreading enhancement
– speechreading enhancement
– speechreading enhancement ratio
• vision only / auditory plus vision = SE
– Children –Craig Sentence and Craig Words and CHIVE
– Adults –Iowa Sentence Test and CUNY Sentences
– Erber (vision and hearing assessment)
Auditory Plus Visual Information
Sumby and Pollack (1954) demonstrated
that the addition of visual speech
information could significantly improve
speech perception performance and that the
importance of visual speech information
increased as the listening situation became
more difficult.
Auditory Plus Vision
Tye-Murray
– CHIVE (adult)
– CAVET (children)
– Audition plus vision
– Vision only
– Audition alone
Eber
– Sent-Ident
Speechreading Enhancement/
Visual Enhancement
Benefit from adding a visual signal to an auditory signal
Refers to the benefit obtained from seeing and hearing a
speaker compared with auditory alone
– Difference or ratio between speech recognition performance in an
vision-only condition and an audition plus-vision condition
– Difference Formula (AV-V)
– Normalized Score Formula (AV-V/100-V)
Patient A (V score=50% AV score = 75%)
– 75-50=25% enhancement (difference score)
– (75-50)/(100-50)=50% (normalized difference score)
Patient B (V score = 10% AV score = 55%)
– 55-10=45% visual enhancement
– (55-10)/(100-10)=50% (normalized difference score)
Auditory Enhancement
Benefit from adding an auditory signal to a visual
only signal
Difference Formula (AV-A)
Normalized Score Formula (AV-A/100-A)
Patient A (A score=50% AV score = 75%)
– 75-50=25% enhancement (difference score)
– (75-50)/(100-50)=50% (normalized difference score)
Patient B (A score = 10% AV score = 55%)
– 55-10=45% auditory enhancement
– (55-10)/(100-10)=50% (normalized difference score)
Integration Enhancement
Measure of ability to integrate auditory and
visual information
AV-[100-(100-A)+(100-V)]/100-[100(100-V)+(100-A)]
THE AUDITORY SANDWICH
Visual cues:
• lip-reading
• printed word
• cued speech
• signs
HEAR
SEE - SAY
HEAR-UNDERSTAND
First, listen.
Then, if need be,
watch or say it.
Then, listen again
(no visual cues)
Always put it back into hearing!
Back to “choices”
Auditory
-Verbal
Auditory
Cued
-Oral
Speech
AUDITORY
Total
Communication
Language
Communication
Cognition
ASL
VISUAL
Choices reframed
Fully
Auditory
Mostly
Auditory
Mostly
Visual
Fully
Visual
Communicator
Communicator
Communicator
Communicator
A
Av AV VA
V
Flexibility is essential
Fully
Auditory
Mostly
Auditory
Mostly
Visual
Fully
Visual
Communicator
Communicator
Communicator
Communicator
A
Av AV VA
V
Stimulus Elements
SNR
Units: phoneme, syllable, word,
phrase, sentence, non-sense
Distance/
stimuli
Intensity
Proximity: distant, near
Intensity: soft, comfortable, loud
Context: high, low context cues,
SNR: quiet, background sounds-signal to noise ratio
Familiarity: high, low familiarity
with material, nonsense material
Stimulus
Familiarity
Stimulus
Units
Stimulus
Context
Learning Effects
Learning effect: familiarity with
items, procedures…not change in
ability
– Repeated Frame Test
– Cinderella-Brahman Speech
Recognition Test
Equivalent lists: lists that contain
items that are presumed to be
equally difficult to recognize
– PB, sentences
– Repeated Frame Sentences Test
Numerous stimuli
– Full lists rather than half lists
Many
Items
Equivalent
Lists
Stimulus Items
Live vs. Recorded
–
–
–
–
–
Voicing frequency
Intonation
Speech rate
Clarity of articulation
Physical characteristics
Synthesized speech
Altered speech
– Time-compressed
– Expanded
– Filtered
Live
Synthesized
Speech
Altered
Speech
Recorded
Significance
Clinical significance
When a small change in performance
is clinically significant
When comparison between two test
results is clinically significant
Statistical design
Paired t-statistic
Within subject statistical procedure
WithinSubject
Statistical
Procedure
Paired
t-statistic
Significant
Difference
for NU-6 recorded
speech tests
Listener Variables
Degree of hearing loss
Cognitive abilities
– Thinking, reasoning,
remembering, imagining, or
learning words
Linguistic abilities
– Knowledge of language
Communication mode used
Multicultural
Cognitive
Ability
Communication
Mode
Linguistic
Ability
Hearing
Loss
Test Reliability
Test reliability: the degree to which
a single test score approximates the
true score
Test-retest variability: measure of
consistency from one test
presentation to the next
Test conditions: Variables affecting
test-retest variability:
– mode of presentation—live vs recorded
– location—test booth vs classroom
– talker—familiar vs unfamiliar, male vs
female
– number times item repeated—once,
twice, etc. leads to better performance
Test
Reliability
Test
Conditions
Response Format
Closed or limited set
Open set
Auditory skill
–
–
–
–
Detection
Discrimination
Identification
Comprehension
Open
Set
Auditory
Skill
Closed
Set
Acclimatization
and
Brain Reorganization
Plasticity
Physiological changes in the
CNS (and PNS – auditory
nerve) that occurs from sensory
experiences
– Brain’s ability to reorganize
space
– Benefit from HA, CI, HATs may
need to be measured at later date
– Brain may continue to acclimate
for several years following HA,
CI, HAT, therapy
Making Connections
A child is born with
over 100 billion
neurons or brain cells.
These neurons form
connections, called
synapses, which make
up the wiring of the
brain.
Brain Development
EARLY EXPERIENCES At
age eight months an infant may
have 1,000 trillion synapses.
By age 10 the number of
synapses decrease to about 500
trillion.
The final number of synapses is
largely determined by a child's
early experiences, which can
increase or decrease the number
of synapses by as much as 25
percent.
Brain Development
"USE IT OR LOSE IT!" The
brain operates on a "use it or lose
it" principle: only those
connections and pathways that are
frequently activated are retained.
Other connections that are not
consistently used will be pruned
or discarded so the active
connections can become stronger.
Brain Development
DEFINING LANGUAGE SKILLS
When an infant is three months old, his
brain can distinguish several hundred
different spoken sounds.
Over the next several months, his brain
will organize itself more efficiently so
that it only recognizes those sounds that
are part of the language he regularly
hears.
During early childhood, the brain
retains the ability to relearn sounds it
has discarded, so young children
typically learn new languages easily and
without an accent.
Dendrites
Dendrites are thin, branching
fibers lined with receptors at
which the dendrite receives
information from other
neurons.
The greater the surface area,
the greater the amount of
information.
Some dendrites are covered
with spines which greatly
increase its surface area.
Plasticity, Adaptation, Acclimatization
Many researchers think that training techniques
sometimes can help those with the reading
disability, dyslexia, because they modify brain
networks. The images above hint that this is the
case. The top images show the brain activity (litup areas) of a 10 year-old boy while he
completes a task that requires the ability to
identify the sounds of words. His reading level
equaled that of an eight-year-old child. The
bottom images show his brain activity while he
completes the same task after receiving eight
weeks of a type of special training. Following
the intervention training, his reading level
increased by three years and the images
indicate that his brain activity changed as
well. Researchers are conducting a very large,
ongoing study to confirm this one example.
Auditory Acclimatization
Back in the 1940s, wideband high-fidelity phonograph consoles were
just becoming available. Because of his interest in high-quality audio,
Harvey Fletcher bought one for his home. Harvey enjoyed listening to
this new high-fidelity system, but unfortunately, the enjoyment was not
shared by his wife. After listening to a an old 78 rpm record, with the
surface noise made particularly prominent by the extended bandwidth
of this new high-fidelity system, she said: "That sounds awful. I don't
really like having that screechy sound in my home."
Always the creative thinker, the next day, when his wife was out of the
house, Harvey went into the living room and soldered twenty 1 uF
capacitors across the loudspeaker terminals, rolling off the high
frequencies. (Remember that amplifiers were high impedance back
then, so the trick worked.) That evening, when the music played, his
wife was now happy.
One night each week, while his wife was sleeping, Harvey would sneak
downstairs and clip one capacitor. After twenty weeks, when the music
played, they were both happy.
Acclimatization
Adapting to a new environment (in this case, auditory) or as
defined by Darwin, the process of inuring to a new climate,
or the state of being so inured. This seems to be a reasonable
term, as it also is used to describe how the human body
acclimates to temperature, altitude, and other environmental
conditions. From an auditory standpoint, Gatehouse was one
of the first to use the term acclimatization, explaining the
speech processing capabilities of a group of people aided
monaurally. In later research Gatehouse used the term
acclimatization to describe an improvement in speech
recognition over time. Today, the term is used widely to
explain adaptation to hearing aid use in general, and is not
limited to the Gatehouse definition.
Adaptation
The process of adapting to something,
such as environmental conditions (in this
case, auditory); the responsive adjustment of
a sense organ. This too is a reasonable term,
as it has long been used in reference to the
eye—e.g., adaptation to varying light
conditions.
Sensory Reorganization
When nerve stimulation
changes, as with amputation, the
brain reorganizes. In one theory,
signals from a finger and thumb
of an uninjured person travel
independently to separate
regions in the brain's thalamus
(left). After amputation,
however, neurons that formerly
responded to signals from the
finger respond to signals from
the thumb (right).
Auditory Reorganization/Plasticity
Cochlear dead
regions
Brain
reorganization
will occur
with damage
to regions of
the cochlea
Speech Recognition Tests
WIPI
Word Intelligibility by
Picture Identification
(WIPI)
closed-set picture-pointing
(six pictures per plate)
appropriate for children
whose language age is
between 5 and 10-11
comprised of four 25monosyllabic word lists
contains 26 color plates
(one for practice), six
pictures per page. (A, V,
A-V)
NU-CHIPS
NORTHWESTERN UNIVERSITY-CHILDREN’S PERCEPTION OF SPEECH (NUCHIPS)
closed set picture pointing word recognition test
children whose language age is as low as three
50 words familiar to three-year-old children in
four randomizations
includes two picture books with 50 monochrome
plates, four pictures per plate. Book A is used for
forms A & B, book B for C & D.
There are two recordings, one male and one
female talker.
MAC
The Minimal Auditory Capabilities (MAC) (2nd
Edition) battery specifically targeted to CI patient
Consists of a series of tasks which are graded in
difficulty
Most of the MAC battery sub-tests assume patient’s
hearing loss has occurred post-lingual, but can be
employed as a means of evaluating the hearing abilities
of persons for whom traditional speech materials are
too difficult
Second edition has been standardized.
The recorded materials include gross sound
identification, inflection detection, contrast detection,
accent discrimination, and word identification
14 sub-tests, 13 audio and one video
SERT
SOUND EFFECTS RECOGNITION TEST
(SERT) developed for those instances where
conventional word recognition measures are not
appropriate, such as when language limitations due
to hearing impairment
Certain children who are unable to recognize even
simple speech can perceive correctly environmental
sounds to which they are exposed in their daily
lives
Under these circumstances, the SERT can provide
valuable information about the integrity of the
auditory system.
Closed set, picture-pointing tasks
10 sounds plus a practice sound
CID Every Day Sentences
EVERYDAY SPEECH
10 sets of 10 sentences each with 50
"target" words in each set for word
recognition assessment under contextual
conditions
Can be employed in auditory training
Sentences vary in length and are spoken
with minimal inflection
Normative data on recording lacking
BKB Sentences
Bench, Koval, & Bamford (BKB)
Open set sentences
Appropriate for linguistic abilities of most 8-15 years of
age with hearing loss
Lists of 16 simple sentences, including 50 key words were
devised to include vocabulary, grammar and sentence
length for 8-15 year olds
The sentences are presented in an open-set format and the
child imitates as much of the sentence as possible.
Responses are recorded word-for-word and scored by
percent of key words correctly repeated.
NU-6
NORTHWESTERN UNIVERSITY
AUDITORY TEST NUMBER SIX (NU-6)
Phonetically balanced CNC monosyllabic
open set word recognition test
Four lists of 50 words each recorded in four
randomizations
Talker has a General American dialect
Standardized
Auditory Numbers Test
ANT
Test helps identify tactile from auditory
listeners
Word closed set recognition auditory alone
test
Appropriate for 3-8 year old children with
severe to profound hearing loss
Simple auditory alone test to measure
ability to perceive simple auditory cues
– (Erber, 1980)
Iowa Consonant Confusion Test
Closed set consonant (phoneme) recognition test
test can also be analyzed in terms of the listener's
ability to identify phonetic features:
– Chance performance for consonant voicing, manner, and
place of articulation identification is 50%, 33%, and 20%
respectively
– Example: Mr. S achieved a total score of 79% correct,
96% on voicing, 94% on manner, and 85% on place
10 consonants presented 12 times in VCV context
–
–
–
–
p, t, k, b, d, g, v, z, n, m
Presentation examples: aba, ada, aga, etc
Each consonant presented 12 times in random order
Presented with carrier phrase “The next word is”
• (Tyler et al, 1983)
S
t
i
m
i
l
u
s
p
t
k
b
d
g
v
z
n
m
p
0
0
0
1
1
1
1
1
0
0
t
2
3
1
1
0
0
2
2
1
0
k
6
2
3
2
3
5
1
0
0
0
Response
b d g v
2 1 0 0
1 1 0 1
0 0 3 1
2 0 3 1
2 0 0 1
0 2 0 0
0 3 0 2
2 0 0 3
0 0 0 0
0 0 0 0
z
0
4
2
0
2
0
1
0
0
0
n
0
0
1
1
3
0
2
2
4
4
m
1
0
1
1
0
4
0
2
7
8
Example
of Iowa
Consonant
Confusion
Test
(example:
auditory
alone)
Children’s Auditory Test
CAT
Auditory alone limited set test assessing
ability to perceive stress patterns and word
recognition
Consists of 12 words
– Monosyllabic words
– Trochees
– Spondees
IMSPAC
Imitative tests of Speech Pattern Contrast
Perception
Developed by Arthur Boothroyd
Syllable level, 4 lists randomized
Choose odd one of 3 (forced choice)
Pointing, button-press or verbal response
Age 7 years and up
AB Short Word List
Isophonemic Word List
Developed in 1968 by Arthur Boothroyd
Speech recognition open set test
Each list consists of ten words, and each word is
constructed as consonant - vowel – consonant
30 phonemes, 10 vowels and 20 consonants
present in each list
CNC words
Score is based on the phonemes correct out of 30
Larsen Recorded Test
Auditory word discrimination test
Pairs of phonemes in words
– few vs. chew
– bill vs. mill
– nice vs. vice
Lists represent an attempt to present the phone in
the initial, medial and final position of a word
Limited choice—select one of two words by
drawing line through printed word heard
CAVET
Children’s Audiovisual Enhancement Test
Assesses speechreading enhancement in children within
the vocabulary level of 7-9 year olds with profound
prelingual hearing loss
Designed to minimize ceiling and floor effects, eliminate
syntactic factors, and minimize semantic factors
3 lists of 20 words each with half of words easy to
recognize in a vision-only condition and half are less likely
to be recognized in each list but presented in random order
Each list is designated for auditory alone, visual alone, or
auditory-visual only mode
Test available in CD-ROM and VHS format
– (Tye-Murray & Geers, 2002)
CUNY Sentences Test
The City University of New York (CUNY)
Topic Related Sentence Sets consist of 25
equivalent sets of 12 sentences of varying length
Each sentence is related to one of 12 topics
Open set sentence recognition test
Listener is presented with sentences in three
listening conditions: A, V and AV
The test is scored with the total number of words
correctly identified
(Boothroyd, Hannin, & Hnath, 1985)
Lexical Neighborhood Test (LNT)
Lexical neighbors--words that sound similar to a target item.
Often defined as words that differ by a single phoneme from
target word
Open-set test that requires child to imitate stimulus words
immediately after they are presented
Each list consists of 50 monosyllabic words, 25 of which
are high-frequency words with few lexical neighbors (easy)
and 25 of which are lower frequency words with many
lexical neighbors (hard)
Words were selected to be familiar to children with limited
vocabularies.
– Easy Words: juice, good, drive, time, hard, gray, foot, orange, count
– Hard Words: thumb, pie, wet, fight, toe, cut, pink, hi, song, fun, use, mine,
Alternate version of test, the Multisyllabic Lexical Neighborhood
Test (MLNT) consists of 50 words with two to three syllables.
– (Kirk, Pisoni, & Osberger, 1995)
QuickSIN
Provides a one-minute estimate of SNR loss for adults
A quick method to quantify patient’s ability to hear in noise
Can determine if extended high frequency emphasis improves or
degrades understanding of speech in noise
Assist in choosing appropriate amplification and/or other hearing
assistance technologies
Demonstrates directional microphones may improve speech
intelligibility in noise
Open set test recognition test
Consists of list of six sentences with five key words per sentence
presented in four-talker babble noise.
Sentences are presented at pre-recorded signal-to-noise ratios which
decrease in 5dB steps from 25 (very easy) to 0 (extremely difficult).
The SNR's used are 25, 20, 15, 10, 5, 0, encompassing normal to
severely impaired performance in noise.
BKB - SIN
Similar to Quick-SIN but can be utilize
with children
Sentence recognition using BKB sentence
material
Open set sentence recognition test
BKB SIN test
Test score sheets
HINT
The HINT is a prerecorded test that measures
sentence speech recognition abilities in quiet or in
noise accurately, reliably and efficiently
The HINT demonstrated the substantial role that
binaural, directional hearing plays in a normal hearing
individual's ability to communicate in noise
Any degree of hearing impairment, therefore, reduces
the benefits of directional hearing in noise and
increases communication inadequacy
Used to evaluate functional hearing capabilities of HI
users, CI users and those in listening-critical jobs
Copyrighted by the House Institute and is available
for purchase on a compact disc recording to hearing
clinics and research laboratories nationwide.
A children's version of the HINT is also available
DICHOTIC SENTENCE
IDENTIFICATION (DSI)
Developed in an attempt to conceive a dichotic listening task that
would be only minimally affected by peripheral hearing loss
Sentences selected from the Synthetic Sentence Identification (SSI)
test presented dichotically
Onsets and offsets of the sentences are aligned with an accuracy of
100 microseconds.
Developers claim DSI less susceptible to hearing loss than SSW
test
Viable test of central auditory function with hearing loss
Test applicable for auditory assessment of impaired ears through
pure tone averages (PTAs) up to 50 dB
2 sets of 30 pairs of sentences (closed set identification test)
Normative data are available
Ling-6
Phoneme level detection and recognition
test
m, s, sh, e, a, u
Procedure
Detection and identification of phonemes
Further Auditory Evaluations
Mark C. Flynn—Evaluation of Individuals
with hearing loss
Assignment
If you accept the challenge, assign each test
discussed in this section on the following
grid, thus,
Be able to correctly indicate on the
following template, each of the auditory
tests discussed earlier
Speech Paradigm
Detection
Non-Speech
Sounds
Phonemes
Syllables
Suprasegmentals
Words
Phrases
Sentences
Connected
Speech
Discrimination
Recognition
Comprehension