Transcript ASA Meeting
STATEMENT OF THE PROBLEM to evaluate the presence of morbid changes in the lungs or pleural cavity. These tests are simple, non invasive and widely used in the respiratory clinical practice (Laënnec, 1819). •The detection of voiced sounds (vocal tract resonances) in the pulmonary structures can indicate, for instance, consolidation of the lung parenchyma which could be caused by cancer or pneumonia (Pasterkamp et al. 1997). •The method normally used is auscultation, i.e., evaluation of the sound radiated at the chest surfaces. •However, little objective information is known regarding the specific acoustic properties of normal and pathologic conditions in the lower airways. Thus, can the vocal tract resonances truly be detected in the subglottal system during phonation? If so, what is their influence in the subglottal sound field? Can the vocal tract resonances appear in normal subjects too? •To evaluate the effect of cross-talk between air and skin transmission and the relative phases between the signals, cross-correlation with respect to the signal recorded at the mouth location was performed (see figure 3). •From figure 3: Signals that are in phase with the mouth show positive correlation coefficients, matching previous numerical studies (Zañartu et. al 2007). •High correlation in the chest is associated with corruption by airborne transmitted sounds. This is not desired. /a/ DO WE NEED NEW SENSORS? /e/ /i/ /o/ 1 Figure 5 shows different cases of spectral analysis (PSD) of stationary portions of signals representing the supraglottal (mouth, in blue) and subglottal (notch, in red) sound fields. Steady portions of two distinct vowels (/i/ and /a/) were used. 0.8 0.6 0.4 Figures (a) & (d): 0.2 •They show the PSDs of the 0 -0.2 EGG Mouth Chest Chest Accel Throat Notch complete signals. The gray circles indicate the first two resonances on each case. Back BCUP -0.4 •It can be observed that the notch Figure 3. Correlation coefficients with respect to the mouth microphone signal /u/ signal remains almost the same, although the vocal tract resonances have significantly changed. Mouth A different approach to evaluate corruption with airborne sounds was performed by comparing spectrograms from each sensor with that of the mouth microphone. •Little efforts have been done to evaluate the subglottal sound field during phonation in human subjects due to the difficulties to obtain a non invasive access to such sound field. •Studies on respiratory sounds have shown that good estimates of lung sounds can be registered through skin radiation measurements using air-coupled microphones and special accelerometers (Kraman et al. 1995; Wodicka et al., 1994). However, these devices have not been used or tested to measure voiced sounds in the skin surfaces. (a) PSD of vowel /i/ (d) PSD of vowel /a/ •These perturbations can be better •The vocal tract formants can be clearly observed in appreciated in the figures below: the mouth microphone, but not so much in the other sensors. sound field in the subglottal system during normal phonation. The challenges are: Rest of the figures (below): •The skin sensors have a limited frequency response •Can we get sufficiently “good” voiced signals using skin measurements? What is a “good” signal here? •What are the limitations of such approach? What other sensors or devices may be used for this purpose? •They show the PSDs of the (<2kHz). They show the same response to lung sounds (Kraman et al. 1995) Chest - Accelerometer •The structure of the subglottal signals remains almost HYPOTHESES (b) PSD of vowel /i/ - Open phase only (e) PSD of vowel /a/ - Open phase only •The notch sensor detects the first subglottal using the EGG signal. •During the open phase, there is resonance at 600 Hz. This feature does not change with the vowels. Chest – Air coupled mic with BAI clear acoustic coupling: At the frequencies where there is a vocal tract resonance, the supraglottal spectra show a dip (zero). •The chest sensors are not as sensitive to subglottal sounds. They show features from the supraglottal resonances (vocal tract) that were not present in the notch. This suggests that they are corrupted by airborne transmitted sounds. •The air-coupled + BAI sensor was better isolated to Figure 4. Spectrogram from sensors at different locations for different vowels. airborne transmitted sounds, but not completely. segmented signals, i.e., the open portion (b and e) and closed portion (c and f) of the vocal folds cycle. •This segmentation was performed constant from vowel to vowel •Based on acoustic theory it is hypothesized that the vocal tract formants are not easily •However, at the vocal tract resonant frequencies, the notch spectra shows some perturbations. From figure 4: Notch •The present project primarily investigates the use sound recordings through skin surface radiation to identify the detectable in the subglottal airways. They should only appear as perturbation (poles & zeros close together) in the subglottal transfer function (frequency response). •Based on lung sounds studies, it is expected that air-coupled microphones attached to the skin are able to capture subglottal features using speech excitation. Their frequency response may not be ideal, but sufficient for the purpose. 1.2 Correlation coeff R •The pectoriloquy and egophony are clinical exams where voiced sounds (with vibration of the vocal folds) are used RESULTS: Presence of vocal tract resonances in the subglottal system RESULTS: Sensor performance (c) PSD of vowel /i/ - Closed phase only (f) PSD of vowel /a/ - Closed phase only Figure 5. PSDs of sounds at the mouth (blue) and notch (red) for a vowel /i/ (a,b,c) and a vowel /a/ (d,e,f). •During the closed phase there is no coupling. This is expected since the glottis is closed. CONCLUSIONS METHODS •The methods used followed the approved IRB proposal for human subject use. Four sets of measurements in four The results presented are preliminary since they constitute a series of observations made only for each individual subject. No group statistics have been developed yet. A larger pool of normal and pathologic cases are also needed. different subjects (healthy adult subjects with ages 18-65) were considered. However, the results presented in this paper correspond to analyses of each individual subject. No group comparison have been made so far. Skin radiated sounds can be used to qualitative describe the behavior of the subglottal tract during phonation in a simple and non-invasive fashion. More works needs to be done to estimate qualitative expressions of the variables under evaluation. •All the recordings took place inside a soundproof chamber (IAC) at the Biomedical Acoustics Laboratory in the new state-of-the-art biomedical building on campus. The best estimates of the subglottal sound field were observed at the sternal notch using the air-coupled microphones. Such measurements closely match previous results from experimental and numerical studies of phonation. results of maneuvers involving normal phonation are presented. Each subject repeated three times five sustained vowels (/a/, /e/, /i/, /o/, /u/) that are produced for ~2 seconds each, with a ~1 second pause between them. The methods used allowed estimating the effect of subglottal and supraglottal coupling and the importance of the sensor properties: •The subjects were asked to perform different maneuvers related to phonation and respiration. In this paper, only the •Several sensors were simultaneously recorded: •Four electret condenser microphones (Sony ECM-77B) were coupled with the skin surfaces using air chamber •The preliminary results suggest that there is poor acoustic coupling between supra and sub glottal systems. Vocal tract formants are almost undetectable in the subglottal airways using skin radiation, appearing mainly as zeros in the subglottal frequency response. •The most common sensors used for respiratory exploration at the skin are very susceptible to capture airborne transmitted sounds, invalidating the traditional examinations that use speech. A proposed method to insulate the sensors (the BAI) from this type of contamination improved their performance but needs further development. couplers (Kraman et al. 1995) and adhered to the skin surfaces using double-sided tape disks (2181, 3M). •One lightweight accelerometer (EMT 25C, Siemens) was also adhered to the skin surfaces using double-sided tape disks (2181, 3M). •An electroglottograph (EG2-PCX, Glottal Enterprises) was used around the neck to measure glottal impedance, allowing estimation of the closed and open phases of vocal fold vibration. •An omnidirectional condenser microphone (Beringher, ECM 8000) was used to capture sound radiated at the mouth. •Electronic processing equipment, included an audio preamplifier and mixer (1202, Mackie), an analog filter (3384 & Figure 1. Location of the sensors: air-coupled mics (red), accelerometer (blue), aircoupled mic + BAI (yellow), condenser mic (green), EGG (pink) 3343, Krohn-Hite Corp.), and a desktop computer provided with and National Instruments DAQ (BNC-2090 and NI PC1-M10-16E-1) along with a Labview 8.2 environment. •All signals were first low-pass filtered (Butterworth with 8 poles and a cutoff frequency of 5000Hz) and then digitally recorded with a sampling frequency of 22050 Hz. •The location of the skin radiation sensors were selected at the sternal notch, chest, throat, and back (see figure 1). •To test the effect of air-conduction versus skin conduction, such sensors were tested in different conditions: coupled (directly adhered to the skin), uncoupled (attached to the skin through a small piece of foam that minimizes skin conduction), and coupled but covered with a BAI (bioacoustic insulator). •Spectral estimation was performed via the power spectral densities (PSD) using the Welch method with 8192 FFT points and spectrograms having a Hanning window with a peak-null BW of 150Hz, 8192 FFT points and 20 points of overlap. (a) (b) (c) Figure 2. Type of sensors used: (a) accelerometers and air-coupled mics, (b) EGG,(c) air-coupled mic + BAI REFERENCES •Austin, S. F. and Titze, I. R. (1997). "The effect of subglottal resonance upon vocal fold vibration," J. Voice 11, pp 391–402. •Harper, P., Pasterkamp, H., Kiyokawa, H., and Wodicka, G.R., (2003) “Modeling and measurement of flow effects on tracheal sounds,” IEEE Transactions on Biomedical Engineering, 50(1): 1-11. •Kraman, S. S., Wodicka, G. R., Oh, Y., and Pasterkamp, H. (1995): “Measurement of respiratory acoustic signals: Effect of microphone air cavity width, shape and venting”, Chest,108, pp. 1004–1008. •Laënnec, R. T. H. (1819) “De l'auscultation médiate ou traité du diagnostic de maladies des poumons et du coeur, fondé principalement sur ce nouveau moyen d'exploration“, Brosson et Chaudé, Paris. •Lulich, S. M., (2006) “The Role of Lower Airway Resonances in Defining Vowel Feature Contrasts”, PhD dissertation in Speech and Hearing Bioscience and Technology, MIT. •Pasterkamp, H., Kraman, S.S. and Wodicka, G.R., (1997) “Respiratory Sounds: Advances Beyond the Stethoscope“, Am J Respir Crit Care Med, Vol. 156. pp. 974–987. •Stevens, K. (2000). Acoustic Phonetics (MIT Press, Massachusetts), 1st ed. •Titze, I. R. and Story, B. H., (1997) “Acoustic interactions of the voice source with the lower vocal tract,” J. Acoust. Soc. Am, vol. 101(a), pp. 2234-2243. •Wodicka, G. R., Kraman, S. S., Zenk, G. M., and Pasterkamp, H. (1994): “Measurement of respiratory acoustic signals. Effect of microphone air cavity depth”Chest,106, pp. 1140–1144 •Zañartu, M., Mongeau, L. and Wodicka, G. R. (2007), "Influence of acoustic loading on an effective single mass model of the vocal folds", J. Acoust. Soc. Am., 121(2), pp. 1119-1129. •Zhang, Z., Neubauer, J. , and Berry, D. A. (2006), “The influence of subglottal acoustics on laboratory models of phonation”, J. Acoust. Soc. Am. 120(3), pp. 1558-1569. ACKNOWLEDGMENTS This project is a joint research effort between Professors George R. Wodicka (Purdue University), Hans Pasterkamp (University of Manitoba), Steve Kraman (University of Kentuky) and Jessica Huber (Purdue University). Special thanks to Julio Ho for the help provided during the data collection.