Publications & Dissertations
 

COPYRIGHT NOTICE
Copyright and all rights therein for the documents available in this webpage are maintained by the authors or by other copyright holders. The documents made available here are purely meant for ensuring timely dissemination of scholarly and technical work on a non-commercial basis. It is understood that all persons accessing, storing or copying the information in any of these documents will adhere to the terms and constraints invoked by each copyright holder. These works may not be reposted without the explicit permission of the copyright holder.


2013

Kantapon Kaewtip, Lee Ngee Tan, Abeer Alwan, Charles E. Taylor, "A robust automatic bird phrase classifier using dynamic time-warping with prominent region identification", ICASSP 2013, accepted.

Harish Arsikere, Steven M. Lulich and Abeer Alwan, "Non-linear frequency warping for VTLN using subglottal resonances and the third formant frequency," ICASSP 2013, accepted.

L. N. Tan and Abeer Alwan, "Multi-Band Summary Correlogram-based Pitch Detection for Noisy Speech", Speech Communication, in press. [Matlab code of MBSC pitch detector]

L. N. Tan, G. Kossan, M. L. Cody, C. E. Taylor, A. Alwan, "A Sparse Representation-based Classifier for In-set Bird Phrase Verification and Classification with Limited Training Data," ICASSP 2013, accepted.

Gang Chen, Jody Kreiman, Bruce Gerratt, Juergen Neubauer, Yen-Liang Shue, and Abeer Alwan, "Development of a glottal area index that integrates glottal gap size and open quotient," Journal of the Acoustical Society of America, Vol. 133, Issue 3, March 2013, pp. 1656–1666. [pdf]

Harish Arsikere, Gary K.F. Leung, Steven M. Lulich, and Abeer Alwan, "Automatic estimation of the first three subglottal resonances from adults’ speech signals with application to speaker height estimation," Speech Communication, Vol. 55, pp. 51-70, 2013. [pdf]

2012

Steven M. Lulich, John R. Morton, Harish Arsikere, Mitchell Sommers, Gary K. F. Leung, and Abeer Alwan, "Subglottal resonances of adult male and female native speakers of American English," Journal of the Acoustical Society of America, Volume 132, Issue 4, pp. 2592-2602 (2012).

Jody Kreiman, Yen-Liang Shue, Gang Chen, Markus Iseli, Bruce R. Gerratt, Juergen Neubauer, and Abeer Alwan, "Relationships among voice quality, harmonic amplitudes, open quotient, and glottal area waveform shape in sustained phonation," Journal of the Acoustical Society of America, Volume 132, Issue 4, pp. 2625-2632 (2012).

Jody Kreiman, Marc Garellek, Gang Chen, Abeer Alwan, and Bruce R. Gerratt, "Perceptual evaluation of source models," International Conference on Voice Physiology and Biomechanics, 2012.

Harish Arsikere, Gary K.F. Leung, Steven M. Lulich and Abeer Alwan, "Automatic estimation of the first two subglottal resonances in children's speech with application to speaker normalization in limited-data conditions," Interspeech 2012.

Gang Chen, Yen-Liang Shue, Jody Kreiman, and Abeer Alwan, "Estimating the voice source in noise", Interspeech 2012.

L. N. Tan, K. Kaewtip, M. L. Cody, C. E. Taylor, and A. Alwan, "Evaluation of a Sparse Representation-Based Classifier For Bird Phrase Classification Under Limited Data Conditions", Interspeech 2012.

Wei Chu and Abeer Alwan, "FBEM: A Filter Bank EM Algorithm for the Joint Optimization Of Features and Acoustic Model Parameters In Bird Call Classification", ICASSP 2012, pp. 1993-1996.

Gang Chen, Jody Kreiman, and Abeer Alwan, "The Glottaltopograph: A Method of Analyzing High-Speed Images of the Vocal Folds", ICASSP 2012, pp. 3985-3988. [toolkit]

Harish Arsikere, Gary K.F. Leung, Steven M. Lulich and Abeer Alwan, "Automatic height estimation using the second subglottal resonance", ICASSP 2012, pp. 3989-3992.

Julien van Hout and Abeer Alwan, "A Novel Approach to Soft-Mask Estimation and Log-Spectral Enhancement For Robust Speech Recognition", ICASSP 2012, pp. 4105-4108.

W. Chu and Abeer Alwan, "SAFE: A Statistical Approach to F0 Estimation under Clean and Noisy Conditions," IEEE Trans. on Audio, Speech, and Language Processing, Volume 20, No. 3, pp. 933-944, March 2012.


2011

S. Lulich, A. Alwan, H. Arsikere, J. Morton, and M. Sommers, "Resonances and wave propagation velocity in the subglottal airways", Journal of the Acoustical Society of America, Volume 130, Issue 4, pp. 2108-2115, 2011.

B. J. Borgstrom and A. Alwan, "A Unified Framework for Designing Optimal STSA Estimators Assuming Additive Superposition of Speech and Noise", IEEE Trans. on Audio, Speech, and Language Processing, Vol. 19, No. 8, pp. 2579-2590, Nov. 2011.

T. Drugman and A. Alwan, "Joint Robust Voicing Detection and Pitch Estimation Based on Residual Harmonics," Interspeech 2011, pp. 1973-1976.

G. Chen, J. Kreiman, Yen-Liang Shue, and A. Alwan, "Acoustic Correlates of Glottal Gaps," Interspeech 2011, pp. 2673-2676.

S. Lulich, H. Arsikere, J. Morton, G. Leung, A. Alwan, and M. Sommers, "Analysis and automatic estimation of children's subglottal resonances," Interspeech 2011, pp. 2817-2820.

Harish Arsikere, Steven Lulich, and Abeer Alwan, "Automatic Estimation of the First Subglottal Resonance," Journal of the Acoustical Society of America (Express Letters), Vol. 129, Issue 5, pp. 197-203, May 2011.

W. Chu, D.T. Blumstein, "Noise robust bird song detection using syllable pattern-based hidden Markov models," ICASSP 2011, pp. 345-348.

Lee Ngee Tan and Abeer Alwan, "Noise-Robust F0 Estimation Using SNR-Weighted Summary Correlograms From Multi-Band Comb Filters," ICASSP 2011, pp. 4464-4467.

Harish Arsikere, Steven Lulich, and Abeer Alwan, "Automatic Estimation of the Second Subglottal Resonance from Natural Speech," ICASSP 2011, pp. 4616-4619.

Bengt Borgstrom and Abeer Alwan, "Log-Spectral Amplitude Estimation With Generalized Gamma Distributions For Speech Enhancement," ICASSP 2011, pp. 4756-4759.

A. Alwan, J. Jiang and W. Chen, "Perception of place of articulation for plosives and fricatives in noise," Speech Communication, Vol. 53, Issue 2, pp. 195-209, Feb. 2011.

S. Panchapagesan and A. Alwan, "A study of acoustic-to-articulatory inversion of speech by analysis-by-synthesis using chain matrices and the Maeda articulatory model," J. Acoust. Soc. Am., Volume 129, Issue 4, pp. 2144-2162, 2011.

Joseph Tepperman, Sungbok Lee, Shrikanth (Shri) Narayanan, and Abeer Alwan, "A Generative Student Model for Scoring Word Reading Skills," IEEE Transactions On Audio, Speech, And Language Processing, Vol. 19, No. 2, February 2011.

2010

B. J. Borgstrom and A. Alwan, "A Statistical Approach to Mel-Domain Mask Estimation for Missing-Feature ASR", IEEE Signal Processing Letters, Vol. 17, No. 11, pp. 941-944, Nov. 2010.

Y.-L. Shue, G. Chen, and A. Alwan, "On the Interdependencies between Voice Quality, Glottal Gaps, and Voice-Source related Acoustic Measures," Interspeech 2010, pp. 34-37.

G. Chen, X. Feng, Y.-L. Shue, and A. Alwan, "On Using Voice Source Measures in Automatic Gender Classification of Children's Speech," Interspeech 2010, pp. 673-676.

B. J. Borgstrom, P. H. Borgstrom, and A. Alwan, "Efficient HMM-Based Estimation of Missing Features, with Applications to Packet Loss Concealment," Interspeech 2010, pp. 2394-2397.

W. Chu and A. Alwan, "SAFE: a statistical algorithm for F0 estimation for both clean and noisy speech," Interspeech 2010, pp. 2590-2593. [slides]

Y.-L. Shue and A. Alwan, "A new voice source model based on high-speed imaging and its application to voice source estimation," ICASSP 2010, pp. 5134-5137.

L. N. Tan, B. J. Borgstrom and A. Alwan, "Voice Activity Detection using Harmonic Frequency Components in Likelihood Ratio Test," ICASSP 2010, pp. 4466-4469. [Matlab code] [Speech/Non-speech label files for Aurora 2]

B. J. Borgstrom and A. Alwan, "HMM-Based Reconstruction of Unreliable Spectrographic Data for Noise Robust Speech Recognition", IEEE Transactions on Audio, Speech, and Language Processing, Vol. 18, No. 5, July 2010.

B. J. Borgstrom and A. Alwan, "Improved Speech Presence Probabilities Using HMM-Based Inference, with Applications to Speech Enhancement and ASR," Journal of Selected Topics in Signal Processing, Vol. 4, No. 5, pp. 808-815.

Y. Shue, S. Shattuck-Hufnagel, M. Iseli, S. Jun, N. Veilleux, and A. Alwan, "On the acoustic correlates of high and low nuclear pitch accents in American English," Speech Communication, 2010, Vol. 52, No. 2, pp. 106-122.

2009

S. Wang, S. Lulich, and A. Alwan, "Automatic detection of the second subglottal resonance and its application to speaker normalization," J. Acoust. Soc. Am, 2009. Volume 126, Issue 6, pp. 3268-3277.

R. Scarborough, P. Keating, S. Mattys, T. Cho, and A. Alwan, "Optical Phonetics and Visual Perception of Lexical and Phrasal Stress in English," Language and Speech, 2009, Vol. 52, No. 2-3, pp. 135-175.

P. Price, J. Tepperman, M. Iseli, T. Duong, M. Black, S. Wang, C. K. Boscardin, M. Heritage, P. D. Pearson, S. Narayanan, and A. Alwan, "Assessment of emerging reading skills in young native speakers and language learners," Speech Communication, Volume 51, Issue 10, October 2009, pp. 968-984.

S. Wang, P. Price, Y.-H. Lee and A. Alwan, "Measuring children's phonemic awareness through blending tasks," SLaTE workshop 2009.

H. You and A. Alwan, "Temporal Modulation Processing of Speech Signals for Noise Robust ASR," Interspeech 2009, pp. 36-39.

Y.-L. Shue, J. Kreiman, and A. Alwan, "A Novel Codebook Search Technique for Estimating the Open Quotient," Interspeech 2009, pp. 2895-2898.

S. Wang, Y.-H. Lee and A. Alwan, "Bark-shift based nonlinear speaker normalization using the second subglottal resonance," Interspeech 2009, pp. 1619-1622.

W. Chu and A. Alwan, "A Correlation-Maximization Denoising Filter Used as an Enhancement Frontend for Noise Robust Bird Call Classification," InterSpeech 2009, pp. 2831-2834. [slides]

V. Mitra, B. Borgstrom, C. Espy-Wilson, and A. Alwan, "A Noise-type and level-dependent MPO-based speech enhancement architecture," InterSpeech 2009, pp. 2751-2754.

B. J. Borgstrom and A. Alwan, "Missing Feature Imputation of Log-Spectral Data For Noise Robust ASR," to appear, Workshop on DSP in Mobile and Vehicular Systems, 2009.

B. J. Borgstrom and A. Alwan, "Utilizing Compressibility in Reconstructing Spectrographic Data, with Applications to Noise Robust ASR," IEEE Signal Processing Letters, Vol. 16, Issue 5, pp. 398-401, 2009.

W. Chu and A. Alwan, "Reducing F0 Frame Error of F0 Tracking Algorithms Under Noisy Conditions with an Unvoiced/Voiced Classification Frontend," ICASSP 2009, pp. 3969-3972. [slides]

S. Panchapagesan and A. Alwan, "Frequency Warping for VTLN and Speaker Adaptation by Linear Transformation of Standard MFCC," Computer Speech and Language, Vol. 23, Issue 1, pp. 42-64, Jan. 2009.

2008

A. Alwan, "Dealing with Limited and Noisy Data in ASR: A Hybrid Knowledge-Based and Statistical Approach," Keynote Speech at Interspeech 2008, pp. 11-15.

S. Panchapagesan and A. Alwan, "Vocal Tract Inversion by Cepstral Analysis-by-Synthesis using Chain Matrices," Interspeech 2008, pp. 2857-2860.

S. Wang, S.M. Lulich, and A. Alwan, "A reliable technique for detecting the second subglottal resonance and its use in cross-language speaker adaptation," Interspeech 2008, pp. 1717-1720.

Y. Shue, S. Shattuck-Hufnagel, M. Iseli, S. Jun, N. Veilleux, and A. Alwan, "Effects of Intonational Phrase Boundaries on Pitch-Accented Syllables in American English," Interspeech 2008, pp. 873-876. The Best Student Paper Award.

B. J. Borgstrom and A. Alwan, "HMM-Based Estimation of Unreliable Spectral Components for Noise Robust Speech Recognition," Interspeech 2008, pp. 1769-1772.

B. J. Borgstrom, A. Bernard, and A. Alwan, "Error Recovery - Channel Coding and Packetization," Chapter 8 in Automatic Speech Recognition on Mobile Devices and over Communication Networks, Springer-Verlag. Editors: Z.-H. Tan and B. Lindberg, pp. 163-185, 2008.

S. Wang, A. Alwan, and S. Lulich, "Speaker Normalization Based on Subglottal Resonances," ICASSP 2008, pp. 4277-4280.

B. J. Borgstrom and A. Alwan, "An Efficient Approximation of the Forward-Backward Algorithm to Deal With Packet Loss, With Applications to Remote Speech Recognition," ICASSP 2008, pp. 4425-4428.

B. J. Borgstrom and A. Alwan, "A Low Complexity Parabolic Lip Contour Model With Speaker Normalization For High-Level Feature Extraction in Noise Robust Audio-Visual Speech Recognition", IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans, Vol. 38, No. 6, pp. 1273-1280, 2008.

2007

J. Tepperman, M. Black, S. Lee, A. Kazemzadeh, M. Gerosa, M. Heritage, A. Alwan, and S. Narayanan, "A Bayesian Network Classifier for Word-level Reading Assessment," InterSpeech 2007, pp. 2185-2188.

R. Scarborough, P. Keating, M. Baroni, T. Cho, S. Mattys, A. Alwan, E. Auer, L.E. Bernstein, "Optical Cues to the Visual Perception of Lexical and Phrasal Stress in English," UCLA Working Papers in Phonetics, No. 105, pp. 118-124.

S. Wang, P. Price, M. Heritage and A. Alwan, "Automatic Evaluation of Children's Performance on an English Syllable Blending Task", SLaTE workshop 2007.

S. Wang, X. Cui, and A. Alwan, "Speaker Adaptation with Limited Data using Regression-Tree based Spectral Peak Alignment", IEEE Transactions on Audio, Speech and Language Processing, Vol. 15, No. 8, pp. 2454-2464, Nov. 2007.

A. Alwan, Y. Bai, M. Black, L. Casey, M. Gerosa, M. Heritage, M. Iseli, B. Jones, A. Kazemzadeh, S. Lee, S. Narayanan, P. Price, J. Tepperman, and S. Wang, "A System for Technology Based Assessment of Language and Literacy in Young Children: the Role of Multiple Information Sources", IEEE Multimedia Signal Processing Workshop, Oct. 2007, pp. 26-30.

Y. Shue, M. Iseli, N. Veilleux, and A. Alwan, "Pitch Accent versus Lexical Stress: Quantifying Acoustic Measures Related to the Voice Source", Proceedings of Interspeech 2007, pp. 2625-2628, Belgium.

B. J. Borgstrom and A. Alwan, "A Packetization and Variable Bitrate Interframe Compression Scheme For Vector Quantizer-Based Distributed Speech Recognition," Proceedings of Interspeech 2007, pp. 578-581, Belgium.

J. Jiang, A. Alwan, E. Auer, P. Keating, and L. Bernstein, "Similarity structure in visual speech perception and optical phonetic signals", Perception and Psychophysics, Vol. 69, No. 7, pp. 1070-1083, October 2007.

X. Cui and A. Alwan, "Robust Speaker Adaptation by Weighted Model Averaging Based on the Minimum Description Length Criterion," IEEE Transactions on Audio, Speech, and Language Processing, Vol. 15, No. 2, pp. 652-660, Feb. 2007.

B. J. Borgstrom, M. van der Schaar, A. Alwan, "Rate Allocation for Non-Collaborative Multi-User Speech Communication Systems Based On Bargaining Theory", IEEE Transactions on Audio, Speech, and Language Processing, Vol. 15, No. 4, pp. 1156-1166, May 2007.

M. Iseli, Y.-L. Shue, A. Alwan, "Age, sex, and vowel dependencies of acoustical measures related to the voice source", Journal of the Acoustical Society of America, Vol. 121, Issue 4, pp. 2283-2295, April 2007.

H. You, A. Alwan, "A Statistical Acoustic Confusability Metric for Hidden Markov Models," IEEE ICASSP Proceedings, Vol. 4, pp. 745-748, 2007.

2006

R. Scarborough, P. Keating, M. Baroni, T. Cho, S. Mattys, A. Alwan, E. Auer, L.E. Bernstein, "Optical Cues to the Visual Perception of Lexical and Phrasal Stress in English," Proceedings of the 3rd International Conference on Speech Prosody, May 2006.

B. J. Borgstrom, M. van der Schaar, A. Alwan, "Bargaining-Based Rate Allocation for Non-Collaborative Multi-User Speech Communication Systems", SiMPE workshop, 2006.

M. Iseli, Y.-L. Shue, M. Epstein, P. Keating, A. Alwan, "Voice Source Correlates of Prosodic Features in American English: a Pilot Study", Proceedings of ICSLP 2006, pp. 2226-2229.

Shizhen Wang, Xiaodong Cui and Abeer Alwan, "Rapid Speaker Adaptation Using Regression-Tree Based Spectral Peak Alignment", Proceedings of ICSLP 2006, pp. 1479-1482.

S. Panchapagesan, "Frequency Warping by Linear Transformation of Standard MFCC", Proceedings of ICSLP 2006, pp. 397-400.

J. Tepperman, J. Silva, A. Kazemzadeh, H. You, S. Lee, A. Alwan, and S. Narayanan, "Pronunciation Verification of Children's Speech for Automatic Literacy Assessment," Proceedings of ICSLP 2006.

A. Kazemzadeh, J. Tepperman, J. Silva, H. You, S. Lee, A. Alwan, and S. Narayanan, "Automatic Detection of Voice Onset Time Contrasts for Use in Pronunciation Assessment," Proceedings of ICSLP 2006.

J. Xue, B. J. Borgstrom, J. Jiang, L. Bernstein, A. Alwan, "Acoustically-driven Talking Face Synthesis Using Dynamic Bayesian Networks", Proceedings of IEEE ICME 2006, pp. 1165-1168.

S. Panchapagesan and A. Alwan, "Multi-parameter Frequency Warping for VTLN by Gradient Search", IEEE ICASSP Proceedings, I-1181, May 2006.

M. Iseli, Y. Shue, and A. Alwan, "Age- and Gender-Dependent Analysis of Voice Source Characteristics", IEEE ICASSP Proceedings, I-389, May 2006.

Li Deng, X. Cui, R. Pruvenok, J. Huang, S. Momen, Y. Chen, and A. Alwan, "A Database of Vocal Tract Resonance Trajectories for Research in Speech Processing", IEEE ICASSP Proceedings, I-369, May 2006.

X. Hu, M. Bergsneider, E. Rubinstein and A. Alwan, "Reduction of Compartment Compliance Increases Venous Flow Pulsatility and Lowers Apparent Vascular Compliance: Implications for Cerebral Blood Flow Hemodynamics," Medical Engineering and Physics, Vol. 28, Issue 4, pp. 304-314, May 2006.

X. Cui and A. Alwan, "Adaptation of Children's Speech with Limited Data Based on Formant-like Peak Alignment," Computer Speech and Language, Vol. 20, Issue 4, pp. 400-419, October 2006.

Jintao Jiang, Marcia Chen, Abeer Alwan, "On the perception of voicing in syllable-initial plosives in noise", Journal of the Acoustical Society of America, Volume 119, Issue 2, pp. 1092-1105, February 2006.

2005

J. Xue, J. Jiang, A. Alwan and L. Bernstein, "Consonant confusion structure based on machine classification of visual features in continuous speech," Audio-Visual Speech Processing Workshop 2005, Vancouver Island, Canada, pp. 103-108.

H. You, A. Alwan, A. Kazemzadeh and S. Narayanan, "Pronunciation Variation of Spanish-accented English Spoken by Young Children," Eurospeech 2005, pp. 749-752.

A. Kazemzadeh, H. You, M. Iseli, B. Jones, X. Cui, M. Heritage, P. Price, E. Anderson, S. Narayanan and A. Alwan, "TBALL Data Collection: the Making of a Young Children's Speech Corpus," Eurospeech 2005, pp. 1581-1584.

X. Cui and A. Alwan, "MLLR-Like Speaker Adaptation Based on Linearization of VTLN with MFCC features," Eurospeech 2005, pp. 273-276.

X. Cui and A. Alwan, "Noise Robust Speech Recognition Using Feature Compensation Based on Polynomial Regression of Utterance SNR," IEEE Transactions on Speech and Audio Processing, Vol. 13, Number 6, pp. 1161-1172, November 2005.

2004

S. Narayanan and A. Alwan, "Text to Speech Synthesis: New Paradigms and Advances," Pearson Education, Prentice Hall, August 2004.

H. You, Q. Zhu, and A. Alwan, "Entropy-based Variable Frame Rate Analysis of Speech Signals and Its Application to ASR," in Proc. ICASSP, pp. 549-552, Montreal, Canada, May 2004.

M. Iseli and A. Alwan, "An Improved Correction Formula for The Estimation of Harmonic Magnitudes and Its Application to Open Quotient Estimation," in Proc. ICASSP, pp. 669-672, Montreal, Canada, May 2004.

X. Cui and A. Alwan, "Combining Feature Compensation and Weighted Viterbi Decoding for Noise Robust Speech Recognition With Limited Adaptation Data," in Proc. ICASSP, pp. 969-972, Montreal, Canada, May 2004.

2003

P. Keating, M. Baroni, S. Mattys, R. Scarborough, A. Alwan, E. Auer, and L. Bernstein, "Optical Phonetics and Visual Perception of Lexical and Phrasal Stress in English," Proc. 15th International Congress of Phonetic Sciences, pp. 2071-2074, 2003.

M. Hasegawa-Johnson, S. Pizza, A. Alwan, J.S. Cha and K. Haker, "Vowel Category Dependence of the Relationship Between Palate Height, Tongue Height, and Oral Area," Journal of Speech, Language, and Hearing Research, Vol. 46, Issue 3, June 2003, pp. 738-739.

M.O. Rosa, J.C. Pereira, M. Grellet and A. Alwan, "A Contribution to Simulating a Three-dimensional Larynx Model Using the Finite Element Method," in JASA, Vol. 114, Issue 5, Nov. 2003, pp. 2893-2905.

Q. Zhu and A. Alwan, "Non-linear feature extraction for robust recognition in stationary and non-stationary noise," Computer Speech and Language, 17(4): 381-402, Oct. 2003.

X. Cui, Al. Bernard, and A. Alwan, "A Noise-Robust ASR Back-end Technique Based on Weighted Viterbi Recognition," in Proc. EUROSPEECH, Switzerland, pp. 2169-2172, Sept. 2003.

Z. AlBawab, I. Locher, J. Xue, and A. Alwan, "Speech Recognition over Bluetooth Wireless Channels," in Proc. EUROSPEECH, Switzerland, pp. 1233-1236, Sept. 2003.

W. Chen and A. Alwan, "Perception of the Place of Articulation Feature for Plosives and Fricatives in Noise," in Proc. ICPhS, Barcelona, August 2003.

James J. Hant and Abeer Alwan, "A Psychoacoustic-Masking Model to Predict the Perception of Speech-Like Stimuli in Noise," Speech Communication, Vol. 40, May 2003, pp. 291-313.

H.F. Chi, S.X. Gao, S.D. Soli, and A. Alwan, "Band-limited Feedback Cancellation with a Modified Filtered-X LMS Algorithm for Hearing Aids," special issue of Speech Communication on Signal Processing for Hearing Aids, Vol. 39, Issues 1-2, Jan. 2003, pp. 147-161.

2002

A. Bernard and A. Alwan, "Low-bitrate Distributed Speech Recognition for Packet-based and Wireless Communication", IEEE Transactions on Speech and Audio Processing, Vol. 10, Number 8, pp. 570-580, Nov. 2002.

M. Hasegawa-Johnson and A. Alwan, "Speech Coding: Fundamentals and Applications," a chapter in the Wiley Encyclopedia of Telecommunications, Wiley, Editor: Prof. John Proakis, December 2002, Vol. 5, pp. 2340-2359.

J. Jiang, A. Alwan, P.A. Keating, E.T. Auer, and L.E. Bernstein, "On the relationship between face movements, tongue movements, and speech acoustics," special issue of EURASIP Journal on Applied Signal Processing on joint audio-visual speech processing, Nov. 2002, pp. 1174-1188.

X. Hu, A. Alwan, V.I. Nenov, E.H. Rubinstein, and M. Bergsneider, "Estimating brain compliance based on a novel model of intracranial cerebrospinal fluid dynamics," EMBS-BMES Proceedings, Houston, Texas, Oct. 2002.

Qifeng Zhu and Abeer Alwan, "The Effect of Additive Noise on Speech Amplitude Spectra: a Quantitative Approach," IEEE Signal Processing Letters, Vol. 9, Issue 9, Sept. 2002, pp. 275-277.

Brian Gabelman and Abeer Alwan, "Analysis and Synthesis of Amplitude Modulation Components in Pathological Voices," Proc. IEEE 2002 Workshop on Speech Synthesis, Santa Monica.

A. Bernard and A. Alwan, "Channel Noise Robustness for Low-Bitrate Remote Speech Recognition," ICSLP Proceedings, Denver, Colorado, Sep. 2002, Vol. 3, pp. 2213-2216.

X. Cui, M. Iseli, Q. Zhu, and A. Alwan, "Evaluation of Noise Robust Features on the Aurora Databases," ICSLP Proceedings, Denver, Colorado, Sep. 2002, Vol. 1, pp. 481-484.

A. Bernard, X. Liu, R. Wesel and A. Alwan, "Speech transmission using rate-compatible trellis codes and embedded source coding," IEEE Transactions on Communications, Vol. 50, No. 2, Feb. 2002, pp. 309-320.

J. Jiang, A. Alwan, L.E. Bernstein, E.T. Auer, and P.A. Keating, "Predicting Face Movements from Speech Acoustics Using Spectral Dynamics," Proc. ICME 2002, Lausanne, Switzerland, pp. 181-184.

J. Jiang, A. Alwan, L. Bernstein, E. Auer, and P. Keating, "Similarity structure in perceptual and physical measures for visual consonants across talkers," Proc. IEEE ICASSP, 2002, Orlando, pp. 441-444.

X. Cui and A. Alwan, "Efficient Adaptation Text Design Based On The Kullback-Leibler Measure," Proc. IEEE ICASSP, 2002, Orlando, pp. 613-616.

B. Gabelman and A. Alwan, "Analysis by Synthesis of FM Modulation and Aspiration Noise Components in Pathological Voices," Proc. IEEE ICASSP, 2002, Orlando, pp. 449-452.

2001

A. Alwan, Q. Zhu, and J. Lo, "Human and Machine Recognition of Speech Sounds and Noise," Invited paper, Proc. of the World Multiconference on Systems, Cybernetics, and Information, Vol. XIII, pp. 218-223, Florida, Aug. 2001.

Brian Strope and Abeer Alwan, "Modeling the Perception of Pitch-Rate Amplitude Modulation in Noise", in "Computational Models of Auditory Function", a book edited by Steve Greenberg and Malcolm Slaney, pp. 315-327, IOS Press, NATO Science Series, Netherlands, 2001.

A. Bernard and A. Alwan, "Joint channel decoding - Viterbi recognition for wireless applications," Proc. EUROSPEECH 2001, Aalborg, Denmark, Vol. 4, pp. 2703-2706.

Q. Zhu, X. Cui, M. Iseli and A. Alwan, "Noise Robust Feature Extraction for ASR using the Aurora 2 Database," Proc. EUROSPEECH 2001, Aalborg, Denmark, Vol. 1, pp. 185-188.

M. Chen and A. Alwan, "On the Perception of Voicing for Plosives in Noise," Proc. EUROSPEECH 2001, Aalborg, Denmark, Vol. 1, pp. 175-178.

J. Jiang, A. Alwan, E. Auer, and L. Bernstein, "Predicting visual consonant perception from physical measures," Proc. EUROSPEECH 2001, Aalborg, Denmark, Vol. 1, pp. 179-182.

L. Bernstein, J. Jiang, A. Alwan, and E. Auer, "Visual phonetics and optical phonetics," Proc. AVSP 2001, Scheelsminde, Denmark, pp. 104-109.

A. Bernard and A. Alwan, "Source and channel coding for remote speech recognition over error-prone channel," Proc. of ICASSP 2001, Vol. 4, pp. 2613-2616.

Q. Zhu and A. Alwan, "An efficient and scalable 2D DCT-based feature coding scheme for remote speech recognition," Proc. ICASSP 2001, Vol. 1, pp. 113-116.

2000

Q. Zhu and A. Alwan, "Amplitude Demodulation of Speech Spectra and its Application to Noise Robust Speech Recognition," 6th International Conference on Spoken Language Processing, ICSLP 2000, Vol. 1, pp. 341-344.

W. Chen and A. Alwan, "Place of Articulation Cues for Voiced and Voiceless Plosives and Fricatives in Syllable-Initial Position," 6th International Conference on Spoken Language Processing, ICSLP 2000, Vol. 4, pp. 113-116.

J. Hant and A. Alwan, "Predicting the Perceptual Confusion of Synthetic Stop Consonants in Noise," 6th International Conference on Spoken Language Processing, ICSLP 2000, Vol. 3, pp. 941-944.

J. Jiang, A. Alwan, L. Bernstein, P. Keating, and E. Auer, "On the Correlation between Facial Movements, Tongue Movements and Speech Acoustics," 6th International Conference on Spoken Language Processing, ICSLP 2000, Vol. 1, pp. 42-45.

M. Iseli and A. Alwan, "Inter- and Intra-speaker Variability of Glottal Flow Derivative using the LF Model," 6th International Conference on Spoken Language Processing, ICSLP 2000, Vol. 1, pp. 477-480.

M. Siqueira and A. Alwan, "Steady-state analysis of continuous adaptation in acoustic feedback reduction systems for hearing aids," IEEE Transactions on Speech and Audio Processing, Vol. 8, No. 4, pp. 443-453, July 2000.

Espy-Wilson, C.Y.; Boyce, S.E.; Jackson, M.; Narayanan, S.; and A. Alwan, "Acoustic modeling of American English /r/," Journal of the Acoustical Society of America (JASA), July 2000, Vol. 108, No. 1, pp. 343-356.

Srinivasamurthy, N.; Ortega, A.; Zhu, Q.; Alwan, A., "Towards efficient and scalable speech compression schemes for robust speech recognition applications," 2000 IEEE International Conference on Multimedia and Expo (ICME) Proceedings. Latest Advances in the Fast Changing World of Multimedia, NY, 30 July-2 Aug. 2000. IEEE Press, Vol. 1, pp. 249-252.

Q. Zhu and A. Alwan, "On the use of variable frame rate analysis in speech recognition," Proc. IEEE ICASSP, Istanbul, Turkey, Vol. III, pp. 1783-1786, June 2000.

S. Narayanan and A. Alwan, "Noise Source models for fricative consonants," IEEE Transactions on Speech and Audio Processing, Vol. 8, No. 3, pp. 328-344, May 2000.

1999

A. Alwan, "Modeling speech production and perception mechanisms and their applications to synthesis, recognition, and coding," Fifth International Symposium on Signal Processing and its Applications, Proceedings, Brisbane, Qld, Australia, 1999, Vol. 1, p. 7.

J. Hant and A. Alwan, "Modeling the masking of Formant Transitions in Noise," Proc. of Eurospeech 99, Budapest, Hungary, Vol. 4, pp. 1895-1898. This paper was one of three papers nominated for the best student paper award in Speech Communication at Eurospeech '99.

A. Alwan, P. Bangayan, B. Garrett, J. Kreiman, and C. Long, "Analysis by synthesis of pathological voices," an invited chapter in the book Voice Quality Measurement, R. Kent ed., pp. 307-335, Singular Publishing Group, 1999.

A. Bernard and A. Alwan, "Perceptually Based and Embedded Wideband CELP Coding of Speech," Proc. of Eurospeech 1999, Budapest, Hungary, Vol. 4, pp. 1543-1546.

A. Alwan, S. Narayanan, B. Strope, and A. Shen, "Speech production and perception models and their applications to synthesis, recognition, and coding," an invited chapter in the book Speech Processing, Recognition, and Artificial Neural Networks, Chollet, DiBenedetto, Esposito, and Marinaro ed., pp. 138-161, Springer-Verlag, UK, 1999.

A. Alwan, J. Lo, Q. Zhu, "Human and Machine Recognition of Nasal Consonants in Noise," Proceedings of the 14th International Congress of Phonetic Sciences, Vol. 1, pp. 167-170, August 1999, San Francisco.

A. Bernard, X. Liu, R. Wesel and A. Alwan, "Embedded Joint Source-Channel Coding of Speech using Symbol Puncturing in Trellis Code," Proceedings of ICASSP 99, Vol. 5, pp. 2427-2430, Phoenix, March 1999.

M. Siqueira and A. Alwan, "Bias Analysis in Continuous Adaptation Systems for Hearing Aids," Proceedings of ICASSP 99, Vol. 2, pp. 925-928, Phoenix, AZ, March 1999.

1998

A. Bernard, X. Liu, R. Wesel and A. Alwan, "Channel Adaptive Joint Source-Channel Coding of Speech," Proc. of the 32nd Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA, November 1998, Vol. 1, pp. 357-361.

M. Siqueira and A. Alwan, "Steady-State Analysis of Continuous Adaptation Systems for Hearing Aids with a Delayed Cancellation Path," Proc. of the 32nd Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA, November 1998. Piscataway, NJ, USA: IEEE, 1998, Vol. 1, pp. 518-522.

J. Hant, B. Strope, and A. Alwan, "Variable-duration notched-noise experiments in a broadband noise context," Journal of the Acoustical Society of America, Oct. 1998, Vol. 104, No. 4, pp. 2451-2456.

B. Strope and A. Alwan, "Modeling the perception of pitch-rate amplitude modulation in noise," Proc. of the NATO ASI on Computational Hearing, pp. 117-122, July 1998.

B. Strope and A. Alwan, "Amplitude Modulation Cues for Detecting Voicing Distinctions in Noise," Proceedings of the ICA/ASA Conference, Seattle, pp. 209-210, June 1998.

J. Hant, B. Strope, and A. Alwan, "Variable-Duration Notched-Noise Experiments in a Noise Context," Proceedings of the ICA/ASA Conference, Seattle, pp. 869-870, June 1998.

Amit Rane, Derrick C. Wei, Lisa E. Falkson and A. Alwan, "Modeling the Transitory Behavior of Speech Using a Time-Varying Transmission Line Model," Proceedings of the ICA/ASA Conference, Seattle, pp. 261-262, June 1998.

B. Gabelman, J. Kreiman, B. Gerratt, N. Antonanzas-Barroso, and A. Alwan, "Perceptually motivated modeling of noise in pathological voices," Proceedings of the ICA/ASA Conference, Seattle, pp. 1293-1294, June 1998.

B. Gerratt, J. Kreiman, N. Antonanzas-Barroso, B. Gabelman, and A. Alwan, "Source Modeling of Severely Pathological Voices," Proceedings of the ICA/ASA Conference, Seattle, pp. 1271-1272, June 1998.

B. Strope and A. Alwan, "Robust Word Recognition Using Threaded Spectral Peaks," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Seattle, Vol. II, pp. 625-629, May 1998.

Bergsneider M; Alwan AA; Falkson L; Rubinstein EH, "The relationship of pulsatile cerebrospinal fluid flow to cerebral blood flow and intracranial pressure: a new theoretical model," Acta Neurochirurgica, Supplementum, 1998, 71:266-8.

1997 ↑Top

P. Bangayan, C. Long, A. Alwan, J. Kreiman, B. Gerratt, "Analysis by synthesis of pathological voices using the Klatt synthesizer," Speech Communication, Vol. 22, No. 4, 1997, pp. 343-368.

B. Strope and A. Alwan, "Modeling auditory perception to improve robust speech recognition," Proceedings of the 31st Asilomar Conference on Signals, Systems, and Computers, IEEE Comput. Soc., 1997, Vol. 2, pp. 1056-1060.

M. Siqueira, A. Alwan, R. Speece, "Steady-State Analysis of Continuous Adaptation Systems in Hearing Aids," Proceedings of the IEEE Workshop on Audio and Electroacoustics, Mohonk, October 1997.

S. Narayanan, A. Alwan, and Y. Song, "New Results in Vowel Production: MRI and EPG data," Proceedings of Eurospeech, Vol. 2, pp. 1007-1009, Patras, Greece, September 1997.

S. Roweis and A. Alwan, "Towards articulatory speech recognition," Proceedings of Eurospeech, Vol. 3, pp. 1227-1230, Patras, Greece, September 1997.

C. Espy-Wilson, S. Narayanan, S. Boyce, and A. Alwan, "Acoustic Modeling of American English /r/," Proceedings of Eurospeech, Vol. 1, pp. 393-396, Patras, Greece, September 1997.

B. Strope and A. Alwan, "A model of dynamic auditory perception and its application to robust word recognition," IEEE Transactions on Speech and Audio Processing, Vol. 5, No. 5, pp. 451-464, September 1997.

J. Hant, B. Strope, and A. Alwan, "A psychoacoustic model for the noise masking of plosive bursts," JASA, Vol. 101, No. 5, pp. 2789-2802, May 1997.

B. Tang, A. Shen, A. Alwan, and G. Pottie, "A Perceptually-Based Embedded Subband Speech Coder," IEEE Transactions on Speech and Audio Processing, Vol. 5, No. 2, pp. 131-140, March 1997.

S. Narayanan, A. Alwan, and K. Haker, "Towards articulatory-acoustic models for liquid consonants based on MRI and EPG data. Part I: The laterals," JASA, Vol. 101, No. 2, pp. 1064-1077, February 1997.

A. Alwan, S. Narayanan, and K. Haker, "Towards articulatory-acoustic models for liquid consonants based on MRI and EPG data. Part II: The rhotics," JASA, Vol. 101, No. 2, pp. 1078-1089, February 1997.

M. Siqueira, R. Speece, V. Petsalis, A. Alwan, S. Soli and S. Gao, "Subband Adaptive Filtering Applied to Acoustic Feedback Reduction in Hearing Aids," Proceedings of the 30th Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA, 3-6 Nov 1996, Vol. 1, pp. 788-792.

Bergsneider, M., Alwan, A., and Rubinstein, E., "Venous blood flow pulsatility and impedance increases as compartment compliance decreases," Proc. of the Tenth International ICP Symposium, 1997, Marmarou et al. ed., Acta Neurochirurgica Supplement Vol. 71, p. 417, Springer-Verlag.

1996 ↑Top

J. Hant, B. Strope, and A. Alwan, "A Psychoacoustic Model for the Noise Masking of Voiceless Plosive Bursts," Proceedings of the Int. Conf. Spoken Lang. Processing (ICSLP), Philadelphia, pp. 570-573, October 1996.

P. Bangayan, A. Alwan, and S. Narayanan, "From MRI and Acoustic Data to Articulatory Synthesis: a Case Study of the Laterals," ICSLP Proc., Philadelphia, pp. 793-796, October 1996.

S. Narayanan, A. Kaun, D. Byrd, P. Ladefoged, and A. Alwan, "Liquids in Tamil," ICSLP Proc., Philadelphia, pp. 797-800, October 1996.

B. Strope and A. Alwan, "A Model of Dynamic Auditory Perception and its Application to Robust Speech Recognition," Proc. of the IEEE Int. Conf. Acous. Speech Sig. Proc., Vol. I, pp. 37-40, Atlanta, May 1996.

S. Narayanan and A. Alwan, "Parametric Hybrid Source Models for Voiced and Voiceless Fricative Consonants," ICASSP 96 Proceedings, Vol. I, pp. 337-340, Atlanta, May 1996.

Alwan, A.; Bagrodia, R.; Bambos, N.; Gerla, M.; and others, "Adaptive Mobile Multimedia Networks," IEEE Personal Communications, Vol. 3, No. 2, pp. 34-51, April 1996.

S. Narayanan and A. Alwan, "Imaging Applications in Speech Production Research," SPIE 96 Medical Imaging Proceedings, 2709, 120-131, Newport Beach, Feb. 96 (Invited).

1995 ↑Top

A. Alwan, S. Narayanan, B. Strope, and A. Shen, "Speech Production and Perception Models and their Applications to Synthesis, Recognition, and Coding," Proc. of the Int. Symp. Sig. Sys. and Elec. (ISSSE), pp. 367-372, October 1995 (Invited).

S. Narayanan, A. Alwan, and K. Haker, "An Articulatory Study of Fricative Consonants using MRI," JASA, Vol. 98(3), pp. 1325-1347, September 1995.

S. Narayanan, A. Alwan, and K. Haker, "An Articulatory Study of Liquid Consonants in American English," Proc. of the Int. Con. of Phon. Sci. (ICPhS), Stockholm, Sweden, Vol. 3, pp. 576-579, August 1995.

A. Alwan, P. Bangayan, J. Kreiman, and C. Long, "Time and Frequency Synthesis Parameters for Severe Pathological Voice Qualities," Proc. of ICPhS, Stockholm, Sweden, Vol. 2, pp. 250-253, August 1995.

M. Siqueira, A. Alwan, and P. Diniz, "Finite Precision Analysis of the Fast QRD-RLS Lattice Algorithm," Proc. of the IEEE Intl. Symp. Ckts. Sys. (ISCAS), Vol. 3, pp. 1616-1619, Seattle, WA, May 1995.

A. Shen, B. Tang, A. Alwan, and G. Pottie, "A Robust and Variable-Rate Speech Coder," Proc. of the IEEE Int. Conf. Acous. Speech Sig. (ICASSP) 95, Vol. I, pp. 249-252, Detroit, May 1995.

B. Tang, A. Shen, G. Pottie, and A. Alwan, "Spectral Analysis of Subband Filtered Signals," Proc. of the IEEE ICASSP 95, Vol. II, pp. 1324-1327, Detroit, May 1995.

B. Strope and A. Alwan, "A Novel Structure to Compensate for Frequency-Dependent Loudness Recruitment of Sensorineural Hearing Loss," Proc. of the IEEE ICASSP 95, Vol. V, pp. 3539-3542, Detroit, May 1995.

S. Narayanan and A. Alwan, "A Nonlinear Dynamical Systems Analysis of Fricative Consonants," JASA, Vol. 97, No. 4, pp. 2511-2524, April 1995.

1994 ↑Top

S. Narayanan, A. Alwan, and K. Haker, "An MRI Study of Fricative Consonants," Proc. of the Intl. Conf. Spoken Lang. Processing (ICSLP), Japan, Vol. 2, pp. 627-630, September 1994.

M. Siqueira, P. Diniz and A. Alwan, "Infinite Precision Analysis of the Fast QR Decomposition RLS Algorithm," Proc. IEEE ISCAS, London, pp. 293-296, June 1994.

Z. Jiang, A. Alwan, and A. Willson, "High-Performance IIR QMF Banks for Speech Subband Coding," Proc. IEEE ISCAS, London, pp. 493-496, June 1994.

M. Siqueira and A. Alwan, "New Techniques for Adaptive Filtering Applied to Speech Echo Cancelation," Proc. IEEE ICASSP, Australia, pp. 265-268, April 1994.

Before 1994 ↑Top

S. Narayanan and A. Alwan, "Strange Attractors and Chaotic Dynamics in the Production of Voiced and Voiceless Fricatives," Proc. Eurospeech, Vol. I, pp. 77-80, Berlin, September 1993.

A. Alwan, "A Perceptual Metric for Masking," Proc. IEEE ICASSP, Vol. 2, pp. 712-715, April 1993.

A. Alwan, "The role of F3 and F4 in identifying the place of articulation for stop consonants," ICSLP Proc., Vol. 2, pp. 1063-1066, Canada, Oct. 1992.

A. Alwan, "Modeling Speech Perception in Noise: a Case Study of the Place of Articulation Feature," ICPhS Proc., Vol. 2, pp. 78-81, France, August 1991.

A. Alwan, "Perceptual cues for place of articulation for the voiced pharyngeal and uvular consonants," JASA, Vol. 86, No. 2, pp. 549-556, August 1989.

Published Abstracts ↑Top

Hong You and Abeer Alwan, "The role of temporal modulation processing in speech/non-speech discrimination tasks," J. Acoust. Soc. Am. Volume 127, Issue 3, p. 1817, 2010

Markus Iseli, Yen-Liang Shue, and Abeer Alwan, "Analysis of vowel and speaker dependencies of source harmonic magnitudes in consonant-vowel utterances," JASA 117, 2619, 2005

Jianxia Xue, Abeer Alwan, Jintao Jiang, and Lynne E. Bernstein, "Phoneme clustering based on segmental lip configurations in naturally spoken sentences," JASA 117, 2573, 2005

Jianxia Xue, Abeer Alwan, Edward T. Auer, Jr., and Lynne E. Bernstein, "On audio-visual synchronization for viseme-based speech synthesis," J. Acoust. Soc. Am. 116, 2480, 2004

Markus Iseli and Abeer Alwan, "An improved correction formula for the estimation of voice source harmonic magnitudes," JASA 115, 2610, 2004

Jul Setsu Cha and Abeer Alwan, "On the acoustic effects of piriform recesses in speech production," J. Acoust. Soc. Am. 112, 2445, 2002

Brian C. Gabelman, Jody Kreiman, Bruce R. Gerratt, and Abeer Alwan, "Synthesis of nonperiodic features of pathological voices," JASA, May 2001, Vol. 109, Issue 5, p. 2416

Abeer Alwan, "The 'noisy' speech chain," JASA, Dec. 2000, Vol. 108, Issue 5, pp. 2626-2627

J. Jiang, A. Alwan, P. Keating, L. Bernstein, and E. Auer, "On the correlation between articulatory and acoustic data," JASA, Dec. 2000, Vol. 108, Issue 5, p. 2508

Patricia A. Keating, Taehong Cho, Marco Baroni, Sven Mattys, Lynne E. Bernstein, Brian Chaney, and Abeer Alwan, "Articulation of word and sentence stress," JASA, Dec. 2000, Vol. 108, Issue 5, p. 2466

J. Jiang, A. Alwan, P. Keating, and L. Bernstein, "On the correlation between orofacial movements, tongue movements, and speech acoustics," JASA, May 2000, Vol. 107, Issue 5, p. 2904

M. Chen and A. Alwan, "On the perception of voicing for plosives in noise," JASA, May 2000, Vol. 107, Issue 5, p. 2917

Lynne E. Bernstein, Edward T. Auer, Jr., Brian Chaney, Abeer Alwan, and Patricia A. Keating, "Development of a facility for simultaneous recordings of acoustic, optical (3-D motion and video), and physiological speech data," JASA, May 2000, Vol. 107, Issue 5, p. 2887

C. Espy-Wilson, S. Boyce, M. Jackson, A. Alwan and S. Narayanan, "Modeling the subglottal space for American English /r/," JASA, September 1998, Vol. 104, Issue 3, p. 1819

B. Gerratt, J. Kreiman, N. Antonanzas-Barroso, B. Gabelman, and A. Alwan, "Source modeling of severely pathological voices," JASA, May 1998, Vol. 103, Issue 5, p. 2892

B. Gabelman, J. Kreiman, B. Gerratt, N. Antonanzas-Barroso, and A. Alwan, "LF source model adequacy for pathological voices," JASA, Nov. 1997, Vol. 102, Issue 5, p. 32

C. Espy-Wilson, S. Narayanan, A. Alwan, and S. Boyce, "Modeling the acoustics of American English /r/," JASA, May 1997, Vol. 101, Issue 5, p. 3176

B. Strope and A. Alwan, "Dynamic auditory representations and statistical speech recognition: Threading spectral peaks for robust recognition," Proc. of the Acoustical Societies of Amer. and Japan, Vol. 100, No. 4, p. 2788, Dec. 1996. This paper received the best student paper award in Speech Communication at the ASA meeting.

P. Bangayan, A. Alwan, and S. Narayanan, "A transmission-line model of the lateral approximants," Proc. of the Acous. Societies of Amer. and Japan, Vol. 100, No. 4, Dec. 1996.

J. Hant, B. Strope, and A. Alwan, "Predicting noise-masked thresholds of plosive bursts," 4th Lake Arrowhead Conference on Issues in Advanced Hearing Aid Research, May 1996.

R. Speece, A. Alwan, M. Siqueira, S. Soli, S. Gao, "An Analysis of the Acoustic Feedback Path Transfer Function in Hearing Aids," 4th Lake Arrowhead Conference on Issues in Advanced Hearing Aid Research, May 1996.

B. Gabelman, J. Kreiman, B. Gerratt, and A. Alwan, "Optimization for source waveform synthesis of pathological voices," JASA, April 1996, Vol. 99, Issue 4, p. 2549

J. Hant, B. Strope, and A. Alwan, "Durational Effects on Masked Thresholds in Noise as a Function of Signal Frequency, Bandwidth, and Type," Proc. of the Acous. Soc. of Amer. (ASA), Vol. 98, No. 5, p. 2908, Nov. 1995.

B. Strope and A. Alwan, "A First-Order Model of Dynamic Auditory Perception," Proc. NIH Hearing Aid Research and Development Workshop, September 1995.

A. Alwan, M. Siqueira, S. Soli, and S. Gao, "An Analysis of the Acoustic Path Transfer Function in Hearing Aids," Proc. NIH Hearing Aid Research and Development Workshop, September 1995.

J. Saade, F. Zeng, J. Wygonski, R. Shannon, S. Soli, and A. Alwan, "Quantitative measures of envelope cues in speech recognition," Proc. ASA, June 1995.

S. Narayanan, A. Alwan, and K. Haker, "Three dimensional tongue shapes of sibilant fricatives," JASA, Vol. 96, No. 5, p. 3342 (A), Nov. 1994.

B. Strope and A. Alwan, "Mapping of Constant Loudness Contours with Filter Mixtures in Digital Hearing Aids," Lake Arrowhead Conference on Hearing Aid Research, June 1994.

P. Bangayan, A. Alwan, J. Kreiman, and C. Long, "Synthesis of Severely Pathological Voices," JASA, Vol. 95, No. 5, 1pSP5, May 1994.

C. Long, P. Bangayan, and A. Alwan, "Acoustic Analysis and Synthesis of Pathological Voice Qualities," JASA, Vol. 93, No. 3, Pt. 2, 2aSP9, Oct. 1993. This paper received the best student paper award in Speech Communication at the ASA meeting.

M.S. Theses in Electrical Engineering ↑Top

Julien van Hout, "Low Complexity Spectral Imputation for Noise Robust Speech Recognition," 5/2012

Yi-Hui Lee, "An exploration study of the effect of voice quality on subglottal resonances," 6/2010

Sankaran Panchapagesan, "Modeling the Production of /l/ Based on MRI data," 3/2003

Ivo Locher, "Design and Implementation of iBadge and its Distributed Speech Processing Capability," 9/2002

Jul Setsu Cha, "Articulatory Speech Synthesis of Female and Male Talkers," 12/01.

Vladimir Teplitsky, "A Noise Robust Speech Enhancement Algorithm for Cochlear Hearing Loss," 9/01.

Marcia Chen, "Perception of Voicing for Syllable-Initial Plosives in Noise," 6/01.

Willa Chen, "Perception of Place of Articulation for Syllable-Initial Consonants in Noise," 6/01.

Steve Chen, "Segregated and Redundant Hidden-Markov Models for Alphabet Recognition in Cars," 9/99.

Alexis Bernard, "Source-Channel Coding of Speech," [pdf] [ps] [pdf.zip] [ps.zip], 12/98.

Lisa Falkson, "A circuit model for studying the dynamics of the intracranial compartment," 7/98.

Jeff Lo, "Perception and recognition of nasal consonants in quiet and in noise," 7/98.

Amit Rane, "Forward and Inverse Mapping of the Vocal Tract," 7/98.

Vaggelis Petsalis, "Automatic speech recognition of isolated digits in noise," 1/97.

Wayne Bayever, "Design and implementation of a formant vocoder," 9/96.

Philbert Bangayan, "A transmission-line model of /l/ based on MRI-derived data," 9/96.

James Hant, "A psychoacoustic model to predict the noise masking of plosive bursts," 6/96.

Yong Song, "Finite time-difference simulations of speech production," 7/95.

Brian Strope, "A model of dynamic auditory perception and its application to robust speech recognition," 6/95.

Albert Shen, "Perceptually-based subband coding of speech signals," 6/94.

Ph.D. Dissertations in Electrical Engineering ↑Top

Wei Chu, "Noise Robust Signal Processing for Human Pitch Tracking and Bird Song Classification and Detection," 12/2011.

Jonas Borgstrom, "Inference of Missing or Degraded Data for Noise Robust Speech Processing," 06/2010.

Yen-Liang Shue, "The Voice Source in Speech Production: Data, Analysis and Models," 03/2010.

Shizhen Wang, "Rapid Speaker Normalization and Adaptation with Applications to Automatic Evaluation of Children's Language Learning Skills," 03/2010.

Hong You, "Robust Automatic Speech Recognition Algorithms for Dealing with Noise and Accent," 08/2009.

Jianxia Xue, "Acoustically-Driven Talking Face Animations Using Dynamic Bayesian Network," 12/2008.

Sankaran Panchapagesan, "Frequency Warping by Linear Transformation, and Vocal Tract Inversion for Speaker Normalization in Automatic Speech Recognition," 06/2008.

Markus Iseli, "Dependencies of voice source measures on age, sex, vowel context, and prosodic features," 06/2007.

Xiaodong Cui, "Environmental and Speaker Robustness in Automatic Speech Recognition with Limited Learning Data," 8/05.

Jintao Jiang, "Relating Optical Speech to Speech Acoustics and Visual Speech Perception," 10/03.

Brian Gabelman, "Analysis and Synthesis of Pathological Vowels," 8/03.

Alexis Bernard, "Source and Channel Coding for Speech Transmission and Remote Speech Recognition," [pdf] [ps], 3/02.

Qifeng Zhu, "Noise Robust Front-End Processing for Automatic Speech Recognition," 12/01.

James Hant, "A Computational Model to Predict Human Perception of Speech in Noise," 6/00.

Fred Chi, "Adaptive Feedback Cancellation for Hearing Aids: Theories, Algorithms, Computations, and Systems," 11/99.

Marcio Siqueira, "Adaptive filtering algorithms in acoustic echo cancellation and feedback reduction," 9/98.

Brian Strope, "Modeling auditory perception for robust speech recognition," 8/98.

Shrikanth Narayanan, "Fricative consonants: an articulatory, acoustic, and systems study," 6/95.

 
