I received my B.S. degree from Department of Electronic Engineering, Tsinghua Unvierstiy, Beijing, P.R. China in 2008. Currently, I'm a Ph.D candidate in Prof. Abeer Alwan's speech lab.
My research area includes: speech production system modeling, voice source estimation, voice quality analysis, signal processing for clinical assessment, speech recognition, speech activity detection, voice quality analysis. Current research projects includes modeling the glottal sources signal, synthesizing natural voices using the glottal model, and perceptual validation.

CV download.


Publications

Journal Publications

Gang Chen, Jody Kreiman, Bruce Gerratt, Juergen Neubauer, Yen-Liang Shue, and Abeer Alwan, "Development of a glottal area index that integrates glottal gap size and open quotient," Journal of the Acoustical Society of America, Vol. 133, Issue 3, pp. 1656–1666. [pdf]

Jody Kreiman, Yen-Liang Shue, Gang Chen, Markus Iseli, Bruce R. Gerratt, Juergen Neubauer, and Abeer Alwan, "Relationships among voice quality, harmonic amplitudes, open quotient, and glottal area waveform shape in sustained phonation," Journal of the Acoustical Society of America, Volume 132, Issue 4, pp. 2625-2632 (2012).

Conference papers

G. Chen, M. Garellek, J. Kreiman, B. R. Gerratt, A. Alwan, "A perceptually and physiologically motivated voice source model", Interspeech 2013, accepted.

G. Chen, R. A. Samlan, J. Kreiman, A. Alwan, "Investigating the relationship between glottal area waveform shape and harmonic magnitudes through computational modeling and laryngeal high-speed videoendoscopy", Interspeech 2013, accepted.

Jody Kreiman, Marc Garellek, Gang Chen, Abeer Alwan, and Bruce R.Gerratt, "Perceptual evaluation of source models," International Conference on Voice Physiology and Biomechanics, 2012

G. Chen, Y.-L. Shue, J. Kreiman, and A. Alwan, "Estimating the voice source in noise", Interspeech 2012.

G. Chen, J. Kreiman, and A. Alwan, "The Glottaltopograph: A Method of Analyzing High-Speed Images of the Vocal Folds", ICASSP 2012, pp. 3985-3988. [toolkit]

G. Chen, J. Kreiman, Yen-Liang Shue, and A. Alwan, "Acoustic Correlates of Glottal Gaps," Interspeech 2011, pp 2673-2676

Y.-L. Shue, G. Chen, and A. Alwan, "On the Interdependencies between Voice Quality, Glottal Gaps, and Voice-Source related Acoustic Measures," Interspeech 2010, pp. 34-37.

G. Chen, X. Feng, Y.-L. Shue, and A. Alwan, "On Using Voice Source Measures in Automatic Gender Classification of Children's Speech," Interspeech 2010, pp. 673-676.


Research Experiences

Speech Processing and Audio Perception Lab, UCLA 09/2008 - present
Research Assistant, Advisor: Prof. Abeer Alwan
       -  Applied voice source measures in automatic gender classification on children's voice. Implemented SVM classifier based on fundamental frequency and formant frequencies with additional voice source related features. Classification accuracies were improved by 4.4% on average for all age groups (age 8- age 17).
       -  Analyzed high-speed image data of the vocal folds from various voice qualities. Proposed a new voice source signal model of human speech production system. Applied the proposed voice source signal model to automatic voice source estimation from the acoustic speech signal in noise. The proposed method outperformed state-of-the-art source estimation algorithms.
       -  Developed a statistical method ''glottaltopograph'' to automatically visualize and analyze high-speed vocal-fold video recording. The proposed method could automatically locate the problematic region of the laryngeal area for clinical assessment

Internship Experiences

• Signal processing group, Starkey Lab, Eden Prairie, MN.   Jun 2012-Sep 2012
Summer intern, Mentor: Dr.Ivo Merks 

    -Speech dereverberation for hearing aid applications. Applied a statistical room acoustic model to estimate the reverberation time. Estimated the late reflections and removed them via spectral subtraction to enhance the speech signal. The proposed algorithm is able to reliably estimate the reverberation time in noise with a low complexity for hearing aid devices.

• Speech group, Disney Research, Pittsburgh, PA.   Jun 2011-Sep 2011
Summer intern, Mentors: Dr.Kenichi Kumatani and Dr. John McDonough

    -Developed algorithms for multiple-speaker voice activity detection in Python and C++. Implemented this front-end processing in a speech recognition system of an interactive game for multiple children. This is a prototype developed for Disneyland theme parks.

• Hardware group, 3M Cogent, Pasadena, CA.         Jul 2010-Sep 2010
Summer intern, Mentor: Dr. Charley Lu   

    -Developed algorithms for fingerprint capturing and enhancement on WinCE/Window Mobile platforms. Designed GUI for mobile device.
    -Applied noise reduction on front-end of Speaker Identification project in mobile device. Detected and updated noise spectrum in real time. Designed filter banks and applied spectral subtraction to enhance the speech.

EMC Beijing/ Hong Kong, China                               Jul 2007- Sep2007
    -Performed database management (MySql) in projects with City University of Hong Kong