spacer
spacer
header


 

Wei Chu

Room 63-134 Engr. IV
Department of Electrical Engineering, UCLA
Speech Processing and Auditory Perception Laboratory

Email:

Detailed CV: (pdf)

Research Interests  

Speech Recognition, Audio Processing, Statistical Signal Processing, Machine Learning, and Game Theory.

Research Experiences  

Speech Processing and Audio Perception Lab, UCLA 09/2007 - present
Research Assistant, Advisor: Prof. Abeer Alwan
– Bird song detection and classification
– Noise robust F0 tracking.

Speech Lab, Rosetta Stone 06/2009 - 08/2009
Summer Intern, Mentor: Dr. Bryan Pellom
– Use statistically-based methods to decide the pronunciation of a word.

Speech Group, Mitsubishi Electric Research Lab 06/2008 - 09/2008
Summer Intern, Mentor: Prof. Bhiksha Raj
– Discriminative training on Sphinx III.

Speech Group, Microsoft Research Asia, Beijing 04/2007 - 08/2007
Summer Intern, Mentor: Dr. Chao Huang
– Acoustic events (speech, music, ring tone, background noise) detection in office environment.

Microprocessor Tech Lab, Intel China Research Center, Beijing 07/2006 - 10/2006
Research Intern, Mentor: Dr. Wei Hu
– Main actors locating and tracking on TV series and movies.

Tsinghua University, Beijing 09/2004 - 07/2007
Research Assistant, Advisor: Prof. Jia Liu
– Real-time Speech-To-Text system with non-speech input rejection on TI 5509 EVM. Master thesis work.
– Non-speech removal frontend for national ’863’ and ’242’ keyword spotting evaluation.

UFIDA Software Corp., Beijing 02/2004 - 06/2004
Software Intern, Supervisor: Mr. Yu Zhu
– Indexing and displaying the digital map for on-vehicle GPS software system. Bachelor thesis work.

Publications  

W. Chu, A. Alwan, "A correlation-maximization denoising filter used as an enhancement frontend for noise robust bird call classification,” InterSpeech 2009, pp. 2831-2834. [slides]

W. Chu, A. Alwan, "Reducing F0 frame error of F0 tracking algorithms under noisy conditions with an unvoiced/voiced classification frontend," ICASSP 2009, pp.3969-3972. [slides]

W. Chu, J. Liu, "Using Confidence Measures to Evaluate the Speaker Turns in Speaker Segmentation," Proc of Intl Conf on Information Sciences, Signal Processing and its Application (ISSPA07).

W Chu, J. Liu, "Subband Energy Distance Measure Applied in Multi-Pass Speech/Non-Speech Discrimination," Proc of Intl Conf on Information Sciences, Signal Processing and its Application (ISSPA07).

W. Chu, X. Xiao, J. Liu, "Confidence Score Based Unsupervised Incremental Adaptation for OOV Words Detection," Proc of Intl Workshops on Statistical Techniques in Pattern Recognition (SSSPR06), pp.723-731.

Master Thesis  

Wei Chu, "Noise and Interruptive Speech Rejection for the Embedded Speech Recognition System," Master thesis, Tsinghua University, Jun. 2007 (in Chinese).

 

 




Speech Tutorials & Links


 
Machine Learning  

• Hidden Markov Model

- Tutorial:
    * L. Rabiner "A tutorial on hidden Markov models and selected applications in speech recognition," Proceedings of the IEEE, 77 (2), pp. 257–286, February 1989.

- Toolkit:
    * HTK BOOK 3.4 (for my own use)

    * Mark Hasegawa-Johnson's Speech Mini Course and HTK lecture video

• Gaussian Mixture Model

- Tutorial: Reynolds, Douglas A., Quatieri, Thomas F., and Dunn, Robert B., "Speaker verification using adapted Gaussian mixture models," Digital Signal Processing, Vol. 10, No. 1-3, pp. 19-41, January 2000.

- Toolkit: My GMM classifier written in C (available for download soon)

• Support Vector Machine

- Tutorial: SVM on wiki

- Toolkit: LIBSVM

• Large-Margin Training

- Fei Sha, "Large margin training of acoustic models for speech recognition,", PhD Thesis, 2007

- Hui Jiang, "Large margin hidden Markov models for speech recognition", IEEE Trans. On Audio, Speech and Language Processing,  pp.1584-1595, Vol. 14, No. 5, September 2006.

Speech Analysis  

F0 Tracking or Pitch Detection Algorithm

  - Study:
    * L. Rabiner, M. Cheng, A. Rosenberg, and C. McGonegal, "A comparative performance study of several pitch detection algorithms," IEEE Trans. on Acoustics, Speech, and Signal Processing, vol. 24, no. 5, pp. 399–418, 1976.

  - Toolkit:
    * Praat (has visualization function)

    * ESPS get_f0: D. Talkin, "Robust algorithm for pitch tracking," Speech Coding and Synthesis, pp. 497–518, 1995. Wavesurfer (use ESPS get_f0, with visualization function)

    * TEMPO (a part of STRAIGHT toolkit):

    * YIN (for the pitch of music)

Speech Recognition  

Tutorial:

  - Dan Ellis's ICSI Speech FAQ

 
spacer
spacer