Speech Coding and Echo Cancelation for Wireless Communication


Project Summary

Designing high-quality speech coders and echo-cancelation schemes for wireless networks is challenging because speech quality must be maintained at low power consumption, under time-varying channel conditions, and within limited bandwidth. The design must account for several parameters: bit rate, delay, power consumption, complexity, and the quality of the coded speech. The available bandwidth depends on the network protocols. The subset of parameters that is optimized depends on the application.

In the past, speech codec design was driven mostly by bandwidth efficiency, because the target application was telephony, where the channel does not vary considerably with time and the signal-to-noise ratio (SNR) is relatively high. For example, Code Excited Linear Prediction (CELP) based coders are popular because of their low bit rates. The performance of these coders, however, deteriorates significantly in the presence of background noise, and their complexity is rather high. In addition, CELP coders perform poorly on female speech and on non-speech signals such as music. As a result, new standards for personal communication services are likely to use high-quality, medium bit-rate speech codecs such as the codec we have developed.

Work supported by ARPA CSTO and by NSF IRI 9309418.


Keywords

Wireless Communication, Perceptual Coding, Echo Cancelation.


Project References

Srinivasamurthy, N.; Ortega, A.; Zhu, Q.; Alwan, A. ``Towards efficient and scalable speech compression schemes for robust speech recognition applications.'' IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia, NY, 30 July-2 Aug. 2000, IEEE p. 249-52 vol.1.

A. Alwan, S. Narayanan, B. Strope, and A. Shen, ``Speech production and perception models and their applications to synthesis, recognition, and coding,'' in Speech Processing, Recognition, and Artificial Neural Networks, Chollet, DiBenedetto, Esposito, and Marinaro ed., p. 138-161, Springer-Verlag, UK, 1999.

A. Bernard and A. Alwan, ``Perceptually-based and embedded wideband CELP coding of speech,'' Proc. of Eurospeech 99, Budapest, Hungary, Vol. 4, p. 1543-1546, September 1999.

A. Bernard, X. Liu, R. Wesel, and A. Alwan, ``Embedded joint source-channel coding of speech using symbol puncturing of trellis codes,'' Proc. IEEE ICASSP, Vol. 5, p. 2427-2430, Phoenix, March 1999.

A. Bernard, X. Liu, A. Alwan, and R. Wesel, ``Channel adaptive joint source-channel coding of speech,'' Proc. of the 32nd Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA, November, 1998, vol. 1, pp. 357-361.

Alexis P. Bernard, ``Source-channel coding of speech,'' unpublished M.S. thesis, Dept. of Electrical Engineering, UCLA, 1998.

B. Tang, A. Shen, A. Alwan, and G. Pottie, ``A perceptually-based embedded subband speech coder,'' IEEE Transactions on SAP, Vol. 5, No. 2, p. 131-140, March 1997.

Wayne Bayever, ``Design and implementation of a formant vocoder,'' M.S. thesis, EE Dept., UCLA, 9/96.

Alwan, A., Bagrodia, R., Bambos, N., Gerla, M., Kleinrock, L., Short, J., and Villasenor, J. ``Adaptive mobile multimedia networks.'' IEEE Personal Communications, April 1996, vol.3, (no.2):34-51.

Jain, R.; Alwan, A.; Gerla, M.; Kleinrock, L.; and others. ``Multimedia wireless networking.'' (Multimedia Computing and Networking 1996, San Jose, CA, USA, 29-31 Jan. 1996). Proceedings of the SPIE - The International Society for Optical Engineering, 1996, vol.2667:70-6.

M. Siqueira, A. Alwan, and P. Diniz, ``Finite Precision Analysis of the Fast QRD-RLS Lattice Algorithm,'' Proc. of IEEE ISCAS, pp. 1616-1619, Vol. 3, Seattle, WA, May 1995.

A. Alwan, S. Narayanan, B. Strope, and A. Shen, ``Speech Production and Perception Models and their Applications to Synthesis, Recognition, and Coding,'' Proc. ISSSE, Oct. 1995.

A. Shen, B. Tang, A. Alwan, and G. Pottie, ``A Robust and Variable-Rate Speech Coder,'' Proc. Int. Conf. Acous. Speech Sig. Proc. (ICASSP) 1995, Vol. I, 249-252.

B. Tang, A. Shen, G. Pottie, and A. Alwan, ``Spectral Analysis of Subband Filtered Signals,'' Proc. ICASSP 1995, Vol. II, 1324-1327.

Albert Shen, ``Perceptually-based subband coding of speech signals,'' M.S. thesis, EE Department, UCLA, 6/94.

J. Zhong, A. Alwan, and A. Willson, ``High-Performance IIR QMF Banks for Speech Subband Coding,'' Proc. IEEE ISCAS, London, June 1994, 493-496.

M. Siqueira and A. Alwan, ``New Techniques for Adaptive Filtering Applied to Speech Echo Cancellation,'' Proc. IEEE ICASSP, Australia, April 1994, 265-268.


Philbert Bangayan (bangayan@icsl.ucla.edu)