Speech and Audio Research at HP Emerge Compute Labs

By Bilderback, Dayna

April 12, 2018

Speaker: Sunil Bharitkar, Ph.D.
Affiliation: Hewlett-Packard

Abstract: In HP’s Emerging Compute Lab, research is being conducted at the intersection of signal processing, auditory perception and machine learning to create fundamentally new experiences for differentiation in HP devices including VR HMD. In this talk we will present various techniques and algorithms, incorporating knowledge of binaural perception, machine learning, and signal processing, to enhance low-frequency perception, spatial rendering, and automated content classification. The research results have been validated through perceptual testing in large-scale studies giving statistically meaningful results. Ongoing research being conducted in the areas deep learning (stacked autoencoders and LSTM) for VR head-related transfer function synthesis, content classification, speech and multimodal biometrics, sensing towards emotion interpretation, and cancer cell data classification (jointly with Life Sciences Lab) will also be presented. The presentation will be accompanied with demonstrations.

Biography: Sunil Bharitkar received his Ph.D. in Electrical Engineering from the University of Southern California (USC) in 2004 and is involved in research in speech/audio analysis and processing including spatial audio for AR/VR, biometric & biomedical signal processing, multimodal signal processing, and machine learning. From 2011-2016 he was at Dolby leading/guiding research in audio, signal processing, haptics, machine learning, hearing augmentation, and standardization activities at ITU, SMPTE, AES. He co-founded the company Audyssey Laboratories in 2002 where he was VP of Research and responsible for inventing new technologies which were licensed to companies including IMAX, Denon, Audi, Sharp, etc. He also taught in the Department of Electrical Engineering at USC. Sunil has published over 50 technical papers and has over 20 patents in the area of signal processing applied to acoustics, neural networks and pattern recognition, and a textbook (Immersive Audio Signal Processing) from Springer-Verlag. He is a reviewer for papers at various conferences and journals. He has also been on the Organizing and Technical Program Committees of various conferences such as the 2008 and 2009 European Sig. Proc. Conference (EUSIPCO), the 57th AES Conference, SMPTE Conferences. He has also served as an invited tutorial speaker at the 2006 IEEE Conf. on Acoustics Speech and Signal Processing (ICASSP). He is a Senior Member of the IEEE, the Acoustical Soc. of America (ASA), European Association for Signal and Image Processing (EURASIP), and the Audio Eng. Soc. (AES). Sunil is a PADI diver & enjoys playing the Didgeridoo.

For more information, contact Prof. Abeer Alwan (alwan@ucla.edu)

Date/Time:
Date(s) - Apr 12, 2018
1:00 pm - 2:30 pm

Location:
E-IV Tesla Room #53-125
420 Westwood Plaza - 5th Flr., Los Angeles CA 90095