Insights into the Challenges and Opportunities of Large Multi-Modal Models for Blind and Low Vision Users: CLIP
PARIKSHA: A Scalable, Democratic, Transparent Evaluation Platform for Assessing Indic Large Language Models
Publication Capturing Long Distance Dependency in Language Modeling: An Empirical Study Jianfeng Gao, Hisami Suzuki International Conference on Natural Language Processing | May 2004
Publication Convolutional Networks for Speech Detection Somsak Sukittanon, Arun C. Surendran, John Platt, Chris J.C. Burges International Speech Communication Association | May 2004
Publication Noise Robust Speech Recognition with a Switching Linear Dynamic Model Jasha Droppo, Alex Acero Proc. ICASSP | May 2004
Publication Custom Arithmetic for High-speed, Low-resource ASR Systems Jonathan Malkin, Xiao Li, Jeff Bilmes IEEE International Conference on Acoustic, Speech and Signal Processing | May 2004
Publication Tone Articulation Modeling for Mandarin Spontaneous Speech Recognition Jian-lai Zhou, Ye Tian, Yu Shi, Chao Huang, Eric Chang IEEE | May 2004
Publication What’s in a translation rule? Michel Galley, Mark Hopkins, Kevin Knight, Daniel Marcu Proc. of HLT-NAACL | May 2004
Publication Parameter Sharing in Subband Likelihood-Maximizing Beamforming for Speech Recognition Using Microphone Arrays Mike Seltzer, Richard M. Stern Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing | May 2004
Publication A Structured Speech Model with Continuous Hidden Dynamics and Prediction-Residual Training for Tracking Vocal Tract Resonances Li Deng, Leo J. Lee, Hagai Attias, Alex Acero Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing | May 2004
Publication A Detection Based Approach to Robust Speech Understanding Kuansan Wang Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing | May 2004
Publication Segmental Tonal Modeling for Phone Set Design in Mandarin LVCSR Chao Huang, Yu Shi, Jianlai Zhou, Min Chu, Terry Wang, Eric Chang 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing | May 2004