Speech & natural language publications
-
Comparison of Neutralizing Abilities of Human Monoclonal Antibodies Binding Different Epitopes on Botulinum Neurotoxin A
-
Strategies for high accuracy keyword detection in noisy channels
We present design strategies for a keyword spotting (KWS) system that operates in highly degraded channel conditions with very low signal-to-noise ratio levels.
-
All for one: Feature combination for highly channel-degraded speech activity detection
This paper presents a feature combination approach to improve SAD on highly channel degraded speech as part of the Defense Advanced Research Projects Agency’s (DARPA) Robust Automatic Transcription of Speech…
-
A Noise-Robust System for NIST 2012 Speaker Recognition Evaluation
This paper presents SRI’s submission along with a careful analysis of the approaches that provided gains for this challenging evaluation including a multiclass voice-activity detection system, the use of noisy…
-
Damped oscillator cepstral coefficients for robust speech recognition
This paper presents a new signal-processing technique motivated by the physiology of human auditory system.
-
Adaptive Gaussian Backend for Robust Language Identification
This paper proposes adaptive Gaussian backend (AGB), a novel approach to robust language identification (LID).
-
Improving Language Identification Robustness to Highly Channel-Degraded Speech through Multiple System Fusion
We describe a language identification system developed for robustess to noise conditions such as those encountered under the DARPA RATS program, which is focused on multi-channel audio collected in high…
-
Modulation features for noise robust speaker identification
In this paper, we present a robust acoustic feature on top of robust modeling techniques to further improve speaker identification performance.
-
A noise robust i-vector extractor using vector taylor series for speaker recognition
We propose a novel approach for noise-robust speaker recognition, where the model of distortions caused by additive and convolutive noises is integrated into the i-vector extraction framework.
-
Improving Speaker Identification Robustness to Highly Channel-Degraded Speech Through Multiple System Fusion
This article describes our submission to the speaker identification (SID) evaluation for the first phase of the DARPA Robust Audio and Transcription of Speech (RATS) program.
-
Fusion of acoustic, perceptual and production features for robust speech recognition non-stationary noise
This paper shows that fusion of multiple noise robust feature streams motivated by speech production and perception theories help to significantly improve the robustness of traditional speech recognition systems.
-
N-Gram Extension for Bag-of-Audio-Words
With ... enhanced representation, we find the average probability of miss noticeably decreases when evaluated on TRECVID 2011 and 2012 datasets, indicating clear improvements on the multimedia event detection task.