Speech & natural language publications
-
Comparison of Neutralizing Abilities of Human Monoclonal Antibodies Binding Different Epitopes on Botulinum Neurotoxin A
-
Adaptive Gaussian Backend for Robust Language Identification
This paper proposes adaptive Gaussian backend (AGB), a novel approach to robust language identification (LID).
-
Improving Language Identification Robustness to Highly Channel-Degraded Speech through Multiple System Fusion
We describe a language identification system developed for robustness to noise conditions such as those encountered under the DARPA RATS program, which is focused on multi-channel audio collected in high…
-
Modulation features for noise robust speaker identification
In this paper, we present a robust acoustic feature on top of robust modeling techniques to further improve speaker identification performance.
-
Strategies for high accuracy keyword detection in noisy channels
We present design strategies for a keyword spotting (KWS) system that operates in highly degraded channel conditions with very low signal-to-noise ratio levels.
-
A Noise-Robust System for NIST 2012 Speaker Recognition Evaluation
This paper presents SRI’s submission along with a careful analysis of the approaches that provided gains for this challenging evaluation including a multiclass voice-activity detection system, the use of noisy…
-
All for one: Feature combination for highly channel-degraded speech activity detection
This paper presents a feature combination approach to improve speech activity detection (SAD) on highly channel-degraded speech as part of the Defense Advanced Research Projects Agency’s (DARPA) Robust Automatic Transcription of Speech…
-
Damped oscillator cepstral coefficients for robust speech recognition
This paper presents a new signal-processing technique motivated by the physiology of the human auditory system.
-
“Can You Give Me Another Word for Hyperbaric?”: Improving Speech Translation Using Targeted Clarification Questions
We present a novel approach for improving communication success between users of speech-to-speech translation systems by automatically detecting errors in the output of automatic speech recognition (ASR) and statistical machine…
-
N-Gram Extension for Bag-of-Audio-Words
With ... enhanced representation, we find the average probability of miss noticeably decreases when evaluated on TRECVID 2011 and 2012 datasets, indicating clear improvements on the multimedia event detection task.
-
Rich system combination for keyword spotting in noisy and acoustically heterogeneous audio streams
We address the problem of retrieving spoken information from noisy and heterogeneous audio archives using a rich system combination with a diverse set of noise-robust modules and audio characterization.
-
Articulatory trajectories for large-vocabulary speech recognition
We present a neural network model to estimate articulatory trajectories from speech signals where the model was trained using synthetic speech signals generated by Haskins Laboratories’ task-dynamic model of speech…