Speech & natural language publications

October 1, 2013

Comparison of Neutralizing Abilities of Human Monoclonal Antibodies Binding Different Epitopes on Botulinum Neurotoxin A

Biomedical sciences publications, Publications, Speech & natural language publications
August 1, 2013

Adaptive Gaussian Backend for Robust Language Identification

ByAaron Lawson, Mitchell McLaren

This paper proposes adaptive Gaussian backend (AGB), a novel approach to robust language identification (LID).

Publications, Speech & natural language publications
August 1, 2013

Improving Language Identification Robustness to Highly Channel-Degraded Speech through Multiple System Fusion

ByAaron Lawson, Martin Graciarena, Mitchell McLaren

We describe a language identification system developed for robustess to noise conditions such as those encountered under the DARPA RATS program, which is focused on multi-channel audio collected in high…

Publications, Speech & natural language publications
August 1, 2013

Modulation features for noise robust speaker identification

ByHoracio Franco, Martin Graciarena, Mitchell McLaren

In this paper, we present a robust acoustic feature on top of robust modeling techniques to further improve speaker identification performance.

Publications, Speech & natural language publications
August 1, 2013

Strategies for high accuracy keyword detection in noisy channels

ByAndreas Kathol, Dimitra Vergyri, Horacio Franco, Martin Graciarena

We present design strategies for a keyword spotting (KWS) system that operates in highly degraded channel conditions with very low signal-to-noise ratio levels.

Publications, Speech & natural language publications
August 1, 2013

A Noise-Robust System for NIST 2012 Speaker Recognition Evaluation

ByMartin Graciarena, Mitchell McLaren

This paper presents SRI’s submission along with a careful analysis of the approaches that provided gains for this challenging evaluation including a multiclass voice-activity detection system, the use of noisy…

Publications, Speech & natural language publications
August 1, 2013

All for one: Feature combination for highly channel-degraded speech activity detection

ByHoracio Franco, Martin Graciarena

This paper presents a feature combination approach to improve SAD on highly channel degraded speech as part of the Defense Advanced Research Projects Agency’s (DARPA) Robust Automatic Transcription of Speech…

National security publications, Publications, Speech & natural language publications
August 1, 2013

Damped oscillator cepstral coefficients for robust speech recognition

ByHoracio Franco, Martin Graciarena

This paper presents a new signal-processing technique motivated by the physiology of human auditory system.

Publications, Speech & natural language publications
May 1, 2013

“Can You Give Me Another Word for Hyperbaric?”: Improving Speech Translation Using Targeted Clarification Questions

ByAndreas Kathol

We present a novel approach for improving communication success between users of speech-to-speech translation systems by automatically detecting errors in the output of automatic speech recognition (ASR) and statistical machine…

Publications, Speech & natural language publications
May 1, 2013

N-Gram Extension for Bag-of-Audio-Words

With ... enhanced representation, we find the average probability of miss noticeably decreases when evaluated on TRECVID 2011 and 2012 datasets, indicating clear improvements on the multimedia event detection task.

Publications, Speech & natural language publications
May 1, 2013

Rich system combination for keyword spotting in noisy and acoustically heterogeneous audio streams

We address the problem of retrieving spoken information from noisy and heterogeneous audio archives using a rich system combination with a diverse set of noise-robust modules and audio characterization.

Publications, Speech & natural language publications
May 1, 2013

Articulatory trajectories for large-vocabulary speech recognition

ByColleen Richey

We present a neural network model to estimate articulatory trajectories from speech signals where the model was trained using synthetic speech signals generated by Haskins Laboratories’ task-dynamic model of speech…

Publications, Speech & natural language publications