Speech & natural language publications

September 1, 2016

Coping with Unseen Data Conditions: Investigating Neural Net Architectures, Robust Features, and Information Fusion for Robust Speech Recognition

ByHoracio Franco

This work investigates the performance of traditional deep neural networks under varying acoustic conditions and evaluates their performance with speech recorded under realistic background conditions that are mismatched with respect…

Publications, Speech & natural language publications
September 1, 2016

Minimizing Annotation Effort for Adaptation of Speech-Activity Detection Systems

ByMartin Graciarena

This paper focuses on the problem of selecting the best-possible subset of available audio data given a budgeted time for annotation.

Publications, Speech & natural language publications
September 1, 2016

The SRI CLEO Speaker-State Corpus

ByAndreas Kathol, Massimiliano de Zambotti

We introduce the SRI CLEO (Conversational Language about Everyday Objects) Speaker-State Corpus of speech, video, and biosignals.

Biomedical sciences publications, Publications, Speech & natural language publications
June 1, 2016

Exploring the role of phonetic bottleneck features for speaker and language recognition

ByAaron Lawson, Mitchell McLaren

Using bottleneck features extracted from a deep neural network (DNN) trained to predict senone posteriors has resulted in new, state-of-the-art technology for language and speaker identification.

Publications, Speech & natural language publications
June 1, 2016

A Phonetically Aware System for Speech Activity Detection

ByMartin Graciarena

In this paper, we focus on a dataset of highly degraded signals, developed under the DARPA Robust Automatic Transcription of Speech (RATS) program.

Publications, Speech & natural language publications
June 1, 2016

Analyzing the effect of channel mismatch on the SRI language recognition evaluation 2015 system

ByMitchell McLaren

We present the work done by our group for the 2015 language recognition evaluation (LRE) organized by the National Institute of Standards and Technology (NIST).

Publications, Speech & natural language publications
June 1, 2016

Noise and reverberation effects on depression detection from speech

This study compares the effect of noise and reverberation on depression prediction using standard mel-frequency cepstral coefficients, and features designed for noise robustness, damped oscillator cepstral coefficients.

Publications, Speech & natural language publications
December 1, 2015

Improving robustness against reverberation for automatic speech recognition

ByMitchell McLaren, Martin Graciarena, Horacio Franco, Dimitra Vergyri

In this work, we explore the role of robust acoustic features motivated by human speech perception studies, for building ASR systems robust to reverberation effects.

Publications, Speech & natural language publications
December 1, 2015

Time-frequency convolutional networks for robust speech recognition

ByHoracio Franco

This work presents a modified CDNN architecture that we call the time-frequency convolutional network (TFCNN), in which two parallel layers of convolution are performed on the input feature space: convolution…

Publications, Speech & natural language publications
December 1, 2015

The MERL/SRI System for the 3rd chime challenge using beamforming, robust feature extraction and advanced speech recognition

This paper introduces the MERL/SRI system designed for the 3rd CHiME speech separation and recognition challenge (CHiME-3).

Publications, Speech & natural language publications
October 1, 2015

Study of senone-based deep neural network approaches for spoken language recognition

ByMitchell McLaren

This paper compares different approaches for using deep neural networks (DNNs) trained to predict senone posteriors for the task of spoken language recognition (SLR).

Publications, Speech & natural language publications
September 1, 2015

Speech-based assessment of PTSD in a military population using diverse feature classes

ByBruce Knoth, Dimitra Vergyri, Mitchell McLaren

We analyzed recordings of the Clinician-Administered PTSD Scale (CAPS) interview from military personnel diagnosed as PTSD positive versus negative.

Publications, Robotics, sensors, & devices publications, Speech & natural language publications