Speech & natural language publications

November 1, 2014

The SRI AVEC-2014 Evaluation System

ByMartin Graciarena, Dimitra Vergyri, Colleen Richey, Andreas Kathol

We explore a diverse set of features based only on spoken audio to understand which features correlate with self-reported depression scores according to the Beck depression rating scale.

Cyber & formal methods publications, Publications, Speech & natural language publications
September 1, 2014

Spoken Language Recognition Based on Senone Posteriors

ByMitchell McLaren

This paper explores in depth a recently proposed approach to spoken language recognition based on the estimated posteriors for a set of senones representing the phonetic space of one or…

Publications, Speech & natural language publications
September 1, 2014

Content Matching for Short Duration Speaker Recognition

We show how content matching can be effectively done at the statistics level to enable the use of standard veriﬁcation backends. While no signiﬁcant improvements were observed for the general…

Publications, Speech & natural language publications
September 1, 2014

A Deep Neural Network Speaker Veriﬁcation System Targeting Microphone Speech

ByMitchell McLaren

We recently proposed the use of deep neural networks (DNN) in place of Gaussian Mixture models (GMM) in the i-vector extraction process for speaker recognition.

Publications, Speech & natural language publications
September 1, 2014

Application of Convolutional Neural Networks to Speaker Recognition in Noisy Conditions

ByMitchell McLaren

This paper applies a convolutional neural network (CNN) trained for automatic speech recognition (ASR) to the task of speaker identification (SID).

Publications, Speech & natural language publications
September 1, 2014

Evaluating Robust Features on Deep Neural Networks for Speech Recognition in Noisy and Channel Mismatched Conditions

ByMartin Graciarena, Horacio Franco

In this work we present a study exploring both conventional DNNs and deep Convolutional Neural Networks (CNN) for noise- and channel-degraded speech recognition tasks using the Aurora4 dataset.

Publications, Speech & natural language publications
September 1, 2014

Recent Improvements in SRI’s Keyword Detection System for Noisy Audio

ByDimitra Vergyri, Horacio Franco, Martin Graciarena

We present improvements to a keyword spotting (KWS) system that operates in highly adverse channel conditions with very low signal-to-noise ratio levels.

Publications, Speech & natural language publications
July 1, 2014

Identifying User Demographic Traits through Virtual-World Language Use

ByAaron Lawson

The paper presents approaches for identifying real-world demographic attributes based on language use in the virtual world.

Publications, Speech & natural language publications
June 1, 2014

Articulatory Features from Deep Neural Networks and Their Role in Speech Recognition

This paper presents a deep neural network (DNN) to extract articulatory information from the speech signal and explores different ways to use such information in a continuous speech recognition task.

Publications, Speech & natural language publications
June 1, 2014

Trial-Based Calibration for Speaker Recognition in Unseen Conditions

ByAaron Lawson, Mitchell McLaren

This work presents Trial-Based Calibration (TBC), a novel, automated calibration technique robust to both unseen and widely varying conditions.

Publications, Speech & natural language publications
June 1, 2014

Application of Convolutional Neural Networks to Language Identification in Noisy Conditions

ByAaron Lawson, Mitchell McLaren

This paper proposes two novel frontends for robust language identification (LID) using a convolutional neural network (CNN) trained for automatic speech recognition (ASR).

Publications, Speech & natural language publications
May 1, 2014

Robust Features and System Fusion for Reverberation-robust Speech Recognition

ByAndreas Kathol

In this work, we present robust acoustic features motivated by the knowledge gained from human speech perception and production, and demonstrate that these features provide reasonable robustness to reverberation effects…

Publications, Speech & natural language publications