Speech & natural language publications
-
Crowdsourcing Emotional Speech
We describe the methodology for the collection and annotation of a large corpus of emotional speech data through crowdsourcing.
-
Language Diarization for Semi-supervised Bilingual Acoustic Model Training
In this paper, we investigate several automatic transcription schemes for using raw bilingual broadcast news data in semi-supervised bilingual acoustic model training.
-
Tackling Unseen Acoustic Conditions in Query-by-Example Search Using Time and Frequency Convolution for Multilingual Deep Bottleneck Features
This paper revisits two neural network architectures developed for noise- and channel-robust ASR, and applies them to building a state-of-the-art multilingual QbE system.
-
Noise-robust Exemplar Matching for Rescoring Query-by-Example Search
This paper describes a two-step approach to the keyword spotting task in which a query-by-example search is followed by noise-robust exemplar matching rescoring.
-
Analysis of Phonetic Markedness and Gestural Effort Measures for Acoustic Speech-Based Depression Classification
In this paper we analyze articulatory measures to gain further insight into how articulation is affected by depression.
-
Calibration Approaches for Language Detection
In this paper, we focus on situations in which either (1) the system-modeled languages are not observed during use or (2) the test data contains OOS languages that are unseen…
-
Leveraging Deep Neural Network Activation Entropy to Cope with Unseen Data in Speech Recognition
This work aims to estimate the propagation of such distortion in the form of network activation entropy, which is measured over a short-time running window on the activation from each…
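As a rough illustration of the measure described above, the sketch below computes per-frame entropy of a layer's activations and smooths it with a short running window. The function name, window length, and the softmax normalization are assumptions for illustration, not the paper's actual implementation.

```python
import numpy as np

def activation_entropy(activations, win=10):
    """Hypothetical sketch: entropy of DNN layer activations over a running window.

    activations: array of shape (frames, units) holding one layer's outputs.
    Returns a per-frame entropy curve smoothed over `win` frames.
    """
    # Softmax-normalize each frame so the activations form a distribution.
    e = np.exp(activations - activations.max(axis=1, keepdims=True))
    p = e / e.sum(axis=1, keepdims=True)

    # Shannon entropy per frame (small epsilon avoids log(0)).
    ent = -(p * np.log(p + 1e-12)).sum(axis=1)

    # Average over a short-time running window.
    kernel = np.ones(win) / win
    return np.convolve(ent, kernel, mode="same")
```

Higher entropy under mismatched input would then signal that the network's activations are being distorted by unseen conditions.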
-
Improving Robustness of Speaker Recognition to New Conditions Using Unlabeled Data
We describe our SRICON-UAM team system submission for the NIST 2016 SRE, then benchmark these approaches on several distinctly different databases.
-
Inferring Stance from Prosody
Speech conveys many things beyond content, including aspects of stance and attitude that have received little study.
-
Hybrid Convolutional Neural Networks for Articulatory and Acoustic Information Based Speech Recognition
This work explores using deep neural networks (DNNs) and convolutional neural networks (CNNs) for mapping speech data into its corresponding articulatory space. Our speech-inversion results indicate that the CNN models…
-
Joint Modeling of Articulatory and Acoustic Spaces for Continuous Speech Recognition Tasks
This paper investigates using deep neural networks (DNNs) and convolutional neural networks (CNNs) for mapping speech data into its corresponding articulatory space.
-
Speech Recognition in Unseen and Noisy Channel Conditions
This work investigates robust features, feature-space maximum likelihood linear regression (fMLLR) transform, and deep convolutional nets to address the problem of unseen channel and noise conditions in speech recognition.