Speech & natural language publications
-
A Toolkit for the Design of Ambisonic Decoders
We discuss the issues involved and describe a set of tools for generating optimized decoder solutions for irregular loudspeaker arrays and demonstrate those tools with practical examples.
-
Promoting robustness for speaker modeling in the community: the PRISM evaluation set
We introduce a new database for evaluation of speaker recognition systems.
-
Robust speech recognition using articulatory gestures in a dynamic bayesian network framework
We present a Dynamic Bayesian Network based speech recognition architecture that models the articulatory gestures as hidden variables and uses them for speech recognition.
-
SRILM at sixteen: Update and outlook
We review developments in the SRI Language Modeling Toolkit (SRILM) since 2002, when a previous paper on SRILM was published.
-
SRILM at 16: Update and Outlook
We review developments in the SRI Language Modeling Toolkit (SRILM) since 2002, when a previous paper on SRILM was published.
-
Identifying Agreement and Disagreement in Conversational Speech: A Cross-lingual Study
This paper presents models for detecting agreement/disagreement between speakers in English and Arabic broadcast conversation shows.
-
Using Prosodic and Spectral Features in Detecting Depression in Elderly Males
In this study, we focus on speech features that can identify the speaker’s emotional health, i.e., whether the speaker is depressed or not.
-
Constrained cepstral speaker recognition using matched UBM and JFA training
We study constrained speaker recognition systems, or systems that model standard cepstral features that fall within particular types of speech regions.
-
Automatic detection of speaker attributes based on utterance text
We present models for detecting various attributes of a speaker based on uttered text alone.
-
Factor analysis back ends for MLLR transforms in speaker recognition
The purpose of this work is to show how recent developments in cepstral-based systems for speaker recognition can be leveraged for the use of Maximum Likelihood Linear Regression (MLLR) transforms.
-
Effective Arabic dialect classification using diverse phonotactic models
We study the effectiveness of recently developed language recognition techniques based on speech recognition models for the discrimination of Arabic dialects.
-
iVector fusion of prosodic and cepstral features for speaker verification
In this paper we apply the promising iVector extraction technique followed by PLDA modeling to simple prosodic contour features.