Speech & natural language publications
-
SRIUBC: simple similarity features for semantic textual similarity
We describe the systems submitted by SRI International and the University of the Basque Country for the Semantic Textual Similarity SemEval-2012 task.
-
Robust speech recognition using articulatory gestures in a dynamic Bayesian network framework
We present a Dynamic Bayesian Network-based speech recognition architecture that models articulatory gestures as hidden variables and uses them for speech recognition.
-
SRILM at sixteen: Update and outlook
We review developments in the SRI Language Modeling Toolkit (SRILM) since 2002, when a previous paper on SRILM was published.
-
Promoting robustness for speaker modeling in the community: the PRISM evaluation set
We introduce a new database for evaluation of speaker recognition systems.
-
Identifying Agreement and Disagreement in Conversational Speech: A Cross-lingual Study
This paper presents models for detecting agreement/disagreement between speakers in English and Arabic broadcast conversation shows.
-
Using Prosodic and Spectral Features in Detecting Depression in Elderly Males
In this study, we focus on speech features that can identify the speaker’s emotional health, i.e., whether the speaker is depressed or not.
-
Constrained cepstral speaker recognition using matched UBM and JFA training
We study constrained speaker recognition systems, or systems that model standard cepstral features that fall within particular types of speech regions.
-
Automatic detection of speaker attributes based on utterance text
We present models for detecting various attributes of a speaker based on uttered text alone.
-
Factor analysis back ends for MLLR transforms in speaker recognition
The purpose of this work is to show how recent developments in cepstral-based systems for speaker recognition can be leveraged for the use of Maximum Likelihood Linear Regression (MLLR) transforms.
-
Effective Arabic dialect classification using diverse phonotactic models
We study the effectiveness of recently developed language recognition techniques based on speech recognition models for the discrimination of Arabic dialects.
-
iVector fusion of prosodic and cepstral features for speaker verification
In this paper we apply the promising iVector extraction technique followed by PLDA modeling to simple prosodic contour features.