Speech & natural language publications
-
English Access to Structured Data
We present work on using a domain model to guide text interpretation, in the context of a project that aims to interpret English questions as a sequence of queries to…
-
Implementing SRI’s Pashto speech-to-speech translation system on a smartphone
We describe our recent effort implementing SRI's UMPC-based Pashto speech-to-speech (S2S) translation system on a smart phone running the Android operating system.
-
Unbiased discourse segmentation evaluation
We show that the performance measures Pk and Window Diff, commonly used for discourse, topic, and story segmentation evaluation, are biased in favor of segmentations with fewer or adjacent segment…
-
Unsupervised domain adaptation with multiple acoustic models
We investigate the problem of adapting a recognition system with multiple acoustic models to a new domain in unsupervised mode.
-
Detection of Social Roles in Conversations using Dynamic Bayesian Networks
In this paper, we focus on inferring social roles in conversations using information extracted only from the speaking styles of the speakers.
-
A comparative large scale study of MLP features for mandarin ASR
In this paper, all the proposed frontends are compared in systematic manner and we extensively investigate the scalability of these features in terms of the amount of training data (from…
-
Domain adaptation and compensation for emotion detection
Inspired by the recent improvements in domain adaptation and session variability compensation techniques used for speech and speaker processing, we study their effect for emotion prediction.
-
A Corpus Analysis of Patterns of Age-Related Change in Conversational Speech
Conversational speech from over 300 speakers from 17 to 68 years of age was analyzed for age-related changes in the timing and content of spoken language production.
-
Spoken Proper Name Retrieval for Limited Resource Languages Using Multilingual Hybrid Representations
In this paper, we present our research efforts towards multilingual spoken information retrieval with limitations in acoustic training data.
-
The CALO meeting assistant system
This paper presents the CALO-MA architecture and its speech recognition and understanding components.
-
Improving language recognition with multilingual phone recognition and speaker adaptation transforms
We investigate a variety of methods for improving language recognition accuracy based on techniques in speech recognition, and in some cases borrowed from speaker recognition.
-
LDA based similarity modeling for question answering
We investigate Latent Dirichlet Allocation (LDA) models to obtain ranking scores based on a novel similarity measure between a natural language question posed by the user and a candidate passage.