Speech & natural language publications
-
Leveraging speaker diarization for meeting recognition from distant microphones
We investigate using state-of-the-art speaker diarization output for speech recognition purposes.
-
Annotating Participant Reference in English Spoken Conversation
We present a method for annotating verbal reference to people in conversational speech, with a focus on reference to conversation participants.
-
Prosodic speaker verification using subspace multinomial models with intersession compensation
We propose a novel approach to modeling prosodic features.
-
Does Session Variability Compensation in Speaker Recognition Model Intrinsic Variation Under Mismatched Conditions?
We find that ISV compensation is remarkably successful on a corpus of intrinsic variation that is highly controlled for channel (a dominant component of ISV). The results are particularly surprising…
-
Classification-Based Strategies for Combining Multiple 5-W Question Answering Systems
We describe and analyze inference strategies for combining outputs from multiple question answering systems each of which was developed independently.
-
Within-Session Variability Modelling for Factor Analysis Speaker Verification
This work presents an extended Joint Factor Analysis model including explicit modelling of unwanted within-session variability.
-
Development of the 2008 SRI Mandarin Speech-To-Text System for Broadcast News and Conversation
We describe the recent progress in SRI's Mandarin speech-to-text system developed for 2008 evaluation in the DARPA GALE program. A data-driven lexicon expansion technique and language model adaptation methods contribute…
-
Building a Highly Accurate Mandarin Speech Recognizer with Language-Independent Technologies and Language-Dependent Modules
We describe a system for highly accurate large-vocabulary Mandarin speech recognition. The prevailing hidden Markov model based technologies are essentially language independent and constitute the backbone of our system.
-
Modeling other talkers for improved dialog act recognition in meetings
We propose a new approach that takes into account speech from other talkers, relying only on speech/non-speech information from all participants.
-
Multifactor Adaptation for Mandarin Broadcast News and Conversation Speech Recognition
We explore the integration of multiple factors such as genre and speaker gender for acoustic model adaptation tasks to improve Mandarin ASR system performance on broadcast news and broadcast conversation…
-
Feature-based and channel-based analyses of intrinsic variability in speaker verification
In this paper we explore the use of other speaker verification systems on the telephone channel data and compare against the GMM baseline. We found the GMM system to be…
-
Genre Effects on Automatic Sentence Segmentation of Speech: a Comparison of Broadcast News and Broadcast Conversations
We investigate genre effects on the task of automatic sentence segmentation, focusing on two important domains - broadcast news (BN) and broadcast conversation (BC).