Speech & natural language publications
-
Spoken Language Understanding
SLU systems contain an automatic speech recognition (ASR) component and must be robust to noise due to the spontaneous nature of spoken language and the errors introduced by ASR. SLU…
-
Two Experiments Comparing Reading with Listening for Human Processing of Conversational Telephone Speech
We report on results of two experiments designed to compare subjects’ ability to extract information from audio recordings of conversational telephone speech (CTS) with their ability to extract information from…
-
Improved Discriminative Training Using Phone Lattices
We present an efficient discriminative training procedure utilizing phone lattices. Different approaches to expediting lattice generation, statistics collection, and convergence were studied.
-
Meeting Structure Annotation: Data and Tools
We present a set of annotations of hierarchical topic segmentations and action item sub-dialogues collected over 65 meetings from the ICSI and ISL meeting corpora, designed to support automatic meeting…
-
MLLR Transforms as Features in Speaker Recognition
We explore the use of adaptation transforms employed in speech recognition systems as features for speaker recognition. This approach is attractive because, unlike standard frame-based cepstral speaker recognition models, it…
-
Speech Translation for Low-Resource Languages: The Case of Pashto
We present a number of challenges and solutions that have arisen in the development of a speech translation system for American English and Pashto, highlighting those specific to a very…
-
Robust Feature Compensation in Nonstationary and Multiple Noise Environments
We extend the POF algorithm to allow a more accurate way to select noisy-to-clean feature mappings, by allowing different combinations of speech and noise to have combination-specific mappings selected depending…
-
Distinguishing Deceptive from Non-Deceptive Speech
We present results from a study seeking to distinguish deceptive from non-deceptive speech using machine learning techniques on features extracted from a large corpus of deceptive and non-deceptive speech. We…
-
Collaborative and argumentative models of natural discussions
We report in this paper experiences and insights resulting from the first two years of work in two similar projects on meeting tracking and understanding. The projects are the DARPA-funded…
-
Using Conditional Random Fields for Sentence Boundary Detection in Speech
In this paper, we evaluate the use of a conditional random field (CRF) for this task and relate results with this model to our prior work. We evaluate across two…
-
Ontology-based multi-party meeting understanding
This paper describes current and planned research efforts towards developing multimodal discourse understanding for an automated personal office assistant.
-
Structural Metadata Research in the EARS Program
In this paper we provide a brief overview of research on structural metadata extraction in the DARPA EARS rich transcription program. Tasks include detection of sentence boundaries, filler words, and…