Speech & natural language publications

September 1, 2005

Does Active Learning Help Automatic Dialog Act Tagging in Meeting Data?

We ask if active learning with lexical cues can help for this task and this domain. To better address this question, we explore active learning for two different types of…

Publications, Speech & natural language publications
September 1, 2005

Using MLP Features in SRI’s Conversational Speech Recognition System

We describe the development of a speech recognition system for conversational telephone speech (CTS) that incorporates acoustic features estimated by multilayer perceptrons (MLP). The acoustic features are based on frame-level…

September 1, 2005

Class-dependent Score Combination for Speaker Recognition

In this work, we are presenting a class-based score combination technique that relies on clustering of both the target models and the test utterances in a vector space defined by…

September 1, 2005

Development of a Conversational Telephone Speech Recognizer for Levantine Arabic

By Dimitra Vergyri

In this paper, we describe the development of a large-vocabulary speech recognition system for Levantine Arabic, which was a new dialectal recognition task for our existing system. We discuss the…

September 1, 2005

Leveraging Speaker-dependent Variation of Adaptation

This work introduces an automatic procedure for determining the size of regression class trees for individual speakers using an ensemble of speaker-level features to control the number of transformations, if…

September 1, 2005

Spoken Language Understanding

SLU systems contain an automatic speech recognition (ASR) component and must be robust to noise due to the spontaneous nature of spoken language and the errors introduced by ASR. SLU…

September 1, 2005

Two Experiments Comparing Reading with Listening for Human Processing of Conversational Telephone Speech

We report on results of two experiments designed to compare subjects’ ability to extract information from audio recordings of conversational telephone speech (CTS) with their ability to extract information from…

September 1, 2005

Improved Discriminative Training Using Phone Lattices

We present an efficient discriminative training procedure utilizing phone lattices. Different approaches to expediting lattice generation, statistics collection, and convergence were studied.

July 1, 2005

Collaborative and argumentative models of natural discussions

By John Niekrasz

We report in this paper experiences and insights resulting from the first two years of work in two similar projects on meeting tracking and understanding. The projects are the DARPA-funded…

June 1, 2005

Using Conditional Random Fields for Sentence Boundary Detection in Speech

In this paper, we evaluate the use of a conditional random field (CRF) for this task and relate results with this model to our prior work. We evaluate across two…

April 1, 2005

Ontology-based multi-party meeting understanding

By John Niekrasz

This paper describes current and planned research efforts towards developing multimodal discourse understanding for an automated personal office assistant.

March 1, 2005

Improved Phonetic Speaker Recognition Using Lattice Decoding

In this paper, we present results on the Switchboard-2 corpus, where we compare 1-best phone decodings versus lattice phone decodings for the purposes of performing phonetic speaker recognition.
