Speech & natural language publications

March 1, 2005

SRI’s 2004 NIST Speaker Recognition Evaluation System

This paper describes our recent efforts in exploring longer-range features and their statistical modeling techniques for speaker recognition. In particular, we describe a system that uses discriminant features from cepstral…

March 1, 2005

Structural Metadata Research in the EARS Program

In this paper we provide a brief overview of research on structural metadata extraction in the DARPA EARS rich transcription program. Tasks include detection of sentence boundaries, filler words, and…

March 1, 2005

Improved Phonetic Speaker Recognition Using Lattice Decoding

In this paper, we present results on the Switchboard-2 corpus, where we compare 1-best phone decodings versus lattice phone decodings for the purposes of performing phonetic speaker recognition.

March 1, 2005

Automatic Dialog Act Segmentation and Classification in Multiparty Meetings

We explore the two related tasks of dialog act (DA) segmentation and DA classification for speech from the ICSI Meeting Corpus. We employ simple lexical and prosodic knowledge sources, and…

January 1, 2005

Modeling Prosodic Feature Sequences for Speaker Recognition

We describe a novel approach to modeling idiosyncratic prosodic behavior for automatic speaker recognition.

October 1, 2004

The ICSI-SRI-UW Metadata Extraction System

We describe a state-of-the-art system for automatic detection of "metadata" in both broadcast news and spontaneous telephone conversations, developed as part of the DARPA EARS Rich Transcription program.

October 1, 2004

Using Machine Learning to Cope with Imbalanced Classes in Natural Speech: Evidence from Sentence Boundary and Disfluency Detection

We investigate machine learning techniques for coping with highly skewed class distributions in two spontaneous speech processing tasks. Both tasks, sentence boundary and disfluency detection, provide important structural information for…

October 1, 2004

Morphology-Based Language Modeling for Arabic Speech Recognition

By Dimitra Vergyri

In this paper we investigate the use of morphology-based language models at different stages in a speech recognition system for conversational Arabic.

October 1, 2004

From Switchboard to Meetings: Development of the 2004 ICSI-SRI-UW Meeting Recognition System

We describe the ICSI-SRI-UW team's entry in the Spring 2004 NIST Meeting Recognition Evaluation. The system was derived from SRI's 5xRT Conversational Telephone Speech (CTS) recognizer by adapting CTS acoustic…

October 1, 2004

On Using MLP Features in LVCSR

One of the major research thrusts in the speech group at ICSI is to use Multi-Layer Perceptron (MLP) based features in automatic speech recognition (ASR). This paper presents a study…

October 1, 2004

Effective Acoustic Modeling for Rate-of-Speech Variation in Large Vocabulary Conversational Speech Recognition

We investigate several variants of speech-rate-dependent acoustic models for large-vocabulary conversational speech recognition, in the framework of combining rate-specific models in decoding to compensate for speech rate variation.

October 1, 2004

SVM Modeling of “SNERF-Grams” for Speaker Recognition

We describe a new approach to modeling idiosyncratic prosodic behavior for automatic speaker recognition. The approach computes prosodic features by syllable, and models the syllable-feature sequences using support vector machines…
