Speech & natural language publications

September 1, 2002

SRILM – An Extensible Language Modeling Toolkit

SRILM is a collection of C libraries, executable programs, and helper scripts designed to allow both production of and experimentation with statistical language models for speech recognition and other applications.

Publications, Speech & natural language publications
May 1, 2002

Using Prosodic and Lexical Information for Speaker Identification

We investigate the incorporation of larger time-scale information, such as prosody, into standard speaker ID systems. Our study is based on the Extended Data Task of the NIST 2001 Speaker…

Publications, Speech & natural language publications
March 1, 2002

DynaSpeak: SRI’s Scalable Speech Recognizer for Embedded and Mobile Systems

ByHoracio Franco, Victor Abrash

We introduce SRI's new speech recognition engine, DynaSpeak(TM), which is characterized by its scalability and flexibility, high recognition accuracy, memory and speed efficiency, adaptation capability, efficient grammar optimization, support for…

Publications, Speech & natural language publications
January 1, 2002

Prosody Modeling for Automatic Speech Recognition and Understanding

This paper summarizes statistical modeling approaches for the use of prosody (the rhythm and melody of speech) in automatic recognition and understanding of speech.

Publications, Speech & natural language publications
January 1, 2002

Improved Modeling and Efficiency for Automatic Transcription of Broadcast News

In this paper, we report on our research and progress on the DARPA-sponsored Hub-4 continuous speech recognition evaluations, with an emphasis on efficient modeling.

Publications, Speech & natural language publications
January 1, 2002

Building an ASR System for Noisy Environments: SRI’s 2001 SPINE Evaluation System

ByDimitra Vergyri

We describe SRI's recognition system as used in the 2001 DARPA Speech in Noisy Environments (SPINE) evaluation. The SPINE task involves recognition of speech in simulated military environments.

Publications, Speech & natural language publications
December 1, 2001

Multispeaker Speech Activity Detection for the ICSI Meeting Recorder

We have developed a more sophisticated approach for multichannel speech activity detection using a simple hidden Markov model (HMM).

Publications, Speech & natural language publications
October 1, 2001

Prosody Modeling for Automatic Speech Understanding: An Overview of Recent Research at SRI

In this paper, we summarize recent work at SRI International in the area of computational prosody modeling, and results from several recognition tasks where prosodic knowledge proved to be of…

Publications, Speech & natural language publications
October 1, 2001

Can Prosody Aid the Automatic Processing of Multi-Party Meetings? Evidence from Predicting Punctuation, Disfluencies,and Overlapping Speech

We investigate whether probabilistic modeling of prosody can aid various automatic labeling tasks essential for processing of multi-party meetings.

Publications, Speech & natural language publications, STEM and computer science education publications
October 1, 2001

Modeling Word Durations

We describe a new method of modeling duration at word level. These duration models are easily trained from the acoustic training data and can be used to rescore N-best lists…

Publications, Speech & natural language publications
September 1, 2001

Improved Maximum Mutual Information Estimation Training of Continuous Density HMMs

ByHoracio Franco

We derive a new set of equations for MMIE based on a quasi-Newton algorithm, without relying on EBW. We find that by adopting a generalized form of the MMIE criterion,…

Publications, Speech & natural language publications
September 1, 2001

Observations on Overlap: Findings and Implications for Automatic Processing of Multi-Party Conversation

We examine the distribution of overlapping speech in different corpora of natural multi-party conversations, including two types of meetings, and two corpora of telephone conversations.

Publications, Speech & natural language publications