Speech & natural language publications

October 1, 2004

On Using MLP Features in LVCSR

One of the major research thrusts in the speech group at ICSI is to use Multi-Layer Perceptron (MLP) based features in automatic speech recognition (ASR). This paper presents a study…

Publications, Speech & natural language publications
August 1, 2004

Automatic Diacritization of Arabic for Acoustic Modeling in Speech Recognition

ByDimitra Vergyri

In this paper we investigate different procedures that enable us to use training data by automatically inserting the missing diacritics into the transcription.

Publications, Speech & natural language publications
July 1, 2004

Managing uncertainty in dialogue information state for real time understanding of multi-human meeting dialogue

ByJohn Niekrasz

Our ultimate aim is to model human-human dialogue (to the extent that it is feasible) in real-time, providing useful services (e.g. relevant document retrieval) and answering queries about the dialogue…

Publications, Speech & natural language publications
July 1, 2004

Comparing and Combining Generative and Posterior Probability Models: Some Advances in Sentence Boundary Detection in Speech

We compare and contrast two different models for detecting sentence-like units in continuous speech. Both models combine lexical, syntactic, and prosodic information.

Publications, Speech & natural language publications
July 1, 2004

Identifying Agreement and Disagreement in Conversational Speech: Use of Bayesian Networks to Model Pragmatic Dependencies

We describe a statistical approach for modeling agreements and disagreements in conversational interaction.

Publications, Speech & natural language publications
June 1, 2004

Modeling NERFs for Speaker Recognition

We introduce a new type of feature to capture long-range patterns associated with individual speakers or with speaking styles. NERFs, or Nonuniform Extraction Region Features, are defined based on regions…

Publications, Speech & natural language publications
May 1, 2004

Voicing Feature Integration in SRI’s Decipher LVCSR System

ByMartin Graciarena, Horacio Franco, Dimitra Vergyri

We augment the Mel cepstral (MFCC) feature representation with voicing features from an independent front end.

Publications, Speech & natural language publications
May 1, 2004

Application of the Modified Group Delay Function to Speaker Identification and Discrimination

In this paper, we explore new methods by which speakers can be identified and discriminated, using features derived from the fourier transform phase. A Gaussian mixture model (GMM) based speaker…

Publications, Speech & natural language publications
May 1, 2004

Cross-dialectal Acoustic Data Sharing for Arabic Speech Recognition

ByDimitra Vergyri

In this paper we describe the use of acoustic data from Modern Standard Arabic (MSA) to improve the recognition of Egyptian Conversational Arabic (ECA).

Publications, Speech & natural language publications
May 1, 2004

TRAPping Conversational Speech: Extending TRAP/Tandem Approaches to Conversational Telephone Speech Recognition

In this paper we report experiments with a reduced conversational speech task that led to the adoption of a number of engineering decisions for the design of an acoustic front…

Publications, Speech & natural language publications
May 1, 2004

The Use of a Linguistically Motivated Language Model in Conversational Speech Recognition

In this paper we show that such a model can be used effectively and efficiently in all stages of a complex, multi-pass conversational telephone speech recognition system.

Publications, Speech & natural language publications
April 1, 2004

Improving Automatic Sentence Boundary Detection with Confusion Networks

We extend existing methods for automatic sentence boundary detection by leveraging multiple recognizer hypotheses in order to provide robustness to speech recognition errors.

Publications, Speech & natural language publications