Speech & natural language publications

October 1, 1996

Automatic Linguistic Segmentation of Conversational Speech

We present a simple automatic segmenter of transcripts based on N-gram language modeling. We also study the relevance of several word-level features for segmentation performance. Using only word-level information, we…

Publications, Speech & natural language publications
October 1, 1996

Disfluencies in Switchboard

Disfluencies are prevalent in spontaneous speech, and are relevant to both human speech communication and speech processing by machine. This paper reports selected results on Switchboard and two comparison corpora…

Publications, Speech & natural language publications
October 1, 1996

Word Predictability After Hesitations: A Corpus-based Study

We ask whether lexical hesitations in spontaneous speech tend to precede words that are difficult to predict. We define predictability in terms of both transition probability and entropy, in the…

Publications, Speech & natural language publications
September 1, 1996

Modeling Pitch Range Variation Within and Across Speakers: Predicting F0 Targets when “Speaking Up”

We study F0 variation produced by "speaking up", as part of a larger study of pitch range variation within and across speakers. We provide a function to predict target F0…

Publications, Speech & natural language publications
August 16, 1996

Designing for Cognitive Communication: Epistemic Fidelity or Collaborative Inquiry?

This article examines the generalization of the mental model principle to communication of a system of concepts across worldviews.

Publications, Speech & natural language publications
August 1, 1996

Stressed and Unstressed Pronouns: Complementary Preferences

I present a unified account of interpretation preferences of stressed and unstressed pronouns in discourse. The central intuition is the Complementary Preference Hypothesis that predicts the interpretation preference of a…

Publications, Speech & natural language publications
June 1, 1996

VILTS: The Voice-Interactive Language Training System

ECHOS is a voice interactive language training system being developed to foster improvement in French comprehension and speaking skills, incorporating speech recognition and pronunciation evaluation.

Education & learning publications, Publications, Speech & natural language publications
May 1, 1996

A Maximum-Likelihood Approach to Stochastic Matching for Robust Speech Recognition

We present a maximum-likelihood(ML)stochastic matching approach to decrease the acoustic mismatch between a test utterance and a given set of speech models so as to reduce the recognition performance degradation…

Publications, Speech & natural language publications
May 1, 1996

Acoustic Adaptation Using Non-Linear Transformations of HMM Parameters

Speech recognition performance degrades significantly when there is a mismatch between testing and training conditions. Linear transformation-based maximum-likelihood (ML) techniques have been proposed recently to tackle this problem. In this…

Publications, Speech & natural language publications
January 1, 1996

Statistical Language Modeling for Speech Disfluencies

We introduce a language model that predicts disfluencies probabilistically and uses an edited, fluent context to predict following words. It uses dynamic programming to compute the probability of a word…

Publications, Speech & natural language publications
January 1, 1996

An Experimental Study of Acoustic Adaptation Algorithms

In this paper we focus on transformation-based maximum-likelihood (ML) adaptation.

Publications, Speech & natural language publications
January 1, 1996

A Phone-Dependent Confidence Measure for Utterance Rejection

An acoustic confidence measure for acceptance/rejection of recognition hypotheses for continuous speech utterances is proposed. This measure is useful for rejecting utterances that are out of domain, or contain out-of-vocabulary…

Publications, Speech & natural language publications