Speech & natural language publications

January 1, 1998

Can Prosody Aid the Automatic Classification of Dialog Acts in Conversational Speech?

This study asks whether current approaches, which use mainly word information, could be improved by adding prosodic information. The study is based on more than 1000 conversations from the Switchboard…

Publications, Speech & natural language publications
January 1, 1998

MVIEWS: Multimodal Tools for the Video Analyst

SRI has developed MVIEWS, a system for annotating, indexing, extracting, and disseminating information from video streams for surveillance and intelligence applications. MVIEWS is implemented within the Open Agent Architecture, a…

Artificial intelligence publications, Publications, Speech & natural language publications
January 1, 1998

Discriminative Training of Minimum Cost Speaker Verification Systems

This paper presents a new training procedure for speaker verification systems. Results are presented from the 1997 NIST Speaker Recognition Evaluation corpus indicating that the VCF performance can be improved…

Publications, Speech & natural language publications
December 1, 1997

Automatic Detection of Discourse Structure for Speech Recognition and Understanding

We describe a new approach for statistical modeling and detection of discourse structure for natural conversational speech. Our model is based on 42 `Dialog Acts' (DAs), (question, answer, backchannel, agreement,…

Publications, Speech & natural language publications
September 1, 1997

A Lognormal Tied Mixture Model of Pitch for Prosody-Based Speaker Recognition

In this work, we develop a statistical model of pitch that allows unbiased estimation of pitch statistics from pitch tracks which are subject to doubling and/or halving.

Publications, Speech & natural language publications
September 1, 1997

Structure and Performance of a Dependency Language Model

We present a maximum entropy language model that incorporates both syntax and semantics via a dependency grammar.

Publications, Speech & natural language publications
September 1, 1997

A Study of Multilingual Speech Recognition

ByHarry Bratt

This paper describes our work in developing multilingual (Swedish and English) speech recognition systems in the ATIS domain. The acoustic component of the multilingual systems is realized through sharing Gaussian…

Publications, Speech & natural language publications
September 1, 1997

Acoustic Clustering and Adaptation for Robust Speech Recognition

We describe an algorithm based on acoustic clustering and acoustic adaptation to significantly improve speech recognition performance. The method is particularly useful when speech from multiple speakers is to be…

Publications, Speech & natural language publications
September 1, 1997

Modeling Linguistic Segment and Turn Boundaries for N-best Rescoring of Spontaneous Speech

We present an N-best rescoring algorithm that removes the effect of segmentation mismatch. Furthermore, we show that explicit language modeling of hidden linguistic segment boundaries is improved by including turn-boundary…

Publications, Speech & natural language publications
September 1, 1997

Speech: A Privileged Modality

In this article, we use our interaction model to demonstrate that during multimodal fusion, speech should be a privileged modality, driving the interpretation of a query, and that in certain…

Publications, Speech & natural language publications
September 1, 1997

HMM State Clustering Across Allophone Class Boundaries

ByHarry Bratt

We present a novel approach to hidden Markov model (HMM) state clustering based on the use of broad phone classes and an allophone class entropy measure. Our algorithm allows clustering…

Publications, Speech & natural language publications
September 1, 1997

A Prosody-Only Decision-Tree Model for Disfluency Detection

We have developed a disfluency detection method using decision tree classifiers that use only local and automatically extracted prosodic features. Because the model doesn't rely on lexical information, it is…

Publications, Speech & natural language publications