Publications
-
HMM State Clustering Across Allophone Class Boundaries
We present a novel approach to hidden Markov model (HMM) state clustering based on the use of broad phone classes and an allophone class entropy measure. Our algorithm allows clustering…
-
A Study of Multilingual Speech Recognition
This paper describes our work in developing multilingual (Swedish and English) speech recognition systems in the ATIS domain. The acoustic component of the multilingual systems is realized through sharing Gaussian…
-
A Prosody-Only Decision-Tree Model for Disfluency Detection
We have developed a disfluency detection method using decision tree classifiers that use only local and automatically extracted prosodic features. Because the model doesn't rely on lexical information, it is…
-
Automatic Pronunciation Scoring of Specific Phone Segments for Language Instruction
The aim of the work described in this paper is to develop methods for automatically assessing the pronunciation quality of specific phone segments uttered by students learning a foreign language.
-
Multimodal Interfaces for Internet
In this paper, we present a Java-enabled application with a multimodal (pen and voice) interface over the web. Our implementation approach was to add Java to the set of languages…
-
Using Differential Constraints to Reconstruct Complex Surfaces from Stereo
Stereo reconstruction algorithms often fail to properly deal with complex surfaces, because there is not enough image information. We propose to guide the reconstruction process using a priori information about…
-
Neural-Network Based Measures of Confidence for Word Recognition
This paper proposes a probabilstic framework to define and evaluate confidence measures for word recognition. We describe a novel method to combine different knowledge sources and estimate the confidence in…
-
A Collaborative Environment for Authoring Large Knowledge Bases
Collaborative knowledge base (KB) authoring environments are critical for the construction of high-performance KBs. In this paper, we present an environment that satisfies many of these goals.
-
Handset-Dependent Background Models for Robust Text-Independent Speaker Recognition
This paper studies the effects of handset distortion on telephone-based speaker recognition performance. Results on the 1996 NIST Speaker Recognition Evaluation corpus show that using handset-matched background models reduces false…
-
Automatic Pronunciation Scoring for Language Instruction
In this paper we show that we can significantly improve HMM- based scores by using average phone segment posterior probabilities. Correlation between machine and human scores went up from r=0.50…
-
Model Transformation for Robust Speaker Recognition from Telephone Data
In the context of automatic speaker recognition, we propose a model transformation technique that renders speaker models more robust to acoustic mismatches and to data scarcity by appropriately increasing their…
-
HTTP://WWW.SPEECH.SRI.COM/DEMOS/ATIS.HTML
This paper presents a speech-enabled WWW demonstration based on the Air Travel Information System (ATIS) domain. SRI’s speech recognition technology and natural language understanding are fully integrated in a Java…