Author: SRI International
-
Hybrid Planning: An Approach to Integrating Generative and Case-Based Planning
This paper describes ongoing research on the development of a hybrid planning system that integrates case-based reasoning (CBR) methods into SIPE-2, a generative planning system.
-
Connecting With The Twenty-First Century: Technology In Support Of Educational Reform
We present an alternative image of what schools might be like and a set of interlocking social, pedagogical and technological changes that could transform the educational enterprise.
-
Automatic Detection of Discourse Structure for Speech Recognition and Understanding
We describe a new approach for statistical modeling and detection of discourse structure for natural conversational speech. Our model is based on 42 `Dialog Acts’ (DAs), (question, answer, backchannel, agreement, disagreement, apology, etc).
-
Using Information Extraction to Improve Information Retrieval
The authors describe an approach to applying a particular kind of Natural Language Processing NLP system to the TREC routing task in Information Retrieval IR.
-
Structure and Performance of a Dependency Language Model
We present a maximum entropy language model that incorporates both syntax and semantics via a dependency grammar.
-
Diagrammatic Methods for Deriving and Relating Temporal Neural Network Algorithms
We present an alternative approach based on a set of simple block diagram manipulation rules. The approach provides a common framework to derive popular algorithms including backpropagation and backpropagation-through-time, without a single chain rule expansion.
-
A Prosody-Only Decision-Tree Model for Disfluency Detection
We have developed a disfluency detection method using decision tree classifiers that use only local and automatically extracted prosodic features. Because the model doesn’t rely on lexical information, it is widely applicable even when word recognition is unreliable.
-
Speech: A Privileged Modality
In this article, we use our interaction model to demonstrate that during multimodal fusion, speech should be a privileged modality, driving the interpretation of a query, and that in certain cases, speech has even more power to override and modify the combination of other modalities than previously believed.
-
Acoustic Clustering and Adaptation for Robust Speech Recognition
We describe an algorithm based on acoustic clustering and acoustic adaptation to significantly improve speech recognition performance. The method is particularly useful when speech from multiple speakers is to be recognized and the boundary between speakers is not known.
-
Modeling Linguistic Segment and Turn Boundaries for N-best Rescoring of Spontaneous Speech
We present an N-best rescoring algorithm that removes the effect of segmentation mismatch. Furthermore, we show that explicit language modeling of hidden linguistic segment boundaries is improved by including turn-boundary events in the model.
-
Explicit Word Error Minimization in N-best List Rescoring
We show that the standard hypothesis scoring paradigm used in maximum-likelihood-based speech recognition systems is not optimal with regard to minimizing the word error rate, the commonly used performance metric in speech recognition.
-
A Study of Multilingual Speech Recognition
This paper describes our work in developing multilingual (Swedish and English) speech recognition systems in the ATIS domain. The acoustic component of the multilingual systems is realized through sharing Gaussian codebooks across Swedish and English allophones.