Author: SRI International
-
Phonetic Consequences of Speech Disfluency
Analyses of American English show that disfluency affects a variety of phonetic aspects of speech, including segment durations, intonation, voice quality, vowel quality, and coarticulation patterns. These effects provide clues about production processes, and can guide methods for disfluency processing in speech recognition applications.
-
ILU 2.0beta1 reference manual
Reference manual for ILU, an inter-language unification system.
-
Robust Text-Independent Speaker Identification over Telephone Channels
This paper addresses the issue of closed-set text-independent speaker identification from samples of speech recorded over the telephone. It focuses on the effects of acoustic mismatches between training and testing data, and concentrates on two approaches: extracting features that are robust against channel variations, and transforming the speaker models to compensate for channel effects.
-
Modeling the Prosody of Hidden Events for Improved Word Recognition
We investigate a new approach for using speech prosody as a knowledge source for speech recognition. The idea is to penalize word hypotheses that are inconsistent with prosodic features such as duration and pitch.
-
Performance Assessment Links In Science (PALS)
SRI International is developing Performance Assessment Links in Science (PALS), an on-line, standards-based, interactive resource bank of science performance assessments.
-
Combining Words and Prosody for Information Extraction from Speech
In this work we demonstrate the use of prosodic cues, alone and in combination with words, for segmentation and name finding. In experiments, we find that prosodic cues alone allow sentence and topic segmentation that is at least as good as word-based methods alone, and that combining both types of cues gives significant gains.
-
Finding Consensus Among Words: Lattice-based Word Error Minimization
We describe a new algorithm for finding the hypothesis in a recognition lattice that is expected to minimize the word error rate (WER). Our approach thus overcomes the mismatch between the word-based performance metric and the standard MAP scoring paradigm that is sentence-based, and that can lead to sub-optimal recognition results.
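The mismatch between sentence-level MAP scoring and word error can be illustrated with a minimal sketch. It assumes a small N-best list rather than a recognition lattice, so it is a simplification and not the lattice algorithm the paper describes; the hypotheses, posterior values, and function names are all hypothetical.

```python
def edit_distance(ref, hyp):
    """Word-level Levenshtein distance between two token lists."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def map_hypothesis(nbest):
    """Sentence-level MAP choice: the single most probable hypothesis."""
    return max(nbest, key=lambda item: item[1])[0]


def min_expected_word_error(nbest):
    """Pick the hypothesis with the lowest expected word-level edit distance
    to the other hypotheses, weighted by their normalized posteriors."""
    total = sum(p for _, p in nbest)

    def expected_errors(candidate):
        return sum((p / total) * edit_distance(other.split(), candidate.split())
                   for other, p in nbest)

    return min((h for h, _ in nbest), key=expected_errors)


# Hypothetical N-best list: (hypothesis, posterior probability).
nbest = [
    ("call home", 0.40),
    ("tall dome", 0.35),
    ("tall foam", 0.25),
]

print(map_hypothesis(nbest))           # call home
print(min_expected_word_error(nbest))  # tall dome
```

In this toy list the MAP choice has the highest sentence posterior, but "tall dome" shares a word with "tall foam" and therefore has a lower expected number of word errors (1.05 versus 1.20).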
-
Characterizing the Performance of Multiple-Image Point-Correspondence Algorithms using Self-Consistency
A new approach to characterizing the performance of point-correspondence algorithms is presented.
-
Task-based Information Management
We are constructing a system, called the Task-based Information Distribution Environment (TIDE), that delivers information to participants in a dynamic collaboration by evaluating the relevance of incoming and newly generated information to collaborators’ current tasks.
-
Construct Validation Of Mathematics Achievement: Evidence From Interview Procedures
This study investigated the validity of measures derived from a large-scale multiple-choice achievement test in mathematics, using evidence from introspective think-aloud protocols of students as they attempted test items.
-
3–D Stereo Reconstruction of Human Faces driven by Differential Constraints
We propose a way to incorporate a priori information in a reconstruction process from a sequence of calibrated face images.
-
How Inner-City Children See Their Family, School, Peers And Neighborhood: Developmental Changes During The Transition To Adolescence