Speech & natural language publications
-
Detecting Deception Using Critical Segments
We present an investigation of segments that map to GLOBAL LIES, that is, the intent to deceive with respect to salient topics of the discourse. We propose that identifying the…
-
Integrating MAP, Marginals, and Unsupervised Language Model Adaptation
We investigate the integration of various language model adaptation approaches for a cross-genre adaptation task to improve Mandarin ASR system performance on a recently introduced genre, broadcast conversation (BC).
-
Parameterization of Prosodic Feature Distributions for SVM Modeling in Speaker Recognition
This paper explores the important question of finding a good kernel for a system that models syllable-based prosodic features using support vector machines (SVMs). We introduce two new methods for…
-
Statistical Sentence Extraction for Information Distillation
In this paper, we present a statistical sentence extraction approach for distillation. We frame this task as a classification problem, where each candidate sentence in documents is classified as…
-
NAP and WCCN: Comparison of Approaches Using MLLR-SVM Speaker Verification System
We compare two recently proposed techniques, within-class covariance normalization (WCCN) [1] and nuisance attribute projection (NAP) [2], for intersession variability compensation in speaker verification.
-
Analysis of Morph-Based Speech Recognition and the Modeling of Out-of-Vocabulary Words Across Languages
We analyze subword-based language models (LMs) in large-vocabulary continuous speech recognition across four “morphologically rich” languages: Finnish, Estonian, Turkish, and Egyptian Colloquial Arabic.
-
Unsupervised Language Model Adaptation for Meeting Recognition
We present an application of unsupervised language model (LM) adaptation to meeting recognition, in a scenario where sequences of multiparty meetings on related topics are to be recognized, but no…
-
Combining Discriminative Feature, Transform, and Model Training for Large Vocabulary Speech Recognition
This paper uses a state-of-the-art Mandarin recognition system as a platform to study the interaction of three techniques. Experiments in the broadcast news and broadcast conversation domains show that the…
-
Noise Robust Speaker Identification for Spontaneous Arabic Speech
We present an approach that integrates multiple components and models for improved speaker identification in spontaneous Arabic speech in adverse acoustic conditions.
-
Speech Recognition as Feature Extraction for Speaker Recognition
We present specific techniques and results from SRI’s NIST speaker recognition evaluation system.
-
Significance of Joint Features Derived from the Modified Group Delay Function in Speech Processing
This paper investigates the significance of combining cepstral features derived from the modified group delay function with those derived from the short-time spectral magnitude, such as the MFCC.
-
The ICSI-SRI Spring 2006 Meeting Recognition System
We describe the development of the ICSI-SRI speech recognition system for the NIST Spring 2006 Meeting Rich Transcription (RT-06S) evaluation, highlighting improvements, including the delay-and-sum algorithm, the nearfield segmenter, language…