Speech & natural language publications
-
IraqComm: A Next Generation Translation System
This paper describes the IraqComm translation system that mediates and translates spontaneous conversations between an English speaker and a speaker of colloquial Iraqi Arabic.
-
Duration and Pronunciation Conditioned Lexical Modeling for Speaker Verification
We propose a method to improve speaker recognition lexical model performance using acoustic-prosodic information. More specifically, the lexical model is trained using duration- and pronunciation-conditioned word N-grams, simultaneously modeling lexical…
-
Parameterization of Prosodic Feature Distributions for SVM Modeling in Speaker Recognition
This paper explores the important question of finding a good kernel for a system that models syllable-based prosodic features using support vector machines (SVMs). We introduce two new methods for…
-
Statistical Sentence Extraction for Information Distillation
In this paper, we present a statistical sentence extraction approach for distillation. Basically, we frame this task as a classification problem, where each candidate sentence in documents is classified as…
-
NAP and WCCN: Comparison of Approaches Using MLLR-SVM Speaker Verification System
We compare two recently proposed techniques, within-class covariance normalization (WCCN) [1] and nuisance attribute projection (NAP) [2], for intersession variability compensation in speaker verification.
-
Analysis of Morph-Based Speech Recognition and the Modeling of Out-of-Vocabulary Words Across Languages
We analyze subword-based language models (LMs) in large-vocabulary continuous speech recognition across four “morphologically rich” languages: Finnish, Estonian, Turkish, and Egyptian Colloquial Arabic.
-
Unsupervised Language Model Adaptation for Meeting Recognition
We present an application of unsupervised language model (LM) adaptation to meeting recognition, in a scenario where sequences of multiparty meetings on related topics are to be recognized, but no…
-
Combining Discriminative Feature, Transform, and Model Training for Large Vocabulary Speech Recognition
This paper uses a state-of-the-art Mandarin recognition system as a platform to study the interaction of three techniques. Experiments in the broadcast news and broadcast conversation domains show that the…
-
Noise Robust Speaker Identification for Spontaneous Arabic Speech
We present an approach that integrates multiple components and models for improved speaker identification in spontaneous Arabic speech in adverse acoustic conditions.
-
Speech Recognition as Feature Extraction for Speaker Recognition
We present specific techniques and results from SRI’s NIST speaker recognition evaluation system.
-
The ICSI-SRI Spring 2006 Meeting Recognition System
We describe the development of the ICSI-SRI speech recognition system for the NIST Spring 2006 Meeting Rich Transcription (RT-06S) evaluation, highlighting improvements, including the delay-and-sum algorithm, the nearfield segmenter, language…
-
Ambisonic Localisation – PART 2