Speech & natural language publications

August 1, 2007

fMPE-MAP: Improved Discriminative Adaptation for Modeling New Domains

This paper introduces a new adaptation approach, fMPE-MAP, which is an extension to the original fMPE (feature minimum phone error) algorithm, with the enhanced ability in porting Gaussian models and…

Publications, Speech & natural language publications
August 1, 2007

A Semi-Supervised Learning Approach for Morpheme Segmentation for an Arabic Dialect

We evaluate our approach by applying morpheme segmentation to the training data of a statistical machine translation (SMT) system. Experiments show that our approach is less sensitive to the availability…

Publications, Speech & natural language publications
April 1, 2007

NAP and WCCN: Comparison of Approaches Using MLLR-SVM Speaker Verification System

We compare two recently proposed techniques, within class covariance normalization (WCCN) [1] and nuisance attribute projection (NAP) [2], for intersession variability compensation in speaker verification.

Publications, Speech & natural language publications
April 1, 2007

Analysis of Morph-Based Speech Recognition and the Modeling of Out-of-Vocabulary Words Across Languages

We analyze subword-based language models (LMs) in large-vocabulary continuous speech recognition across four “morphologically rich” languages: Finnish, Estonian, Turkish, and Egyptian Colloquial Arabic.

Publications, Speech & natural language publications
April 1, 2007

Unsupervised Language Model Adaptation for Meeting Recognition

We present an application of unsupervised language model (LM) adaptation to meeting recognition, in a scenario where sequences of multiparty meetings on related topics are to be recognized, but no…

Publications, Speech & natural language publications
April 1, 2007

Combining Discriminative Feature, Transform, and Model Training for Large Vocabulary Speech Recognition

This paper uses a state-of-the-art Mandarin recognition system as a platform to study the interaction of three techniques. Experiments in the broadcast news and broadcast conversation domains show that the…

Publications, Speech & natural language publications
April 1, 2007

Noise Robust Speaker Identification for Spontaneous Arabic Speech

We present an approach that integrates multiple components and models for improved speaker identification in spontaneous Arabic speech in adverse acoustic conditions.

Publications, Speech & natural language publications
April 1, 2007

Speech Recognition as Feature Extraction for Speaker Recognition

We present specific techniques and results from SRI’s NIST speaker recognition evaluation system.

Publications, Speech & natural language publications
April 1, 2007

Parameterization of Prosodic Feature Distributions for SVM Modeling in Speaker Recognition

This paper explores the important question of finding a good kernel for a system that models syllable-based prosodic features using support vector machines (SVMs). We introduce two new methods for…

Publications, Speech & natural language publications
April 1, 2007

Statistical Sentence Extraction for Information Distillation

In this paper, we present a statistical sentence extraction approach for distillation. Basically, we frame this task as a classification problem, where each candidate sentence in documents is classified as…

Publications, Speech & natural language publications
January 1, 2007

The ICSI-SRI Spring 2006 Meeting Recognition System

We describe the development of the ICSI-SRI speech recognition system for the NIST Spring 2006 Meeting Rich Transcription (RT-06S) evaluation, highlighting improvements, including the delay-and-sum algorithm, the nearfield segmenter, language…

Publications, Speech & natural language publications
January 1, 2007

Ambisonic Localisation – PART 2

ByAaron J Heller

Publications, Speech & natural language publications