Author: Dimitra Vergyri
-
Recent advances in SRI’s IraqComm Iraqi Arabic-English speech-to-speech translation system
We summarize recent progress on SRI’s IraqComm™ Iraqi Arabic-English two-way speech-to-speech translation system.
-
Development of the SRI/Nightingale Arabic ASR System
We describe the large-vocabulary automatic speech recognition system for Modern Standard Arabic that was developed for the 2007 GALE evaluation as part of the speech translation system.
-
Open-vocabulary spoken term detection using graphone-based hybrid recognition systems
We address the problem of retrieving out-of-vocabulary (OOV) words/queries from audio archives for the spoken term detection (STD) task. In this work, we employ hybrid recognition systems that contain both words and subword units (graphones) to generate hybrid lattice indexes.
-
OOV Detection by Joint Word/Phone Lattice Alignment
We propose a new method for detecting out-of-vocabulary (OOV) words for large vocabulary continuous speech recognition (LVCSR) systems. Our method is based on performing a joint alignment between independently generated word and phone lattices, where the word-lattice is aligned via a recognition lexicon.
-
IraqComm: A Next Generation Translation System
This paper describes the IraqComm translation system that mediates and translates spontaneous conversations between an English speaker and a speaker of colloquial Iraqi Arabic.
-
The SRI/OGI 2006 Spoken Term Detection System
This paper describes the system developed jointly at SRI and OGI for participation in the 2006 NIST Spoken Term Detection (STD) evaluation.
-
Morphology-based Language Modeling for Conversational Arabic Speech Recognition
In this paper, we investigate improvements in Arabic language modeling by developing various morphology-based language models. We present four different approaches to morphology-based language modeling, including a novel technique called factored language models.
-
Cross-Domain and Cross-Language Portability of Acoustic Features Estimated by Multilayer Perceptrons
In this paper we investigate how portable acoustic features estimated by multilayer perceptrons (MLPs) are across domains and languages. We show that even without retraining, English-trained MLP features can provide a significant boost to recognition accuracy in new domains within the same language, as well as in entirely different languages such as Mandarin and Arabic.
-
The Use of Word N-grams and Parts of Speech for Hierarchical Cluster Language Modeling
We extend the backoff hierarchical class n-gram language modeling of Zitouni et al. (2003) by studying the use of part-of-speech (POS) information in hierarchical word clustering.
-
Speech Translation for Low-Resource Languages: The Case of Pashto
We present a number of challenges and solutions that have arisen in the development of a speech translation system for American English and Pashto, highlighting those specific to a very low-resource language.
-
Development of a Conversational Telephone Speech Recognizer for Levantine Arabic
In this paper, we describe the development of a large-vocabulary speech recognition system for Levantine Arabic, which was a new dialectal recognition task for our existing system. We discuss the dialect-specific modeling choices and investigate to what extent techniques previously tested on other languages are portable to the present task.
-
Morphology-Based Language Modeling for Arabic Speech Recognition
In this paper we investigate the use of morphology-based language models at different stages in a speech recognition system for conversational Arabic.