Dimitra Vergyri

May 1, 2014

Feature Fusion for High-Accuracy Keyword Spotting

This paper assesses the role of robust acoustic features in spoken term detection (a.k.a keyword spotting—KWS) under heavily degraded channel and noise corrupted conditions.

May 1, 2014

Calibration and Multiple System Fusion for Spoken Term Detection Using Linear Logistic Regression

This study presents an efficient and effective score calibration technique for keyword detection that is based on the logistic regression calibration approach commonly used in forensic speaker identification.

May 1, 2014

Medium-Duration Modulation Cepstral Feature for Robust Speech Recognition

In this paper, we present the Modulation of Medium Duration Speech Amplitude feature, which is a composite feature capturing subband speech modulations and a summary modulation.

August 1, 2013

Strategies for high accuracy keyword detection in noisy channels

We present design strategies for a keyword spotting (KWS) system that operates in highly degraded channel conditions with very low signal-to-noise ratio levels.

September 1, 2012

Discriminatively trained phoneme confusion model for keyword spotting

This work proposes the use of discriminative training to construct a phoneme confusion model, which expands the phonemic index of a KWS system by adding phonemic variation to handle the abovementioned problems.

August 1, 2011

Using Prosodic and Spectral Features in Detecting Depression in Elderly Males

In this study, we focus on speech features that can identify the speaker’s emotional health, i.e., whether the speaker is depressed or not.

August 1, 2011

Effective Arabic dialect classification using diverse phonotactic models

We study the effectiveness of recently developed language recognition techniques based on speech recognition models for the discrimination of Arabic dialects.

May 1, 2011

Acoustic data sharing for Afghan and Persian languages

In this work, we compare several known approaches for multilingual acoustic modeling for three languages, Dari, Farsi and Pashto, which are of recent geo-political interest.

December 1, 2010

Implementing SRI’s Pashto speech-to-speech translation system on a smartphone

We describe our recent effort implementing SRI’s UMPC-based Pashto speech-to-speech (S2S) translation system on a smart phone running the Android operating system.

January 1, 2010

Speech-Based Automated Cognitive Status Assessment

The aim in this paper is to study the usability of automated methods for evaluating verbal cognitive status assessment tests for the elderly.

January 1, 2010

Automatic Speech Recognition of Multiple Accented English Data

We investigate the effect of multiple accents on an English broadcast news recognition system.

June 1, 2009

Anchored Speech Recognition for Question Answering

In this paper, we propose a novel question answering system that searches for responses from spoken documents such as broadcast news stories and conversations.

Author: Dimitra Vergyri