Author: Dimitra Vergyri
-
Calibration and Multiple System Fusion for Spoken Term Detection Using Linear Logistic Regression
This study presents an efficient and effective score calibration technique for keyword detection that is based on the logistic regression calibration approach commonly used in forensic speaker identification.
-
Medium-Duration Modulation Cepstral Feature for Robust Speech Recognition
In this paper, we present the Modulation of Medium Duration Speech Amplitude feature, which is a composite feature capturing subband speech modulations and a summary modulation.
-
Feature Fusion for High-Accuracy Keyword Spotting
This paper assesses the role of robust acoustic features in spoken term detection (a.k.a keyword spotting—KWS) under heavily degraded channel and noise corrupted conditions.
-
Strategies for high accuracy keyword detection in noisy channels
We present design strategies for a keyword spotting (KWS) system that operates in highly degraded channel conditions with very low signal-to-noise ratio levels.
-
Discriminatively trained phoneme confusion model for keyword spotting
This work proposes the use of discriminative training to construct a phoneme confusion model, which expands the phonemic index of a KWS system by adding phonemic variation to handle the abovementioned problems.
-
Using Prosodic and Spectral Features in Detecting Depression in Elderly Males
In this study, we focus on speech features that can identify the speaker’s emotional health, i.e., whether the speaker is depressed or not.
-
Effective Arabic dialect classification using diverse phonotactic models
We study the effectiveness of recently developed language recognition techniques based on speech recognition models for the discrimination of Arabic dialects.
-
Acoustic data sharing for Afghan and Persian languages
In this work, we compare several known approaches for multilingual acoustic modeling for three languages, Dari, Farsi and Pashto, which are of recent geo-political interest.
-
Implementing SRI’s Pashto speech-to-speech translation system on a smartphone
We describe our recent effort implementing SRI’s UMPC-based Pashto speech-to-speech (S2S) translation system on a smart phone running the Android operating system.
-
Automatic Speech Recognition of Multiple Accented English Data
We investigate the effect of multiple accents on an English broadcast news recognition system.
-
Speech-Based Automated Cognitive Status Assessment
The aim in this paper is to study the usability of automated methods for evaluating verbal cognitive status assessment tests for the elderly.
-
Anchored Speech Recognition for Question Answering
In this paper, we propose a novel question answering system that searches for responses from spoken documents such as broadcast news stories and conversations.