Publications
-
Unscented Transform for iVector-Based Noisy Speaker Recognition
In this paper, it is proposed to substitute the first order VTS by an unscented transform, where unlike VTS, the nonlinear function is not applied over the clean model parameters…
-
Effective Use of DCTS for Contextualizing Features for Speaker Recognition
This article proposes a new approach for contextualizing features for speaker recognition through the discrete cosine transform (DCT).
-
Effects of Thermal Treatment on Radiative Properties of HVPE Grown InP Layers
We have studied radiative properties of 21 micron thick InP layers grown by HVPE and found them comparable to those of best luminescent bulk InP virgin wafers. This opens up…
-
Forensic Prescreening System Using Coded Aperture Snapshot Spectral Imager
We present a camera system for instantaneous, non-destructive capture of spectral signatures for forensic analysis. Our system detects highly probative samples in the forensic scene mixed by the multiple target…
-
Late Fusion and Calibration for Multimedia Event Detection Using Few Examples
In this paper, we present two parametric approaches to late fusion: a normalization scheme for arithmetic mean fusion (logistic averaging) and a fusion scheme based on logistic regression, and compare…
-
Calibration and Multiple System Fusion for Spoken Term Detection Using Linear Logistic Regression
This study presents an efficient and effective score calibration technique for keyword detection that is based on the logistic regression calibration approach commonly used in forensic speaker identification.
-
Studies of a Prototype Linear Stationary X-Ray Source for Tomosynthesis Imaging
A prototype linear x-ray source to implement stationary source–stationary detector tomosynthesis (TS) imaging has been studied.
-
Tripod Fall: Concept and Experiments of a Novel Approach to Humanoid Robot Fall Damage Reduction
This paper addresses a new control strategy to reduce the damage to a humanoid robot during a fall. Instead of following the traditional approach of finding a favorable configuration with…
-
Global Ethics and Virtual Worlds: Ensuring Functional Integrity in Transnational Research Studies
This paper examines a number of issues in this research context, with particular stress on the challenges posed by transnational experimental projects in virtual worlds and social networks.
-
Highly Accurate Phonetic Segmentation Using Boundary Correction Models and System Fusion
Accurate phone-level segmentation of speech remains an important task for many subfields of speech research. We investigate techniques for boosting the accuracy of automatic phonetic segmentation based on HMM acoustic-phonetic…
-
Adaptive and Discriminative Modeling for Improved Mispronunciation Detection
In the context of computer-aided language learning, automatic detection of specific phone mispronunciations by nonnative speakers can be used to provide detailed feedback about specific pronunciation problems.
-
Simplified VTS-Based I-Vector Extraction in Noise-Robust Speaker Recognition
In this work, we propose an efficient simplification scheme, named sVTS, in order to show that the VTS approach gives improvements in large scale applications compared to state-of-the-art systems.