Author: SRI International
-
Combining Feature Sets with Support Vector Machines: Application to Speaker Recognition
In this paper, we describe a general technique for optimizing the relative weights of feature sets in a support vector machine (SVM) and show how it can be applied to the field of speaker recognition. Our training procedure uses an objective function that maps the relative weights of the feature sets directly to a classification…
-
Task Management under Change and Uncertainty: Constraint Solving Experience with the CALO Project
We outline the challenges and opportunities presented by constraint solving in the presence of change and uncertainty, embodied in CALO’s personalized time management and task reasoning and execution systems.
-
Mapping The Distribution Of Expertise And Resources In A School: Investigating The Potential Of Using Social Network Analysis In Evaluation
This paper describes results of a study investigating the potential of using social network analysis to evaluate the capacity of a school to undertake a schoolwide educational reform.
-
VERL: An ontology framework for representing and annotating video events
This article describes the findings of a recent workshop series that has produced an ontology framework for representing video events-called Video Event Representation Language (VERL) -and a companion annotation framework, called Video Event Markup Language (VEML). One of the key concepts in this work is the modeling of events as composable, whereby complex events are…
-
Singapore Tablet PC Program Study: Executive Summary and Final Report, Volume 1, Technical Findings
-
Singapore Tablet PC Program Study: Executive Summary and Final Report, Volume 2, Technical Appendicies
-
Does Active Learning Help Automatic Dialog Act Tagging in Meeting Data?
We ask if active learning with lexical cues can help for this task and this domain. To better address this question, we explore active learning for two different types of DA models — hidden Markov models (HMMs) and maximum entropy (maxent).
-
Pushing the Envelope — Aside
Despite successes, there are still significant limitations to speech recognition performance. For this reason, authors have proposed methods that incorporate different (and larger) analysis windows, which are described in this article.
-
Comparing HMM, Maximum Entropy, and Conditional Random Fields for Disfluency Detection
We compare a generative hidden Markov model (HMM)-based approach and two conditional models — a maximum entropy (Maxent) model and a conditional random field (CRF) — for detecting disfluencies in speech. The conditional modeling approaches provide a more principled way to model correlated features.
-
Improved Discriminative Training Using Phone Lattices
We present an efficient discriminative training procedure utilizing phone lattices. Different approaches to expediting lattice generation, statistics collection, and convergence were studied.
-
Two Experiments Comparing Reading with Listening for Human Processing of Conversational Telephone Speech
We report on results of two experiments designed to compare subjects’ ability to extract information from audio recordings of conversational telephone speech (CTS) with their ability to extract information from text transcripts of these conversations, with and without the ability to hear the audio recordings.
-
Class-dependent Score Combination for Speaker Recognition
In this work, we are presenting a class-based score combination technique that relies on clustering of both the target models and the test utterances in a vector space defined by a set of speaker-specific transformation parameters estimated during transcription of the talker.