Speech Technology and Research Laboratory

Communicating with, and through, computer applications

The Speech Technology and Research (STAR) Laboratory brings together a multidisciplinary mix of engineers, computer scientists and linguists. Together, our experts build systems for a wide range of applications including signal processing; data indexing and mining; and computer-aided learning. SRI’s speech and language technologies allow us to interact more naturally with computing applications and provide a wealth of actionable information about our intentions, health, and emotional state.

Core technologies and applications

Speech recognition
- Noise robustness
- Speech production and perception-based features
- Keyword spotting
- Prosodic modeling and disfluencies

Speech & audio analytics
- Voice biometrics
- Language/accent identification
- Speaker and speaker-state characterization
- Audio event detection
- Speaker diarization

Machine translation
- Speech-to-speech translation
- Cross-lingual information retrieval
- Machine-mediated cross-lingual communication

Natural language understanding
- Human-computer interaction
- Dialog systems and virtual personal assistants (VPAs)
- Error detection and recovery
- Semantic and syntactic parsing

Information extraction
- Multi-lingual information extraction
- Topic and event identification
- Summarization
- Question answering

Real-world impact

March 26, 2024

SRI’s AI-driven voice analysis could help screen for mental health conditions

Researchers at SRI are developing tools to help clinicians keep a close eye on depression, PTSD, and other mental health issues.

October 16, 2023

SRI is developing textiles that record audio

Turning piezoelectric materials and lithium-ion batteries into thread, innovators will weave fabrics that record sound.

July 5, 2022

Nuance Partners with SCIENTIA Puerto Rico

SRI spin-out Nuance Communications to expand access to its Dragon Medical One for the island’s physicians and nurses.

Featured researchers

Dimitra Vergyri
Director, Speech Technology and Research Laboratory (STAR)

Horacio Franco
Chief Scientist, Speech Technology and Research Laboratory

Aaron Lawson
Assistant Laboratory Director, Speech Technology and Research Laboratory

Martin Graciarena
Technical Manager, Speech Technology and Research Laboratory

Mitchell McLaren
Senior Computer Scientist, Speech Technology and Research Laboratory

Harry Bratt
Senior Computer Scientist, Speech Technology and Research Laboratory

Platforms

Open Language Interface for Voice Exploitation (OLIVE)

Novel speech processing technology leverages AI algorithms to enable speech activity detection in high levels of noise and distortion.

SenSay

Real-time speaker state platform estimates speaker state, such as emotion, sentiment, cognition, health, mental health and communication quality, in a range of end applications.

DynaSpeak® speech recognition engine

Small-footprint, high-accuracy engine incorporates patented techniques that increase recognition performance using speaker adaptation, microphone adaptation, end-of-speech detection, distributed speech recognition and noise robustness.

EduSpeak® speech recognition toolkit

Toolkit specifically designed for language-learning applications and other educational and training software. It works for both adult and child voices and excels at recognizing native and non-native speakers.

SRI Language Modeling (SRILM)

Toolkit helps build and apply statistical language models for speech recognition, statistical tagging and segmentation, and machine translation. It can be downloaded and used free of charge.

Publications

November 18, 2022

Toward Fail-Safe Speaker Recognition: Trial-Based Calibration with a Reject Option

In this work, we extend the TBC method, proposing a new similarity metric for selecting training data that results in significant gains over the one proposed in the original work.

October 1, 2021

Resilient Data Augmentation Approaches to Multimodal Verification in the News Domain

Building on multimodal embedding techniques, we show that data augmentation via two distinct approaches improves results: entity linking and cross-domain local similarity scaling.

July 27, 2021

Natural Language Access: When Reasoning Makes Sense

We argue that to use natural language effectively, we must have both a deep understanding of the subject domain and a general-purpose reasoning capability.

Contact Us

333 Ravenswood Ave
Menlo Park, CA 94025 USA

+1 (650) 859-2000


Core technologies and applications

- Speech recognition
- Speech & audio analytics
- Machine translation
- Natural language understanding
- Information extraction

Platforms

- Open Language Interface for Voice Exploitation (OLIVE)
- SenSay
- DynaSpeak® speech recognition engine
- EduSpeak® speech recognition toolkit
- SRI Language Modeling (SRILM)