Computer vision publications

October 8, 2021

Global Heading Estimation for Wide Area Augmented Reality Using Road Semantics for Geo-referencing

BySupun Samarasekera, Rakesh Kumar

We present a method to estimate global camera heading by associating directional information from road segments in the camera view with annotated satellite imagery.

2d-3d reasoning and augmented reality publications, Computer vision publications, Publications
August 27, 2021

Long-Range Augmented Reality with Dynamic Occlusion Rendering

BySupun Samarasekera, Han-Pang Chiu, Rakesh Kumar

This paper addresses the problem of fast and accurate dynamic occlusion reasoning by real objects in the scene for large scale outdoor AR applications.

2d-3d reasoning and augmented reality publications, Computer vision publications, Publications
July 14, 2021

“How to best say it?” : Translating Directives in Machine Language into Natural Language in the Blocks World

We propose a method to generate optimal natural language for block placement directives generated by a machine's planner during human-agent interactions in the blocks world.

Computer vision publications, Machine learning publications, Publications
June 8, 2021

Comprehension Based Question Answering Using Bloom’s Taxonomy

ByAjay Divakaran, Sara Rutherford-Quach

Our experiments focus on zero-shot question answering, using the taxonomy to provide proximal context that helps the model answer questions by being relevant to those questions.

Computer vision publications, Machine learning publications, Publications
May 30, 2021

MaAST: Map Attention with Semantic Transformers for Efficient Visual Navigation

ByHan-Pang Chiu, Supun Samarasekera, Rakesh Kumar

Through this work, we design a novel approach that focuses on performing better or comparable to the existing learning-based solutions but under a clear time/computational budget.

Collaborative human robot autonomy publications, Computer vision publications, Publications
April 23, 2021

Towards Explainable Student Group Collaboration Assessment Models Using Temporal Representations of Individual Student Role and Behavioral Cues

ByNonye M. Alozie, Bladimir Lopez-Prado

In this paper we propose using simple temporal-CNN deep-learning models to assess student group collaboration that take in temporal representations of individual student roles as input.

Computer vision publications, Human behavior modeling publications, Publications
April 1, 2021

Hyper-Dimensional Analytics of Video Action at the Tactical Edge

ByMichael A. Isnardi, David Zhang, Michael Piacentino, Gooitzen van der Wal

We review HyDRATE, a low-SWaP reconfigurable neural network architecture developed under the DARPA AIE HyDDENN (Hyper-Dimensional Data Enabled Neural Network) program.

Computational sensing-low-power processing publications, Computer vision publications, Publications
April 1, 2021

Modular Adaptation for Cross-Domain Few-Shot Learning

ByAjay Divakaran, Yi Yao

While literature has demonstrated great successes via representation learning, in this work, we show that improvement of downstream tasks can also be achieved by appropriate designs of the adaptation process.

Computer vision publications, Machine learning publications, Publications
April 1, 2021

Confidence Calibration for Domain Generalization under Covariate Shift

ByYi Yao, Ajay Divakaran, Melinda Gervasio

We present novel calibration solutions via domain generalization. Our core idea is to leverage multiple calibration domains to reduce the effective distribution disparity between the target and calibration domains for…

Computer vision publications, Machine learning publications, Publications
November 19, 2020

Hybrid Consistency Training with Prototype Adaptation for Few-Shot Learning

ByYi Yao, Ajay Divakaran

We introduce Hybrid Consistency Training to jointly leverage interpolation consistency, including interpolating hidden features, that imposes linear behavior locally and data augmentation consistency that learns robust embeddings against sample variations.

Computer vision publications, Machine learning publications, Publications
October 12, 2020

RGB2LIDAR: Towards Solving Large-Scale Cross-Modal Visual Localization

ByHan-Pang Chiu, Supun Samarasekera, Rakesh Kumar

We study an important, yet largely unexplored problem of large-scale cross-modal visual localization by matching ground RGB images to a geo-referenced aerial LIDAR 3D point cloud.

2d-3d reasoning and augmented reality publications, Computer vision publications, Publications
July 14, 2020

Lifelong learning using Eigentasks: Task separation, skill acquisition, and selective transfer

ByAjay Divakaran, Jesse Hostetler

We introduce the eigentask framework for lifelong learning. An eigentask is a pairing of a skill that solves a set of related tasks, paired with a generative model that can…

Computer vision publications, Machine learning publications, Publications