Multi-modal data analytics publications
-
Semantic Pooling for Complex Event Detection
We propose a semantic pooling approach to tackle this issue. Unlike the conventional pooling over the entire video or specific spatial regions of a video, we employ a discriminative approach…
-
Video Object Segmentation through Spatially Accurate and Temporally Dense Extraction of Primary Object Regions
In this paper, we propose a novel approach to extract primary object segments in videos in the ‘object proposal’ domain. The extracted primary object regions are then used to build…
-
3D Visual Proxemics: Recognizing Human Interactions in 3D from a Single Image
We present a unified framework for detecting and classifying interactions between people in unconstrained, user-generated images.
-
On the Applicability of Speaker Diarization to Audio Indexing of Non-Speech and Mixed Non-Speech/Speech Video
This paper explores how unsupervised audio segmentation systems such as speaker diarization can be adapted to automatically identify low-level sound concepts similar to annotator-defined concepts and how these concepts can…
-
Domain Adaptive Object Detection
We study the use of domain adaptation and transfer learning techniques as part of a framework for adaptive object detection.
-
Multimedia Event Recounting with Concept Based Representation
We conduct a pilot study of the multimedia event recounting problem, which answers the question of why a video is recognized as a given event, i.e., what evidence this decision is made…
-
Multi-Modal Pedestrian Detection on the Move
This paper presents an on-the-move pedestrian detection system that utilizes multiple sensor modalities to improve detection rates at deployable computational loads.
-
Geo-Localization of Street Views with Aerial Image Databases
We study the feasibility of geolocalizing ground-level images in urban areas with respect to a database of images captured from the air such as…
-
Skeleton Search: Category-Specific Object Recognition and Segmentation Using a Skeletal Shape Model
We describe a top-down object detection and segmentation approach that uses a skeleton-based shape model and that works directly on real images.
-
Vehicle Tracking Across Nonoverlapping Cameras Using Joint Kinematic and Appearance Features
We describe a vehicle tracking algorithm using input from a network of nonoverlapping cameras.
-
Capturing Culture and Effects Variables Using Structured Argumentation
We describe the intended use of S-CAT on an illustrative use case, and discuss our use of structured argumentation as a representation technique to capture both culture variables and effects…