I propose to create a semantic multimodal search engine for collections of transcribed and aligned videos using state-of-the-art artificial...
Extraction of Gesture Features
Dhruv Tyagi
The way humans interact with each other occurs in multimodality. We not only articulate words but also, show them. Expressing different concepts such...
Multi-modal Stance Detection on Television News
KaranjotSingh
With the increased American viewership of cable news stations, it has become important to understand the stance of the news stories, relating to...
Red Hen Anonymizer
Saksham Gautam
The Red Hen Anonymizer is a software that uses deep learning and signal processing techniques to anonymize audio-visual data. The proposed project...
Classification of body-keypose trajectories of gesture co-occurring with time expressions using GNNs
SSP
This project aims to improve the previous iteration of the project where the hypothesis of it was the existence of a relation between the body...