Search Results for author: Ravi Kiran Sarvadevabhatla

Found 38 papers, 24 papers with code

Transfer-LMR: Heavy-Tail Driving Behavior Recognition in Diverse Traffic Scenarios

no code implementations • 8 May 2024 • Chirag Parikh, Ravi Shankar Mishra, Rohan Chandra, Ravi Kiran Sarvadevabhatla

Recognizing driving behaviors is important for downstream tasks such as reasoning, planning, and navigation.

Paper
Add Code

IDD-X: A Multi-View Dataset for Ego-relative Important Object Localization and Explanation in Dense and Unstructured Traffic

1 code implementation • 12 Apr 2024 • Chirag Parikh, Rohit Saluja, C. V. Jawahar, Ravi Kiran Sarvadevabhatla

Intelligent vehicle systems require a deep understanding of the interplay between road conditions, surrounding entities, and the ego vehicle's driving behavior for safe and efficient navigation.

Object Object Localization

Paper
Code

A Fine-Grained Vehicle Detection (FGVD) Dataset for Unconstrained Roads

1 code implementation • 30 Dec 2022 • Prafful Kumar Khoba, Chirag Parikh, Rohit Saluja, Ravi Kiran Sarvadevabhatla, C. V. Jawahar

Along with providing baseline results for existing object detectors on FGVD Dataset, we also present the results of a combination of an existing detector and the recent Hierarchical Residual Network (HRN) classifier for the FGVD task.

Paper
Code

Action-GPT: Leveraging Large-scale Language Models for Improved and Generalized Action Generation

no code implementations • 28 Nov 2022 • Sai Shashank Kalakonda, Shubh Maheshwari, Ravi Kiran Sarvadevabhatla

We show that utilizing these detailed descriptions instead of the original action phrases leads to better alignment of text and motion spaces.

Action Generation

Paper
Add Code

DrawMon: A Distributed System for Detection of Atypical Sketch Content in Concurrent Pictionary Games

no code implementations • 10 Nov 2022 • Nikhil Bansal, Kartik Gupta, Kiruthika Kannan, Sivani Pentapati, Ravi Kiran Sarvadevabhatla

Pictionary, the popular sketch-based guessing game, provides an opportunity to analyze shared goal cooperative game play in restricted communication settings.

Paper
Add Code

UAV-based Visual Remote Sensing for Automated Building Inspection

no code implementations • 27 Sep 2022 • Kushagra Srivastava, Dhruv Patel, Aditya Kumar Jha, Mohhit Kumar Jha, Jaskirat Singh, Ravi Kiran Sarvadevabhatla, Pradeep Kumar Ramancharla, Harikumar Kandath, K. Madhava Krishna

Unmanned Aerial Vehicle (UAV) based remote sensing system incorporated with computer vision has demonstrated potential for assisting building construction and in disaster management like damage assessment during earthquakes.

Management

Paper
Add Code

PSUMNet: Unified Modality Part Streams are All You Need for Efficient Pose-based Action Recognition

1 code implementation • 11 Aug 2022 • Neel Trivedi, Ravi Kiran Sarvadevabhatla

At the representation level, we propose a global frame based part stream approach as opposed to conventional modality based streams.

Ranked #13 on Skeleton Based Action Recognition on NTU RGB+D 120

Action Recognition Skeleton Based Action Recognition

Paper
Code

Detecting, Tracking and Counting Motorcycle Rider Traffic Violations on Unconstrained Roads

1 code implementation • 18 Apr 2022 • Aman Goyal, Dev Agarwal, Anbumani Subramanian, C. V. Jawahar, Ravi Kiran Sarvadevabhatla, Rohit Saluja

In many Asian countries with unconstrained road traffic conditions, driving violations such as not wearing helmets and triple-riding are a significant source of fatalities involving motorcycles.

Paper
Code

Counting in the 2020s: Binned Representations and Inclusive Performance Measures for Deep Crowd Counting Approaches

no code implementations • 10 Apr 2022 • Sravya Vardhani Shivapuja, Ashwin Gopinath, Ayush Gupta, Ganesh Ramakrishnan, Ravi Kiran Sarvadevabhatla

This skew affects all stages within the pipelines of deep crowd counting approaches.

Crowd Counting

Paper
Add Code

Automatic Quantification and Visualization of Street Trees

1 code implementation • 17 Jan 2022 • Arpit Bahety, Rohit Saluja, Ravi Kiran Sarvadevabhatla, Anbumani Subramanian, C. V. Jawahar

We obtain TCDCA of 96. 77% on the test videos, with a remarkable improvement of 22. 58% over baseline, and demonstrate that our counting module's performance is close to human level.

Paper
Code

MUGL: Large Scale Multi Person Conditional Action Generation with Locomotion

1 code implementation • 21 Oct 2021 • Shubh Maheshwari, Debtanu Gupta, Ravi Kiran Sarvadevabhatla

We introduce MUGL, a novel deep neural model for large-scale, diverse generation of single and multi-person pose-based action sequences with locomotion.

Action Generation

Paper
Code

MeronymNet: A Hierarchical Approach for Unified and Controllable Multi-Category Object Generation

1 code implementation • 17 Oct 2021 • Rishabh Baghel, Abhishek Trivedi, Tejas Ravichandran, Ravi Kiran Sarvadevabhatla

We introduce MeronymNet, a novel hierarchical approach for controllable, part-based generation of multi-category objects using a single unified model.

Object

Paper
Code

F3: Fair and Federated Face Attribute Classification with Heterogeneous Data

1 code implementation • 6 Sep 2021 • Samhita Kanaparthy, Manisha Padala, Sankarshan Damle, Ravi Kiran Sarvadevabhatla, Sujit Gujar

F3 adopts multiple heuristics to improve fairness across different demographic groups without requiring data homogeneity assumption.

Attribute Classification +2

Paper
Code

Palmira: A Deep Deformable Network for Instance Segmentation of Dense and Uneven Layouts in Handwritten Manuscripts

1 code implementation • 21 Aug 2021 • Prema Satish Sharan, Sowmya Aitha, Amandeep Kumar, Abhishek Trivedi, Aaron Augustine, Ravi Kiran Sarvadevabhatla

Handwritten documents are often characterized by dense and uneven layout.

Instance Segmentation Segmentation +1

Paper
Code

BoundaryNet: An Attentive Deep Network with Fast Marching Distance Maps for Semi-automatic Layout Annotation

1 code implementation • 21 Aug 2021 • Abhishek Trivedi, Ravi Kiran Sarvadevabhatla

Precise boundary annotations of image regions can be crucial for downstream applications which rely on region-class semantics.

Paper
Code

Wisdom of (Binned) Crowds: A Bayesian Stratification Paradigm for Crowd Counting

1 code implementation • 19 Aug 2021 • Sravya Vardhani Shivapuja, Mansi Pradeep Khamkar, Divij Bajaj, Ganesh Ramakrishnan, Ravi Kiran Sarvadevabhatla

We analyze the performance of representative crowd counting approaches across standard datasets at per strata level and in aggregate.

Crowd Counting

Paper
Code

Hear Me Out: Fusional Approaches for Audio Augmented Temporal Action Localization

1 code implementation • 27 Jun 2021 • Anurag Bagchi, Jazib Mahmood, Dolton Fernandes, Ravi Kiran Sarvadevabhatla

State of the art architectures for untrimmed video Temporal Action Localization (TAL) have only considered RGB and Flow modalities, leaving the information-rich audio modality totally unexploited.

Ranked #1 on Temporal Action Localization on THUMOS'14

Action Recognition Temporal Action Localization

Paper
Code

Monocular Multi-Layer Layout Estimation for Warehouse Racks

1 code implementation • 16 Mar 2021 • Meher Shashwat Nigam, Avinash Prabhu, Anurag Sahu, Puru Gupta, Tanvi Karandikar, N. Sai Shankar, Ravi Kiran Sarvadevabhatla, K. Madhava Krishna

Given a monocular colour image of a warehouse rack, we aim to predict the bird's-eye view layout for each shelf in the rack, which we term as multi-layer layout prediction.

Paper
Code

NTU-X: An Enhanced Large-scale Dataset for Improving Pose-based Recognition of Subtle Human Actions

1 code implementation • 27 Jan 2021 • Neel Trivedi, Anirudh Thatipelli, Ravi Kiran Sarvadevabhatla

The lack of fine-grained joints (facial joints, hand fingers) is a fundamental performance bottleneck for state of the art skeleton action recognition models.

Ranked #1 on Skeleton Based Action Recognition on NTU60-X

Action Recognition Skeleton Based Action Recognition

Paper
Code

Syntactically Guided Generative Embeddings for Zero-Shot Skeleton Action Recognition

1 code implementation • 27 Jan 2021 • Pranay Gupta, Divyanshu Sharma, Ravi Kiran Sarvadevabhatla

We deploy SynSE for the task of skeleton-based action sequence recognition.

Ranked #1 on Zero Shot Skeletal Action Recognition on NTU RGB+D 120

Action Recognition Generalized Zero-Shot Learning +2

Paper
Code

Early Bird: Loop Closures from Opposing Viewpoints for Perceptually-Aliased Indoor Environments

no code implementations • 3 Oct 2020 • Satyajit Tourani, Dhagash Desai, Udit Singh Parihar, Sourav Garg, Ravi Kiran Sarvadevabhatla, Michael Milford, K. Madhava Krishna

In particular, our integration of VPR with SLAM by leveraging the robustness of deep-learned features and our homography-based extreme viewpoint invariance significantly boosts the performance of VPR, feature correspondence, and pose graph submodules of the SLAM pipeline.

Visual Place Recognition

Paper
Add Code

Quo Vadis, Skeleton Action Recognition ?

1 code implementation • 4 Jul 2020 • Pranay Gupta, Anirudh Thatipelli, Aditya Aggarwal, Shubh Maheshwari, Neel Trivedi, Sourav Das, Ravi Kiran Sarvadevabhatla

To study skeleton-action recognition in the wild, we introduce Skeletics-152, a curated and 3-D pose-annotated subset of RGB videos sourced from Kinetics-700, a large-scale action dataset.

Ranked #1 on Skeleton Based Action Recognition on Skeletics-152

Action Recognition Benchmarking +1

Paper
Code

OPAL-Net: A Generative Model for Part-based Object Layout Generation

no code implementations • 30 May 2020 • Rishabh Baghel, Ravi Kiran Sarvadevabhatla

We propose OPAL-Net, a novel hierarchical architecture for part-based layout generation of objects from multiple categories using a single unified model.

Paper
Add Code

Topological Mapping for Manhattan-like Repetitive Environments

1 code implementation • 16 Feb 2020 • Sai Shubodh Puligilla, Satyajit Tourani, Tushar Vaidya, Udit Singh Parihar, Ravi Kiran Sarvadevabhatla, K. Madhava Krishna

At the intermediate level, the map is represented as a Manhattan Graph where the nodes and edges are characterized by Manhattan properties and as a Pose Graph at the lower-most level of detail.

Paper
Code

Indiscapes: Instance Segmentation Networks for Layout Parsing of Historical Indic Manuscripts

1 code implementation • 15 Dec 2019 • Abhishek Prusty, Sowmya Aitha, Abhishek Trivedi, Ravi Kiran Sarvadevabhatla

To address this deficiency, we introduce Indiscapes, the first ever dataset with multi-regional layout annotations for historical Indic manuscripts.

Instance Segmentation Optical Character Recognition (OCR) +1

Paper
Code

Operator-in-the-Loop Deep Sequential Multi-camera Feature Fusion for Person Re-identification

no code implementations • 19 Jul 2018 • K L Navaneet, Ravi Kiran Sarvadevabhatla, Shashank Shekhar, R. Venkatesh Babu, Anirban Chakraborty

Therefore, target identifications by operator in a subset of cameras cannot be utilized to improve ranking of the target in remaining set of network cameras.

Person Re-Identification

Paper
Add Code

Game of Sketches: Deep Recurrent Models of Pictionary-style Word Guessing

1 code implementation • 29 Jan 2018 • Ravi Kiran Sarvadevabhatla, Shiv Surya, Trisha Mittal, Venkatesh Babu Radhakrishnan

Similarly, performance on multi-disciplinary tasks such as Visual Question Answering (VQA) is considered a marker for gauging progress in Computer Vision.

Question Answering Visual Question Answering

Paper
Code

SketchParse : Towards Rich Descriptions for Poorly Drawn Sketches using Multi-Task Hierarchical Deep Networks

1 code implementation • 5 Sep 2017 • Ravi Kiran Sarvadevabhatla, Isht Dwivedi, Abhijat Biswas, Sahil Manocha, R. Venkatesh Babu

We propose SketchParse, the first deep-network architecture for fully automatic parsing of freehand object sketches.

Object Pose Prediction +2

Paper
Code

DeLiGAN : Generative Adversarial Networks for Diverse and Limited Data

2 code implementations • CVPR 2017 • Swaminathan Gurumurthy, Ravi Kiran Sarvadevabhatla, Venkatesh Babu Radhakrishnan

A class of recent approaches for generating images, called Generative Adversarial Networks (GAN), have been used to generate impressively realistic images of objects, bedrooms, handwritten digits and a variety of other image modalities.

Image Generation

111

Paper
Code

Object category understanding via eye fixations on freehand sketches

no code implementations • 20 Mar 2017 • Ravi Kiran Sarvadevabhatla, Sudharshan Suresh, R. Venkatesh Babu

In this paper, we analyze the results of a free-viewing gaze fixation study conducted on 3904 freehand sketches distributed across 160 object categories.

Object

Paper
Add Code

'Part'ly first among equals: Semantic part-based benchmarking for state-of-the-art object recognition systems

no code implementations • 23 Nov 2016 • Ravi Kiran Sarvadevabhatla, Shanthakumar Venkatraman, R. Venkatesh Babu

Our results show that the proposed benchmarking procedure enables additional differentiation among state-of-the-art object classifiers in terms of their ability to handle missing content and insufficient object detail.

Benchmarking Object +3

Paper
Add Code

Enabling My Robot To Play Pictionary : Recurrent Neural Networks For Sketch Recognition

1 code implementation • 11 Aug 2016 • Ravi Kiran Sarvadevabhatla, Jogendra Kundu, Babu R. Venkatesh

In our work, we propose a recurrent neural network architecture for sketch object recognition which exploits the long-term sequential and structural regularities in stroke data in a scalable manner.

Object Object Recognition +1

Paper
Code

SwiDeN : Convolutional Neural Networks For Depiction Invariant Object Recognition

1 code implementation • 29 Jul 2016 • Ravi Kiran Sarvadevabhatla, Shiv Surya, Srinivas S. S. Kruthiventi, Venkatesh Babu R

Current state of the art object recognition architectures achieve impressive performance but are typically specialized for a single depictive style (e. g. photos only, sketches only).

Ranked #1 on Depiction Invariant Object Recognition on Photo-Art-50

Depiction Invariant Object Recognition Object

Paper
Code

A Taxonomy of Deep Convolutional Neural Nets for Computer Vision

no code implementations • 25 Jan 2016 • Suraj Srinivas, Ravi Kiran Sarvadevabhatla, Konda Reddy Mopuri, Nikita Prabhu, Srinivas S. S. Kruthiventi, R. Venkatesh Babu

With this new paradigm, every problem in computer vision is now being re-examined from a deep learning perspective.

Paper
Add Code

Analyzing structural characteristics of object category representations from their semantic-part distributions

no code implementations • 15 Sep 2015 • Ravi Kiran Sarvadevabhatla, Venkatesh Babu R

Studies from neuroscience show that part-mapping computations are employed by human visual system in the process of object recognition.

Object Object Recognition

Paper
Add Code

Expresso : A user-friendly GUI for Designing, Training and Exploring Convolutional Neural Networks

1 code implementation • 25 May 2015 • Ravi Kiran Sarvadevabhatla, R. Venkatesh Babu

With a view to provide a user-friendly interface for designing, training and developing deep learning frameworks, we have developed Expresso, a GUI tool written in Python.

Paper
Code

Freehand Sketch Recognition Using Deep Features

no code implementations • 1 Feb 2015 • Ravi Kiran Sarvadevabhatla, R. Venkatesh Babu

Therefore, analyzing such sparse sketches can aid our understanding of the neuro-cognitive processes involved in visual representation and recognition.

Retrieval Sketch-Based Image Retrieval +1

Paper
Add Code

Category-Epitomes : Discriminatively Minimalist Representations for Object Categories

no code implementations • 31 Jan 2015 • Ravi Kiran Sarvadevabhatla, R. Venkatesh Babu

Freehand line sketches are an interesting and unique form of visual representation.

Object

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.