5 dataset results for Decision Making AND Videos

Charades-STA

Charades-STA is a new dataset built on top of Charades by adding sentence temporal annotations.

187 PAPERS • 4 BENCHMARKS

ROAD

ROAD (ROAD: The ROad event Awareness Dataset for Autonomous Driving)

ROAD is designed to test an autonomous vehicle's ability to detect road events, defined as triplets composed by an active agent, the action(s) it performs and the corresponding scene locations. ROAD comprises videos originally from the Oxford RobotCar Dataset, annotated with bounding boxes showing the location in the image plane of each road event.

20 PAPERS • NO BENCHMARKS YET

BLVD

BLVD is a large scale 5D semantics dataset collected by the Visual Cognitive Computing and Intelligent Vehicles Lab. This dataset contains 654 high-resolution video clips owing 120k frames extracted from Changshu, Jiangsu Province, China, where the Intelligent Vehicle Proving Center of China (IVPCC) is located. The frame rate is 10fps/sec for RGB data and 3D point cloud. The dataset contains fully annotated frames which yield 249,129 3D annotations, 4,902 independent individuals for tracking with the length of overall 214,922 points, 6,004 valid fragments for 5D interactive event recognition, and 4,900 individuals for 5D intention prediction. These tasks are contained in four kinds of scenarios depending on the object density (low and high) and light conditions (daytime and nighttime).

9 PAPERS • NO BENCHMARKS YET

GolfDB

GolfDB is a high-quality video dataset created for general recognition applications in the sport of golf, and specifically for the task of golf swing sequencing.

4 PAPERS • NO BENCHMARKS YET

D2City

A large-scale comprehensive collection of dashcam videos collected by vehicles on DiDi's platform. D2-City contains more than 10000 video clips which deeply reflect the diversity and complexity of real-world traffic scenarios in China.

1 PAPER • NO BENCHMARKS YET

Datasets

5 dataset results for Decision Making AND Videos