Videos

CRIPP-VQA (Counterfactual Reasoning about Implicit Physical Properties via Video Question Answering)

Introduced by Patel et al. in CRIPP-VQA: Counterfactual Reasoning about Implicit Physical Properties via Video Question Answering

CRIPP-VQA is a video question answering dataset for reasoning about the implicit physical properties of objects in a scene. It contains videos of object in motion, annotated with questions that involve counterfactual reasoning about actions, questions about planning in order to reach a goal, and descriptive questions about visible properties of objects.

Source: CRIPP-VQA: Counterfactual Reasoning about Implicit Physical Properties via Video Question Answering

Homepage

Benchmarks

Add a new result Link an existing benchmark

Task	Dataset Variant	Best Model
Descriptive	CRIPP-VQA	Aloe*+BERT
Remove - PQ	CRIPP-VQA	Aloe*+BERT
Remove - PO	CRIPP-VQA	Aloe*+BERT
Replace - PQ	CRIPP-VQA	Aloe*+BERT
Replace - PO	CRIPP-VQA	Aloe*+BERT
Add - PQ	CRIPP-VQA	Aloe*+BERT
Add - PO	CRIPP-VQA	Aloe*+BERT
Counterfactual Planning	CRIPP-VQA	Aloe*+BERT