TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Action Recognition	HMDB-51	MARS+RGB+Flow (64 frames, Kinetics pretrained)	Average accuracy of 3 splits	80.9	# 18
Action Classification	Kinetics-400	MARS+RGB+Flow (16 frames)	Acc@1	68.9	# 175
Action Classification	Kinetics-400	MARS+RGB+Flow (64 frames)	Acc@1	74.9	# 152
Action Classification	MiniKinetics	MARS+RGB+Flow (16 frames)	Top-1 Accuracy	73.5	# 1
Action Recognition	Something-Something V1	MARS+RGB+Flow (64 frames, Kinetics pretrained)	Top 1 Accuracy	53.0	# 38
Action Recognition	Something-Something V1	MARS+RGB+Flow (16 frames, Kinetics pretrained)	Top 1 Accuracy	40.4	# 72
Action Recognition	UCF101	MARS+RGB+Flow (64 frames, Kinetics pretrained)	3-fold Accuracy	97.8	# 11
Action Recognition	UCF101	MARS+RGB+Flow (16 frames)	3-fold Accuracy	95.8	# 38

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mars-motion-augmented-rgb-stream-for-action/action-classification-on-minikinetics)](https://paperswithcode.com/sota/action-classification-on-minikinetics?p=mars-motion-augmented-rgb-stream-for-action)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mars-motion-augmented-rgb-stream-for-action/action-recognition-in-videos-on-ucf101)](https://paperswithcode.com/sota/action-recognition-in-videos-on-ucf101?p=mars-motion-augmented-rgb-stream-for-action)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mars-motion-augmented-rgb-stream-for-action/action-recognition-in-videos-on-hmdb-51)](https://paperswithcode.com/sota/action-recognition-in-videos-on-hmdb-51?p=mars-motion-augmented-rgb-stream-for-action)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mars-motion-augmented-rgb-stream-for-action/action-recognition-in-videos-on-something-1)](https://paperswithcode.com/sota/action-recognition-in-videos-on-something-1?p=mars-motion-augmented-rgb-stream-for-action)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mars-motion-augmented-rgb-stream-for-action/action-classification-on-kinetics-400)](https://paperswithcode.com/sota/action-classification-on-kinetics-400?p=mars-motion-augmented-rgb-stream-for-action)`

MARS: Motion-Augmented RGB Stream for Action Recognition

CVPR 2019 · Nieves Crasto, Philippe Weinzaepfel, Karteek Alahari, Cordelia Schmid

Most state-of-the-art methods for action recognition consist of a two-stream architecture with 3D convolutions: an appearance stream for RGB frames and a motion stream for optical flow frames. Although combining flow with RGB improves the performance, the cost of computing accurate optical flow is high, and increases action recognition latency. This limits the usage of two-stream approaches in real-world applications requiring low latency. In this paper, we introduce two learning approaches to train a standard 3D CNN, operating on RGB frames, that mimics the motion stream, and as a result avoids flow computation at test time. First, by minimizing a feature-based loss compared to the Flow stream, we show that the network reproduces the motion stream with high fidelity. Second, to leverage both appearance and motion information effectively, we train with a linear combination of the feature-based loss and the standard cross-entropy loss for action recognition. We denote the stream trained using this combined loss as Motion-Augmented RGB Stream (MARS). As a single stream, MARS performs better than RGB or Flow alone, for instance with 72.7% accuracy on Kinetics compared to 72.0% and 65.6% with RGB and Flow streams respectively.
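The combined objective described in the abstract can be sketched in a few lines of plain Python. This is a minimal illustration, not the authors' implementation: the abstract only says the loss is a linear combination of a feature-based loss and cross-entropy, so the use of mean squared error for the feature term, the weight `alpha`, and the function names `feature_loss`, `cross_entropy`, and `combined_loss` are all assumptions made for this example. The Flow-stream features are treated as fixed targets produced by an already trained network.

```python
import math


def feature_loss(student_feats, teacher_feats):
    """Mean squared error between RGB-stream and Flow-stream features.

    Assumption: the abstract does not name the distance; MSE is used here
    purely for illustration.
    """
    if len(student_feats) != len(teacher_feats):
        raise ValueError("feature vectors must have the same length")
    squared = ((s - t) ** 2 for s, t in zip(student_feats, teacher_feats))
    return sum(squared) / len(student_feats)


def cross_entropy(logits, label):
    """Softmax cross-entropy for a single example (log-sum-exp for stability)."""
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[label]


def combined_loss(logits, label, student_feats, teacher_feats, alpha=1.0):
    """Cross-entropy plus a weighted feature-matching term.

    `alpha` is a hypothetical weight; alpha = 0 reduces to ordinary
    classification training of the RGB stream.
    """
    return cross_entropy(logits, label) + alpha * feature_loss(
        student_feats, teacher_feats
    )


if __name__ == "__main__":
    # Toy values: two classes, two-dimensional features.
    loss = combined_loss(
        logits=[0.0, 0.0],
        label=0,
        student_feats=[1.0, 3.0],
        teacher_feats=[0.0, 1.0],
        alpha=2.0,
    )
    print(loss)
```

Training with the feature term alone corresponds to the first approach in the abstract (mimicking the motion stream); adding the cross-entropy term gives the combined loss used for the stream the authors call MARS. In either case only RGB frames are needed at test time.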

Code

craston/MARS (160 GitHub stars)

Tasks

Action Classification

Action Recognition

Optical Flow Estimation

Temporal Action Localization

Datasets

UCF101

Kinetics

HMDB51

Kinetics 400

Something-Something V1

Results from the Paper

Ranked #1 on Action Classification on MiniKinetics
