Task	Dataset	Model	Metric Name	Metric Value	Global Rank
Image Classification	ImageNet	MnasNet-A3	Top 1 Accuracy	76.7%	# 829
Image Classification	ImageNet	MnasNet-A3	Number of params	5.2M	# 412
Image Classification	ImageNet	MnasNet-A3	Hardware Burden	None	# 1
Image Classification	ImageNet	MnasNet-A3	Operations per network pass	0.0403G	# 1
Image Classification	ImageNet	MnasNet-A3	GFLOPs	0.806	# 94
Image Classification	ImageNet	MnasNet-A2	Top 1 Accuracy	75.6%	# 868
Image Classification	ImageNet	MnasNet-A2	Number of params	4.8M	# 396
Image Classification	ImageNet	MnasNet-A2	GFLOPs	0.680	# 80
Image Classification	ImageNet	MnasNet-A1	Top 1 Accuracy	75.2%	# 879
Image Classification	ImageNet	MnasNet-A1	Number of params	3.9M	# 378
Image Classification	ImageNet	MnasNet-A1	GFLOPs	0.624	# 73

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mnasnet-platform-aware-neural-architecture/image-classification-on-imagenet)](https://paperswithcode.com/sota/image-classification-on-imagenet?p=mnasnet-platform-aware-neural-architecture)`

MnasNet: Platform-Aware Neural Architecture Search for Mobile

CVPR 2019 · Mingxing Tan, Bo Chen, Ruoming Pang, Vijay Vasudevan, Mark Sandler, Andrew Howard, Quoc V. Le

Designing convolutional neural networks (CNN) for mobile devices is challenging because mobile models need to be small and fast, yet still accurate. Although significant efforts have been dedicated to designing and improving mobile CNNs on all dimensions, it is very difficult to manually balance these trade-offs when there are so many architectural possibilities to consider. In this paper, we propose an automated mobile neural architecture search (MNAS) approach, which explicitly incorporates model latency into the main objective so that the search can identify a model that achieves a good trade-off between accuracy and latency. Unlike previous work, where latency is considered via another, often inaccurate proxy (e.g., FLOPS), our approach directly measures real-world inference latency by executing the model on mobile phones. To further strike the right balance between flexibility and search space size, we propose a novel factorized hierarchical search space that encourages layer diversity throughout the network. Experimental results show that our approach consistently outperforms state-of-the-art mobile CNN models across multiple vision tasks. On the ImageNet classification task, our MnasNet achieves 75.2% top-1 accuracy with 78ms latency on a Pixel phone, which is 1.8x faster than MobileNetV2 [29] with 0.5% higher accuracy and 2.3x faster than NASNet [36] with 1.2% higher accuracy. Our MnasNet also achieves better mAP quality than MobileNets for COCO object detection. Code is at https://github.com/tensorflow/tpu/tree/master/models/official/mnasnet
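
The abstract says latency is folded into the main search objective but does not give the objective's form. As a rough illustration only, the sketch below assumes a weighted-product reward, accuracy × (latency / target)^w, with a small negative exponent. The function name `latency_aware_reward`, the exponent value, the 80 ms target, and the two extra candidates are assumptions made for this example and are not taken from the text above; only the 75.2% / 78 ms pair comes from the abstract.

```python
# Minimal sketch of a latency-aware search reward.
# ASSUMPTION: the weighted-product form, the exponent w, the latency target,
# and the hypothetical candidates are illustrative, not from the abstract.


def latency_aware_reward(accuracy: float, latency_ms: float,
                         target_ms: float, w: float = -0.07) -> float:
    """Return accuracy * (latency_ms / target_ms) ** w.

    With a negative w, models slower than the target are penalized and
    models faster than the target receive a mild bonus.
    """
    if latency_ms <= 0 or target_ms <= 0:
        raise ValueError("latencies must be positive")
    return accuracy * (latency_ms / target_ms) ** w


if __name__ == "__main__":
    target_ms = 80.0  # assumed latency budget for the example
    candidates = {
        # (top-1 accuracy, measured latency in ms)
        "mnasnet_from_abstract": (0.752, 78.0),       # figures quoted in the abstract
        "hypothetical_slow_accurate": (0.760, 120.0),  # made-up candidate
        "hypothetical_fast_weak": (0.720, 50.0),       # made-up candidate
    }
    scores = {
        name: latency_aware_reward(acc, lat, target_ms)
        for name, (acc, lat) in candidates.items()
    }
    for name, score in sorted(scores.items(), key=lambda kv: -kv[1]):
        print(f"{name}: reward={score:.4f}")
```

In a real search, the latency argument would come from timing each candidate on the target phone rather than from a FLOPS estimate, which is the distinction the abstract draws.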

Code

tensorflow/tpu official

5,189

rwightman/pytorch-image-models

30,242

pytorch/vision

15,587

osmr/imgclsmob

2,927

See all 28 implementations

Tasks

Image Classification

Neural Architecture Search

Object Detection

Real-Time Object Detection

Datasets

MS COCO

ssd

Results from the Paper

Ranked #833 on Image Classification on ImageNet (using extra training data)

Methods

1x1 Convolution • Average Pooling • Batch Normalization • Convolution • Dense Connections • Depthwise Convolution • Depthwise Separable Convolution • Dropout • Global Average Pooling • Inverted Residual Block • Linear Warmup With Linear Decay • LSTM • MnasNet • Pointwise Convolution • Random Horizontal Flip • Random Resized Crop • ReLU • RMSProp • Sigmoid Activation • Softmax • Squeeze-and-Excitation Block • Tanh Activation • Weight Decay
