A DQN, or Deep Q-Network, approximates the action-value function (Q-function) in a Q-Learning framework with a neural network. In the Atari Games case, the network takes several stacked frames of the game as input and outputs a Q-value for each possible action.
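For concreteness, here is a minimal sketch of such a network in PyTorch (the page itself shows no code), loosely following the architecture described in the cited paper; the 84x84 input resolution and 4-frame stack are assumptions:

```python
import torch.nn as nn


class QNetwork(nn.Module):
    """Convolutional Q-network: stacked frames in, one Q-value per action out."""

    def __init__(self, n_actions: int, in_frames: int = 4):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_frames, 16, kernel_size=8, stride=4),  # 84x84 -> 20x20
            nn.ReLU(),
            nn.Conv2d(16, 32, kernel_size=4, stride=2),         # 20x20 -> 9x9
            nn.ReLU(),
            nn.Flatten(),
            nn.Linear(32 * 9 * 9, 256),
            nn.ReLU(),
            nn.Linear(256, n_actions),  # Q(s, a) for every action a
        )

    def forward(self, x):
        return self.net(x)
```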
It is usually used in conjunction with Experience Replay: episode steps are stored in a replay memory, and samples are drawn from it at random for off-policy learning. This breaks up the autocorrelation that online learning would introduce and makes the problem look more like a supervised learning problem. Additionally, the Q-Network is usually optimized towards a frozen target network whose weights are copied from the online network every $k$ steps (where $k$ is a hyperparameter); this makes training more stable by preventing short-term oscillations against a moving target.
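The interplay between the replay memory and the frozen target network can be sketched as follows. This is an illustrative PyTorch sketch, not code from the paper: `ReplayBuffer` and `dqn_update` are hypothetical names, and the hyperparameters are typical defaults rather than published values.

```python
import random
from collections import deque

import torch
import torch.nn.functional as F


class ReplayBuffer:
    """Fixed-size memory of (s, a, r, s', done) transitions, sampled uniformly."""

    def __init__(self, capacity: int = 100_000):
        self.buffer = deque(maxlen=capacity)

    def push(self, s, a, r, s_next, done):
        self.buffer.append((s, a, r, s_next, done))

    def sample(self, batch_size: int):
        batch = random.sample(self.buffer, batch_size)
        s, a, r, s_next, done = zip(*batch)
        return (torch.stack(s),
                torch.tensor(a),
                torch.tensor(r, dtype=torch.float32),
                torch.stack(s_next),
                torch.tensor(done, dtype=torch.float32))


def dqn_update(online_net, target_net, optimizer, buffer,
               batch_size: int = 32, gamma: float = 0.99):
    """One gradient step toward the frozen-target TD objective."""
    s, a, r, s_next, done = buffer.sample(batch_size)

    # Q(s, a) from the online network, for the actions actually taken.
    q = online_net(s).gather(1, a.unsqueeze(1)).squeeze(1)

    # Bootstrapped target r + gamma * max_a' Q_target(s', a');
    # no gradient flows through the frozen target network.
    with torch.no_grad():
        q_next = target_net(s_next).max(dim=1).values
        target = r + gamma * (1.0 - done) * q_next

    loss = F.mse_loss(q, target)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()


# Every k steps, copy the online weights into the target network:
# if step % k == 0:
#     target_net.load_state_dict(online_net.state_dict())
```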
Source: Playing Atari with Deep Reinforcement Learning
Task | Papers | Share |
---|---|---|
Reinforcement Learning (RL) | 280 | 38.57% |
Atari Games | 65 | 8.95% |
Decision Making | 34 | 4.68% |
Management | 20 | 2.75% |
Multi-agent Reinforcement Learning | 15 | 2.07% |
Efficient Exploration | 14 | 1.93% |
OpenAI Gym | 13 | 1.79% |
Autonomous Driving | 11 | 1.52% |
Continuous Control | 9 | 1.24% |
Component | Type |
---|---|
Convolution | Convolutions |
Dense Connections | Feedforward Networks |
Q-Learning | Off-Policy TD Control |