A Deep Reinforcement Learning Algorithm Based on Adaptive Exploration

MAO Jian-huan; YIN Lu-jia

MAO Jian-huan, YIN Lu-jia. A Deep Reinforcement Learning Algorithm Based on Adaptive Exploration[J]. Microelectronics & Computer, 2016, 33(6): 139-142.

Citation:

MAO Jian-huan, YIN Lu-jia. A Deep Reinforcement Learning Algorithm Based on Adaptive Exploration[J]. Microelectronics & Computer, 2016, 33(6): 139-142.

Citation:

MAO Jian-huan, YIN Lu-jia. A Deep Reinforcement Learning Algorithm Based on Adaptive Exploration[J]. Microelectronics & Computer, 2016, 33(6): 139-142.

A Deep Reinforcement Learning Algorithm Based on Adaptive Exploration

Abstract

Abstract

To find a balance between exploration and exploitation, this paper proposes a VDBE(Value-Difference Based Exploration) based algorithm. The algorithm proposes a state-based control strategy depends on the value difference. In order to achieve the ideal exploration/exploitation behavior state, agent takes positive actions to explore environments in the initial stage of learning when agent is unfamiliar with surrounding environment. As learning time goes on and agent is more familiar with surrounding, it gradually reduces the exploration rate.

FullText(HTML)

References (11)

Relative Articles

Supplements (0)

Cited By

Turn off MathJax

Article Contents

A Deep Reinforcement Learning Algorithm Based on Adaptive Exploration

Abstract

Catalog

Export File

Citation

Format

Content