Action Value Function Approximation Based on Radial Basis Function Network for Reinforcement Learning

Derhami, Vali; Mehrabi, Omid

Volume 5, Issue 1 (Journal of Control, V.5, N.1 Spring 2011) JoC 2011, 5(1): 50-63 | Back to browse issues page

‎ 20.1001.1.20088345.1390.5.1.4.9

Mendeley

Zotero

RefWorks

Derhami V, Mehrabi O. Action Value Function Approximation Based on Radial Basis Function Network for Reinforcement Learning. JoC 2011; 5 (1) :50-63
URL: http://joc.kntu.ac.ir/article-1-95-en.html

Action Value Function Approximation Based on Radial Basis Function Network for Reinforcement Learning

Vali Derhami ^*¹

, Omid Mehrabi

Abstract: (16197 Views)

One of the challenges encountered in the application of classical reinforcement learning methods to real-control problems is the curse of dimensiality. In order to overcome this difficulty, hybrid algorithms that combine reinforcement learning with various function approximators have attracted many research interests. In this paper, a novel Neural Reinforcement Learning (NRL) scheme which is based on Sarsa learning and Radial Basis Function (RBF) network is proposed. The RBF network is used to approximate the Action Value Function (AVF) on-line. The inputs of RBF network are state-action pairs of system and its outputs are corresponding approximated AVF. As the necessary condition for the convergence of NSL to the optimal task performance, the existence of stationary points for NSL which coincide with the fixed points of Approximate Action Value Iteration (AAVI) are proved. The validity of the proposed algorithm is tested through simulation examples: mountain car control task, and acrobot problem. Overall results demonstrate that our algorithm can effectively improve convergence speed and the efficiency of experience exploitation.

Keywords: Neural reinforcement learning, Critic-only architecture, RBF neural network, Sarsa, stationary points.

Full-Text [PDF 538 kb] (5476 Downloads)

Type of Article: Research paper | Subject: Special
Received: 2014/06/16 | Accepted: 2014/06/16 | Published: 2014/06/16

Send email to the article author

Rights and permissions
	This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Designed & Developed by : Yektaweb

Related Websites

Site Keywords