Keyword : Q-learning


Convergence of the Q-ae Learning on Deterministic MDPs and Its Efficiency on the Stochastic Environment
Gang ZHAO Shoji TATSUMI Ruoying SUN 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2000/09/25
Vol. E83-A  No. 9 ; pp. 1786-1795
Type of Manuscript:  PAPER
Category: Algorithms and Data Structures
Keyword: 
Q-learningQ-ae learningexplorationdynamic programmingplanning
 Summary | Full Text:PDF(871.7KB)