Shiyao DING


Learning in Two-Player Matrix Games by Policy Gradient Lagging Anchor
Shiyao DING, Toshimitsu USHIO
Publication:   
Publication Date: 2019/04/01
Vol. E102-A  No. 4  pp. 708-711
Type of Manuscript:  LETTER
Category: Mathematical Systems Science
Keyword: reinforcement learning, policy gradient, multi-agent systems, matrix game