Keyword : reinforcement learning


Multi-Autonomous Robot Enhanced Ad-Hoc Network under Uncertain and Vulnerable Environment
Ming FENG Lijun QIAN Hao XU 
Publication:   IEICE TRANSACTIONS on Communications
Publication Date: 2019/10/01
Vol. E102-B  No. 10 ; pp. 1925-1932
Type of Manuscript:  INVITED PAPER (Special Section on Exploring Drone for Mobile Sensing, Coverage and Communications: Theory and Applications)
Category: 
Keyword: 
reinforcement learning, game theory, mobile ad-hoc network, mission oriented metrics, multi-agent systems

Learning in Two-Player Matrix Games by Policy Gradient Lagging Anchor
Shiyao DING Toshimitsu USHIO 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2019/04/01
Vol. E102-A  No. 4 ; pp. 708-711
Type of Manuscript:  LETTER
Category: Mathematical Systems Science
Keyword: 
reinforcement learning, policy gradient, multi-agent systems, matrix game

A Robot Model That Obeys a Norm of a Human Group by Participating in the Group and Interacting with Its Members
Yotaro FUSE Hiroshi TAKENOUCHI Masataka TOKUMARU 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2019/01/01
Vol. E102-D  No. 1 ; pp. 185-194
Type of Manuscript:  PAPER
Category: Kansei Information Processing, Affective Information Processing
Keyword: 
social robot, group norm, reinforcement learning, human-robot interaction

Incremental Estimation of Natural Policy Gradient with Relative Importance Weighting
Ryo IWAKI Hiroki YOKOYAMA Minoru ASADA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2018/09/01
Vol. E101-D  No. 9 ; pp. 2346-2355
Type of Manuscript:  PAPER
Category: Artificial Intelligence, Data Mining
Keyword: 
reinforcement learning, natural policy gradient, adaptive step size

A Real-Time Subtask-Assistance Strategy for Adaptive Services Composition
Li QUAN Zhi-liang WANG Xin LIU 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2018/05/01
Vol. E101-D  No. 5 ; pp. 1361-1369
Type of Manuscript:  PAPER
Category: Data Engineering, Web Information Systems
Keyword: 
service composition, reinforcement learning, Q-learning, subtask-assistance

Optimal Digital Control with Uncertain Network Delay of Linear Systems Using Reinforcement Learning
Taishi FUJITA Toshimitsu USHIO 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2016/02/01
Vol. E99-A  No. 2 ; pp. 454-461
Type of Manuscript:  Special Section PAPER (Special Section on Mathematical Systems Science and its Applications)
Category: 
Keyword: 
reinforcement learning, adaptive control, output feedback control, optimal control, linear system

Sarsa Learning Based Route Guidance System with Global and Local Parameter Strategy
Feng WEN Xingqiao WANG 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2015/12/01
Vol. E98-A  No. 12 ; pp. 2686-2693
Type of Manuscript:  PAPER
Category: Intelligent Transport System
Keyword: 
reinforcement learning, Sarsa learning, global and local parameter strategy, route guidance

Adaptive Q-Learning Cell Selection Method for Open-Access Femtocell Networks: Multi-User Case
Chaima DHAHRI Tomoaki OHTSUKI 
Publication:   IEICE TRANSACTIONS on Communications
Publication Date: 2014/08/01
Vol. E97-B  No. 8 ; pp. 1679-1688
Type of Manuscript:  PAPER
Category: Network Management/Operation
Keyword: 
open access femtocell networks, handover, reinforcement learning, Q-learning, fuzzy logic

An Intelligent Fighting Videogame Opponent Adapting to Behavior Patterns of the User
Koichi MORIYAMA Simón Enrique ORTIZ BRANCO Mitsuhiro MATSUMOTO Ken-ichi FUKUI Satoshi KURIHARA Masayuki NUMAO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2014/04/01
Vol. E97-D  No. 4 ; pp. 842-851
Type of Manuscript:  PAPER
Category: Information Network
Keyword: 
entertainment computing, adapting agent, pattern matching, reinforcement learning

Heuristic Function Negotiation for Markov Decision Process and Its Application in UAV Simulation
Fengfei ZHAO Zheng QIN Zhuo SHAO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2014/01/01
Vol. E97-D  No. 1 ; pp. 89-97
Type of Manuscript:  PAPER
Category: Artificial Intelligence, Data Mining
Keyword: 
Markov decision processes, heuristic function, reinforcement learning, UAV

An Improved Model of Ant Colony Optimization Using a Novel Pheromone Update Strategy
Pooia LALBAKHSH Bahram ZAERI Ali LALBAKHSH 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2013/11/01
Vol. E96-D  No. 11 ; pp. 2309-2318
Type of Manuscript:  PAPER
Category: Fundamentals of Information Systems
Keyword: 
ant colony optimization, ant colony system, ant-miner, classification rule mining, learning automata, reinforcement learning

Multi-Channel Cooperative Spectrum Sensing in Cognitive Radio Networks
Ji-Hoon LEE Woo-Jin SONG 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2013/09/01
Vol. E96-A  No. 9 ; pp. 1909-1913
Type of Manuscript:  LETTER
Category: Communication Theory and Signals
Keyword: 
cognitive radio, cooperative spectrum sensing, multi-channel, cooperator selection, reinforcement learning

Artist Agent: A Reinforcement Learning Approach to Automatic Stroke Generation in Oriental Ink Painting
Ning XIE Hirotaka HACHIYA Masashi SUGIYAMA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2013/05/01
Vol. E96-D  No. 5 ; pp. 1134-1144
Type of Manuscript:  PAPER
Category: Artificial Intelligence, Data Mining
Keyword: 
painterly rendering, stroke-based rendering, reinforcement learning, policy gradient

Reinforcement Learning of Optimal Supervisor for Discrete Event Systems with Different Preferences
Koji KAJIWARA Tatsushi YAMASAKI 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2013/02/01
Vol. E96-A  No. 2 ; pp. 525-531
Type of Manuscript:  Special Section PAPER (Special Section on Mathematical Systems Science and its Applications)
Category: Concurrent Systems
Keyword: 
discrete event systems, supervisory control, decentralized system, reinforcement learning, optimal control

Multi-Task Approach to Reinforcement Learning for Factored-State Markov Decision Problems
Jaak SIMM Masashi SUGIYAMA Hirotaka HACHIYA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2012/10/01
Vol. E95-D  No. 10 ; pp. 2426-2437
Type of Manuscript:  PAPER
Category: Artificial Intelligence, Data Mining
Keyword: 
reinforcement learning, multi-task learning, transfer learning, factored state models, Markov decision process

An Adaptive Method to Acquire QoS Class Allocation Policy Based on Reinforcement Learning
Nagao OGINO Hajime NAKAMURA 
Publication:   IEICE TRANSACTIONS on Communications
Publication Date: 2012/09/01
Vol. E95-B  No. 9 ; pp. 2828-2837
Type of Manuscript:  PAPER
Category: Network
Keyword: 
multi-domain path-based network, end-to-end QoS guarantee, adaptive QoS class allocation, acquisition of allocation policy, reinforcement learning

Option-Based Monte Carlo Algorithm with Conditioned Updating to Learn Conflict-Free Task Allocation in Transport Applications
Alex VALDIVIELSO Toshiyuki MIYAMOTO 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2011/12/01
Vol. E94-A  No. 12 ; pp. 2810-2820
Type of Manuscript:  Special Section PAPER (Special Section on Mathematical Systems Science and its Applications)
Category: 
Keyword: 
option framework, reinforcement learning, task allocation, conflict prevention, multi-car elevators

An Adaptive Cooperative Spectrum Sensing Scheme Using Reinforcement Learning for Cognitive Radio Sensor Networks
Thuc KIEU-XUAN Insoo KOO 
Publication:   IEICE TRANSACTIONS on Communications
Publication Date: 2011/05/01
Vol. E94-B  No. 5 ; pp. 1456-1459
Type of Manuscript:  LETTER
Category: Network
Keyword: 
cognitive radio sensor network, cooperative spectrum sensing, decision fusion, reinforcement learning

Least Absolute Policy Iteration--A Robust Approach to Value Function Approximation
Masashi SUGIYAMA Hirotaka HACHIYA Hisashi KASHIMA Tetsuro MORIMURA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/09/01
Vol. E93-D  No. 9 ; pp. 2555-2565
Type of Manuscript:  PAPER
Category: Artificial Intelligence, Data Mining
Keyword: 
reinforcement learning, value function approximation, least-squares policy iteration, outlier, ℓ1-loss function, linear programming

Synchronization of Chaotic Systems without Direct Connections Using Reinforcement Learning
Norihisa SATO Masaharu ADACHI 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2009/04/01
Vol. E92-A  No. 4 ; pp. 958-965
Type of Manuscript:  Special Section PAPER (Special Section on Advanced Technologies Emerging Mainly from the 21st Workshop on Circuits and Systems in Karuizawa)
Category: 
Keyword: 
chaos synchronization, reinforcement learning

A Nonlinear Approach to Robust Routing Based on Reinforcement Learning with State Space Compression and Adaptive Basis Construction
Hideki SATOH 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2008/07/01
Vol. E91-A  No. 7 ; pp. 1733-1740
Type of Manuscript:  PAPER
Category: Nonlinear Problems
Keyword: 
robust routing, reinforcement learning, multivariate analysis, function approximation

Reinforcement Learning with Orthonormal Basis Adaptation Based on Activity-Oriented Index Allocation
Hideki SATOH 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2008/04/01
Vol. E91-A  No. 4 ; pp. 1169-1176
Type of Manuscript:  PAPER
Category: Nonlinear Problems
Keyword: 
orthonormal basis, function approximation, non-linear, reinforcement learning, activity

A State Space Compression Method Based on Multivariate Analysis for Reinforcement Learning in High-Dimensional Continuous State Spaces
Hideki SATOH 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2006/08/01
Vol. E89-A  No. 8 ; pp. 2181-2191
Type of Manuscript:  PAPER
Category: Nonlinear Problems
Keyword: 
reinforcement learning, curse of dimensionality, multivariate analysis, approximation, nonlinear

Exploiting Intelligence in Fighting Action Games Using Neural Networks
Byeong Heon CHO Sung Hoon JUNG Yeong Rak SEONG Ha Ryoung OH 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2006/03/01
Vol. E89-D  No. 3 ; pp. 1249-1256
Type of Manuscript:  PAPER
Category: Biocybernetics, Neurocomputing
Keyword: 
computer game, artificial intelligence, neural network, reinforcement learning, adaptive system

Decentralized Supervisory Control of Discrete Event Systems Based on Reinforcement Learning
Tatsushi YAMASAKI Toshimitsu USHIO 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2005/11/01
Vol. E88-A  No. 11 ; pp. 3045-3050
Type of Manuscript:  Special Section PAPER (Special Section on Concurrent/Hybrid Systems: Theory and Applications)
Category: 
Keyword: 
discrete event systems, decentralized control, supervisory control, reinforcement learning, optimal control

Analysis on the Parameters of the Evolving Artificial Agents in Sequential Bargaining Game
Seok-Cheol CHANG Joung-Il YUN Ju-Sang LEE Sang-Uk LEE Nitaigour-Premchand MAHALIK Byung-Ha AHN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2005/09/01
Vol. E88-D  No. 9 ; pp. 2098-2101
Type of Manuscript:  Special Section LETTER (Special Section on Software Agent and Its Applications)
Category: 
Keyword: 
sequential bargaining game, artificial agent, statistics analysis, genetic algorithm, reinforcement learning

CHQ: A Multi-Agent Reinforcement Learning Scheme for Partially Observable Markov Decision Processes
Hiroshi OSADA Satoshi FUJITA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2005/05/01
Vol. E88-D  No. 5 ; pp. 1004-1011
Type of Manuscript:  PAPER
Category: Artificial Intelligence and Cognitive Science
Keyword: 
multi-agent system, reinforcement learning, partially observable MDP, Q-learning

On the Effects of Domain Size and Complexity in Empirical Distribution of Reinforcement Learning
Kazunori IWATA Kazushi IKEDA Hideaki SAKAI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2005/01/01
Vol. E88-D  No. 1 ; pp. 135-142
Type of Manuscript:  PAPER
Category: Artificial Intelligence and Cognitive Science
Keyword: 
reinforcement learning, Markov decision process, Lempel-Ziv coding, domain size, stochastic complexity

An Approach to the Piano Mover's Problem Using Hierarchic Reinforcement Learning
Yuko ISHIWAKA Tomohiro YOSHIDA Hiroshi YOKOI Yukinori KAKAZU 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2004/08/01
Vol. E87-D  No. 8 ; pp. 2106-2113
Type of Manuscript:  PAPER
Category: Distributed Cooperation and Agents
Keyword: 
reinforcement learning, piano mover's problem, heterogeneous multi-agent, find-path problem, obstacle avoidance

Multiagent Cooperating Learning Methods by Indirect Media Communication
Ruoying SUN Shoji TATSUMI Gang ZHAO 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2003/11/01
Vol. E86-A  No. 11 ; pp. 2868-2878
Type of Manuscript:  PAPER
Category: Neural Networks and Bioengineering
Keyword: 
reinforcement learning, ant colony system, multiagent cooperating, indirect media communication

Does Reinforcement Learning Simulate Threshold Public Goods Games?: A Comparison with Subject Experiments
Atsushi IWASAKI Shuichi IMURA Sobei H. ODA Itsuo HATONO Kanji UEDA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2003/08/01
Vol. E86-D  No. 8 ; pp. 1335-1343
Type of Manuscript:  Special Section PAPER (Special Issue on Software Agent and Its Applications)
Category: 
Keyword: 
reinforcement learning, agent-based computational economics, experimental economics, public goods

An Intelligent Stock Trading System Based on Reinforcement Learning
Jae Won LEE Sung-Dong KIM Jongwoo LEE Jinseok CHAE 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2003/02/01
Vol. E86-D  No. 2 ; pp. 296-305
Type of Manuscript:  PAPER
Category: Artificial Intelligence, Cognitive Science
Keyword: 
reinforcement learning, TD algorithm, stock selection, neural network, multiple agents

Learning of Virtual Words Utilized in Negotiation Process between Agents
Hiroyuki IIZUKA Keiji SUZUKI Masahito YAMAMOTO Azuma OHUCHI 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2000/06/25
Vol. E83-A  No. 6 ; pp. 1075-1082
Type of Manuscript:  Special Section PAPER (Special Section of Papers Selected from 1999 International Technical Conference on Circuits/Systems, Computers and Communications (ITC-CSCC'99))
Category: 
Keyword: 
price negotiation, reinforcement learning, agent-based simulation

Controlling Multiple Cranes Using Multi-Agent Reinforcement Learning: Emerging Coordination among Competitive Agents
Sachiyo ARAI Kazuteru MIYAZAKI Shigenobu KOBAYASHI 
Publication:   IEICE TRANSACTIONS on Communications
Publication Date: 2000/05/25
Vol. E83-B  No. 5 ; pp. 1039-1047
Type of Manuscript:  Special Section PAPER (IEICE/IEEE Joint Special Issue on Autonomous Decentralized Systems)
Category: Real Time Control
Keyword: 
reinforcement learning, multi-agent system, profit-sharing, conflict resolution

A Constructive Compound Neural Networks. II Application to Artificial Life in a Competitive Environment
Jianjun YAN Naoyuki TOKUDA Juichi MIYAMICHI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2000/04/25
Vol. E83-D  No. 4 ; pp. 845-856
Type of Manuscript:  PAPER
Category: Artificial Intelligence, Cognitive Science
Keyword: 
neural networks construction, artificial life, fuzzy logic, genetic algorithm, reinforcement learning

Strategy Acquisition for the Game "Othello" Based on Reinforcement Learning
Taku YOSHIOKA Shin ISHII Minoru ITO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 1999/12/25
Vol. E82-D  No. 12 ; pp. 1618-1626
Type of Manuscript:  PAPER
Category: Bio-Cybernetics and Neurocomputing
Keyword: 
reinforcement learning, normalized Gaussian network, Othello, min-max strategy

Learning the Balance between Exploration and Exploitation via Reward
Tetsuya YOSHIDA Koichi HORI Shinichi NAKASUKA 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1999/11/25
Vol. E82-A  No. 11 ; pp. 2538-2545
Type of Manuscript:  Special Section PAPER (Special Section on Concurrent Systems Technology)
Category: 
Keyword: 
multi-agent system, reinforcement learning, reward, exploration, exploitation

RTP-Q: A Reinforcement Learning System with Time Constraints Exploration Planning for Accelerating the Learning Rate
Gang ZHAO Shoji TATSUMI Ruoying SUN 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 1999/10/25
Vol. E82-A  No. 10 ; pp. 2266-2273
Type of Manuscript:  PAPER
Category: Artificial Intelligence and Knowledge
Keyword: 
reinforcement learning, planning, reacting, exploration, exploitation