IMPROVING MONTE CARLO TREE SEARCH BY COMBINING RAVE AND QUALITY-BASED REWARDS ALGORITHMS

Masoud Masoumi Moghadam; Mohammad Pourmahmood Aghababa; Jamshid Bagherzadeh

IMPROVING MONTE CARLO TREE SEARCH BY COMBINING RAVE AND QUALITY-BASED REWARDS ALGORITHMS

محل انتشار: چهارمین کنفرانس بین المللی مطالعات نوین در علوم کامپیوتر و فناوری اطلاعات

سال انتشار: 1396

نوع سند: مقاله کنفرانسی

زبان: انگلیسی

مشاهده: 984

فایل این مقاله در 10 صفحه با فرمت PDF قابل دریافت می باشد

دریافت فایل کامل مقاله

صدور گواهی نمایه سازی
من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

https://civilica.com/doc/779194

شناسه ملی سند علمی:

CONFITC04_172

تاریخ نمایه سازی: 6 مهر 1397

چکیده مقاله:

Monte-Carlo Tree Search is a state-of-the-art method for building intelligent agents ingames and has been focus of many researchs through past decade. By using thismethod, the agents are able to master the games through building a search tree basedon samples gathered by randomized simulations. In most of the researchs, the rewardfrom simulations are discrete values representing final state of the games (win, loss,draw), e.g., r ∈ {-1, 0, 1}. In this paper, we introduce a method which modifies rewardfor each playout. Then it backpropagates the reward through UCT and AMAF values.RAVE algorithm is used to evaluate the nodes more accurately in each tree breadth.We implemented the algorithm along with Last-Good-Reply, Decisive-move andPoolrave heuristics. In the end we used leaf parallelization in order to increase thesamples gathered by simulations. All implementations are examined in the game ofHEX in 9 × 9 board. We show the proposed method can improve the performance inthe domain discussed.

کلیدواژه ها:

Monte-Carlo Tree Search ، Game of HEX ، Reinforcement Learning ، Heuristics

نویسندگان

Masoud Masoumi Moghadam

M.Sc Student, Urmia University of Technology

Mohammad Pourmahmood Aghababa

Associate Professor, Urmia University of Technology

Jamshid Bagherzadeh

Associate Professor, Urmia University