Bayesian Continuous-State Reinforcement Learning

Saeed Amizadeh; Majid Nili Ahmadabadi; Caro Lucas

Bayesian Continuous-State Reinforcement Learning

محل انتشار: دوازدهمین کنفرانس سالانه انجمن کامپیوتر ایران

سال انتشار: 1385

نوع سند: مقاله کنفرانسی

زبان: انگلیسی

مشاهده: 2,197

فایل این مقاله در 7 صفحه با فرمت PDF قابل دریافت می باشد

دریافت فایل کامل مقاله

صدور گواهی نمایه سازی
من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

https://civilica.com/doc/44512

شناسه ملی سند علمی:

ACCSI12_125

تاریخ نمایه سازی: 23 دی 1386

چکیده مقاله:

Continuous-State Reinforcement Learning (RL) has been recently favored because of the continuous nature of the real world RL problems and many theoretical approaches have been devised to handle the case. However, most of these methods presume that the structure of the agent's perceptual environment is fed to it. But this is not the case in many real situations. Inspired from the subjective view existing in the Cognitive Constructivist learning theory, in this paper, a new method is presented to discover and construct the structure of the environment in parallel with learning the optimal policy. To achieve these goals, the proposed approach incorporates the Bayesian formalism to or ganize the perceptual space while it tries to learn the optimal behavior using a Q-learning-like learning algorithm. These characteristics as a whole define a Reinforcement Learning algorithm which is developed based on a mixture of Cognitive Constructivism and traditional Behaviorism ideas. Simulation results demonstrate the viability and efficiency of the proposed algorithm on continuous state RL problems.

کلیدواژه ها:

Bayesian Framework ، Continuous-State Reinforcement Learning ، Cognitive Constructivism ، Mixture Densities

نویسندگان

Saeed Amizadeh

Control and Intelligent Processing Center of Excellence (CIPCE), School of Electrical and Computer Engineering, University of Tehran, Tehran, Iran

Majid Nili Ahmadabadi

Control and Intelligent Processing Center of Excellence (CIPCE), School of Electrical and Computer Engineering, University of Tehran, Tehran, Iran، School of Cognitive Sciences,Institute for studies in theoretical Physics and Mathematics, Tehran, Iran

Caro Lucas

Control and Intelligent Processing Center of Excellence (CIPCE), School of Electrical and Computer Engineering, University of Tehran, Tehran, Iran، School of Cognitive Sciences,Institute for studies in theoretical Physics and Mathematics, Tehran, Iran

مراجع و منابع این مقاله:

لیست زیر مراجع و منابع استفاده شده در این مقاله را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود مقاله لینک شده اند :

R. S. Sutton and A. G. Barto, R einforcement Learning: ...
S. Mahadevan and J. Connell, "Automatic programming of behavio r-based ...

نمایش کامل مراجع