Journal of Control

fa همزمانسازی بهینه برخط سیستم های چندعاملی غیر خطی با دینامیک های نامعلوم Online Optimal Synchronization of Nonlinear Multi-agent Systems under Unknown Dynamics تخصصي Special پژوهشي Research paper در این مقاله، الگوریتم بهینه توزیع شده تطبیقی برخط برای همزمانسازی عامل های غیرخطی یک سیستم چندعاملی با دینامیک های نامعلوم به عامل رهبر  بر اساس تکنیک های برنامه ریزی پویای تقریبی و شناساگرهای شبکه های عصبی ارایه شده&rlm; است. الگوریتم پیشنهاد شده به یادگیری حل برخط معادلات همیلتون-جاکوبی تزویج شده<a href="#_ftn1" name="_ftnref1" title="">[1]</a> (CHJ) تحت دینامیک های نامعلوم پرداخته است. هر عامل جهت یادگیری سیاست بهینه محلی از ساختار عملگر-نقاد بهره برده و دینامیک نامعلوم هر عامل نیز با به کارگیری یک تقریبگر شبکه عصبی، تقریب زده شده است. شناسایی دینامیک های نامعلوم با استفاده از قانون تکرار تجربیات انجام شده است به طوری که از اطلاعات ثبت شده به همراه داده های لحظه ای برای انطباق وزن های شبکه عصبی شناساگر دینامیک عامل ها، استفاده شده است. در حالی که وزن های تقریبگرهای دینامیک و شبکه های عملگر-نقاد به صورت همزمان در حال انطباق هستند، کرانداری تمامی سیگنال های حلقه بسته توسط تئوری لیاپانوف تضمین شده است.  در انتها صحت الگوریتم پیشنهاد شده با ذکر نتایج شبیه سازی، نشان داده شده است. <div>  <hr align="left" size="1" width="33%" > <div id="ftn1" style="text-align: justify;"><a href="#_ftnref1" name="_ftn1" title="">[1]</a> Coupled Hamilton-Jacobi</div> </div> In this paper an online optimal distributed algorithm is introduced for multi-agent systems synchronization under unknown dynamics based on approximate dynamic programming and neural networks. Every agent has employed an actor-critic structure to learn its distributed optimal policy and the unknown dynamics of every agent is identified by employing a neural network approximator. The unknown dynamics are identified based on the experience replay technique where the recorded data and current data are used to adopt the approximators weights. The introduced algorithm learns the solution of coupled Hamilton-Jacobi equations under unknown dynamics in an online fashion. While the weights of the identifiers and actor-critic approximators are being tuned, the boundedness of the closed loop system signals are assured using Lyapunov theory. The effectiveness of the proposed algorithm is shown through the simulation results. برنامه ریزی پویای تقریبی, تقریبگرهای عملگر-نقاد, سیستم های چندعاملی, کنترل بهینه توزیع شده, همزمانسازی. Actor-Critic Approximators, Approximate Dynamic Programming, Multi-Agent Systems, Optimal Distributed Control, Synchronization. 13 28 http://joc.kntu.ac.ir/browse.php?a_code=A-10-178-2&slc_lang=fa&sid=1 Farzaneh Tatari فرزانه تاتاری ftatari@semnan.ac.ir 10031947532846005716 10031947532846005716 Yes Electrical engineering department, Electrical and Computer engineering faculty, Semnan university, Semnan, Iran سمنان، دانشگاه سمنان، دانشکده مهندسی برق و کامپیوتر، گروه مهندسی برق Mohammad-B. Naghibi-S. محمدباقر نقیبی سیستانی mb-naghibi@um.ac.ir 10031947532846005717 10031947532846005717 No Electrical Engineering Department, Ferdowsi University of Mashhad, Mashhad, Iran مشهد، دانشگاه فردوسی مشهد، دانشکده مهندسی، گروه مهندسی برق