Journal of Control

fa کنترل بهینه تطبیقی برخط سیستم‌های دوخطی زمان پیوسته با دینامیک ناشناخته An online policy iteration for adaptive optimal control of unknown bilinear systems تخصصي Special پژوهشي Research paper <div style="text-align: justify;">طراحی کنترل‌کننده‌ی بهینه برای سیستم‌های دوخطی زمان پیوسته با معلوم بودن دینامیک سیستم طبق اصل بهینگی بلمن پیچیدگی محاسباتی بالایی دارد و عموماً از روش‌های تقریبی وابسته به دانستن دینامیک سیستم برای طراحی کنترل‌کننده استفاده می شود.‌ هنگامی‌که دینامیک سیستم نامعلوم است این مسئله بسیار پیچیده‌تر می‌شود. اولین چیزی که برای حل این مشکل به  نظر می‌رسد شناسایی سیستم دوخطی به کمک روش‌های شناسایی سیستم است. همان‌طور که می‌دانیم روش‌های شناسایی مدلی خطی شده بر اساس داده‌های ورودی  و خروجی سیستم در اختیار طراح قرار می‌دهد تا به سراغ طراحی کنترل‌کننده برود. در این مقاله  با استفاده از رویه‌ای برخط و تطبیقی، یک روش تکراری جدید به‌منظور طراحی کنترل‌کننده بهینه برای یک سیستم دوخطی که دینامیک آن نامعلوم است پیشنهاد می‌گردد. در روش تکرای پیشنهادی و به صورتی تطبیقی، به‌جای دانستن دینامیک سیستم دوخطی با استفاده از اطلاعات برخط ورودی و اندازه‌گیری حالت‌ها، کنترل‌کننده‌ی بهینه طراحی می‌گردد. همچنین با اعمال نویز به‌منزله ورودی به سیستم در یک بازه‌ی زمانی خاص، نیاز به‌ اندازه‌گیری مجدد حالت‌ها برای تکرارهای بعدی برطرف می‌گردد. همگرایی روش تکراری تطبیقی به کنترل‌کننده بهینه به‌صورت قضیه ارائه و اثبات شده است.  </div> <div style="text-align: justify;">Bellman's optimality principle states that designing an optimal controller for continuous-time bilinear systems with known system dynamics has a high computational complexity. As a result, controller design typically uses approximation techniques that depend on system dynamics knowledge. This problem will become more challenging when the system dynamics are unknown. Identifying the bilinear system dynamics through identification techniques is the first step toward overcoming this. It is well known that the identification methods give the designer a linear model to use in the controller design, based on the input and output data of the system. This paper proposes a new iterative method to design an optimal controller for a bilinear system whose dynamics are unknown, using an online adaptive policy iteration. In the proposed iterative method, instead of knowing the dynamics of the bilinear system, the optimal controller is designed by using the online input information and measurement of states. Also, by applying noise as an input for the system in a certain time interval, the need to measure the states for the next iterations is eliminated. The convergence of the adaptive iterative process to the optimal controller has been presented and proved in a theorem.</div> کنترل بهینه, سیستم‌های دوخطی, دینامیک ناشناخته, تطبیقی, سیاست تکرار. Optimal control, Bilinear systems, Unknown dynamics, Adaptive policy iteration (PI) 75 87 http://joc.kntu.ac.ir/browse.php?a_code=A-10-72-8&slc_lang=fa&sid=1 Seyyede Nafiseh Manoochehri Rahbar سیده نفیسه منوچهری رهبر sn.manoochehri@gmail.com 10031947532846009638 10031947532846009638 No Department of Mathematics, Payame Noor University(PNU),P.O.Box19395-4697 گروه ریاضی ، دانشگاه پیام نور، ص.پ. 19395-4697، تهران ، ایران. Naser Pariz ناصر پریز n-pariz@um.ac.ir 10031947532846009639 10031947532846009639 Yes professor Ferdowsi University of Mashhad گروه مهندسی برق، دانشکده فنی و مهندسی، دانشگاه فردوسی مشهد،مشهد، ایران Mohammad Reza Ramezani-al محمد رضا رمضانی آل m-ramezani@qiet.ac.ir 10031947532846009640 10031947532846009640 No Department of Electrical Engineering, Quchan University of Technology گروه مهندسی برق، دانشکده مهندسی برق و کامپیوتر، دانشگاه صنعتی قوچان،قوچان، ایران Aghileh Heydari عقیله حیدری A_heidari@pnu.ac.ir 10031947532846009641 10031947532846009641 No Department of Mathematics, Payame Noor University(PNU),P.O.Box19395-4697 گروه ریاضی ، دانشگاه پیام نور، ص.پ. 19395-4697، تهران ، ایران.