Journal of Control

fa طراحی تنظیم‌گر خطی درجه دومِ پس‌خور خروجی برای سیستم‌های زمان گسسته‌ی دینامیک نامعلوم بر اساس بازسازی حالت‌ها و برنامه‌ریزی نیمه معین Model-Free Synthesis of Output Feedback Linear Quadratic Regulators for Discrete-Time Systems with Unknown Dynamics via State Reconstruction and Semidefinite Programming عمومى General پژوهشي Research paper در این مقاله یک روش جدید مستقل از مدل، غیر تکراری و تطبیقی برای طراحی برخط تنظیم‌گر خطی درجه دوم زمان گسسته با پس‌خور خروجی ارائه می‌شود. پیش‌تر، در ادبیات تحقیق، از روش‌های یادگیری تقویتی برای حل این مسأله استفاده شده است. این روش‌های تکراری نیازمند نمونه‌برداری از تعداد نسبتاً زیادی داده‌های ورودی و خروجی سیستم هستند که این امر هزینه‌ی طراحی را نیز افزایش می‌دهد. در این مقاله صرفاً با یک بار نمونه برداری از داده‌های ورودی و خروجی در یک بازه¬ی زمانی بسیار کوتاه و با بازسازی حالت‌ها به روش مستقل از مدل، روشی برای بازنویسی مسأله‌ی تنظیم‌گر خطی درجه دوم به صورت یک مسأله‌ی برنامه‌ریزی نیمه معین با قیود ناتساوی ماتریسی خطی، معرفی می‌شود. همچنین در الگوریتم پیشنهادی با بهره‌گیری از معادله‌ی بلمن امکان باز طراحی کنترل کننده، برای تطبیق با تغییرات احتمالی در دینامیک سیستم فراهم می‌گردد. در نهایت، شبیه‌سازی‌ها نشان می‌دهند که الگوریتم پیشنهادی، در مقایسه با الگوریتم‌های یادگیری Q، با تعداد داده بسیار کمتر و هزینه طراحی بسیار پایین&lrm;تری قادر به حل مسأله است. همچنین، اجرای این الگوریتم برای یک سیستم با دو ورودی و از مرتبه چهار، کاربرد آن را در طراحی کنترل کننده برای سیستم‌های پیچیده‌تر نشان می‌دهد.   In this paper, we present a novel model-free, non-iterative, and adaptive approach for online design of a discrete-time linear quadratic regulator (LQR) with output feedback. Previously, reinforcement learning methods have been used to solve this problem. These iterative methods require sampling a relatively large number of input and output data from the system, which increases the design cost. In this paper, we introduce a method that reformulates the LQR problem as a semidefinite programming problem with linear matrix inequality constraints by sampling input and output data only once over a very short time interval and reconstructing the states using a model-free approach. Moreover, by utilizing the Bellman equation in the proposed algorithm, we enable the redesign of the controller to adapt to possible changes in system dynamics. Finally, through simulations, we demonstrate that our proposed algorithm can solve the problem with significantly fewer data samples and lower design costs compared to Q-learning algorithms. Additionally, by implementing this algorithm on a fourth-order two-input system, we illustrate its applicability to more complex systems.   تنظیم‌گر خطی درجه دوم, برنامه‌ریزی نیمه معین, کنترل کننده‌ی پس‌خور خروجی, یادگیری Q Linear quadratic regulator, semidefinite programming, output feedback, Q-learning 0 0 http://joc.kntu.ac.ir/browse.php?a_code=A-10-997-2&slc_lang=fa&sid=1 Ahmad Akbari احمد اکبری a.akbari@sut.ac.ir 100319475328460010074 100319475328460010074 Yes Sahand University of Technology دانشگاه صنعتی تبریز (سهند) Ahmad Pishro Asl احمد پیشرو اصل a_pishro@sut.ac.ir 100319475328460010075 100319475328460010075 No Sahand University of Technology دانشگاه صنعتی تبریز (سهند)،