References

Dynamic programming (DP) is a fairly powerful and yet general framework that finds its use in many disciplines. Optimal control is not the only one. But in this overview of the literature we deliberately focus on the DP references with optimal control flavour.

Our introductory treatment was based almost exclusively on the (also just introductory) Chapter 6 in [1]. Electronic version of the book is freely available on the author’s webpage.

Comparable introduction is provided in [2]. Although it does not appear to be legally available for free in an electronic form, its reprint by a low-cost publisher makes it an affordable (and recommendable) classic reference. Another classic [3] actually uses dynamic programming as the key technique to derive all those LQ-optimal regulation and tracking results. A few copies of this book are available in the faculty library at NTK. The authors also made an electronic version available for free on their website.

Fairly comprehensive treatment of control-oriented DP is in the two-volume monograph [4] and [5]. It is not available online for free, but the book webpage contains links to other supporting materials including lecture notes. Furthermore, the latest book by the same author [6], which is available for free download, contains a decent introduction to dynamic programming.

Having just referenced a book on reinforcement learning (RL), indeed, this popular concept — or at least some of its flavours — is closely related to dynamic programming. In fact, it offers a way to overcome some of the limitations of dynamic programming. In our introductory lecture we are not covering RL, but an interested student can take advantage of availability of high-quality resources such as the the RL-related books and other resources by D. Bertsekas and another recommendable introduction to RL from control systems perspective [7], which is also available for free download.

The book [8] often regarded as the bible of RL is nice (and freely available for download) but may be rather difficult to read for a control engineer because of major differences in terminology and notation.

References

[1]

F. L. Lewis, D. Vrabie, and V. L. Syrmo, Optimal Control, 3rd ed. John Wiley & Sons, 2012. Accessed: Mar. 09, 2022. [Online]. Available: https://lewisgroup.uta.edu/FL%20books/Lewis%20optimal%20control%203rd%20edition%202012.pdf

[2]

D. E. Kirk, Optimal Control Theory: An Introduction, Reprint of the 1970 edition. Dover Publications, 2004.

[3]

B. D. O. Anderson and J. B. Moore, Optimal Control: Linear Quadratic Methods, Reprint of the 1989 edition. Dover Publications, 2007. Available: http://users.cecs.anu.edu.au/~john/papers/BOOK/B03.PDF

[4]

D. P. Bertsekas, Dynamic Programming and Optimal Control, 4th ed., vol. I. Belmont, Massachusetts: Athena Scientific, 2017. Available: http://athenasc.com/dpbook.html

[5]

D. P. Bertsekas, Dynamic Programming and Optimal Control, 4th ed., vol. II. Belmont, Massachusetts: Athena Scientific, 2012. Available: http://athenasc.com/dpbook.html

[6]

D. P. Bertsekas, A Course in Reinforcement Learning. Belmont, Massachusetts: Athena Scientific, 2023. Accessed: Sep. 15, 2023. [Online]. Available: https://web.mit.edu/dimitrib/www/RLCOURSECOMPLETE.pdf

[7]

S. Meyn, Control Systems and Reinforcement Learning. Cambridge University Press, 2022. Accessed: Aug. 25, 2021. [Online]. Available: https://meyn.ece.ufl.edu/control-systems-and-reinforcement-learning/

[8]

R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction, 2nd ed. Cambridge, Massachusetts: A Bradford Book, 2018. Available: http://incompleteideas.net/book/the-book-2nd.html