Discussion about this post

User's avatar
Daniel González Arribas's avatar

One particular nuance in this discussion that I think is worth mentioning is that many different applications walk under the banner of "optimal control", but they're all over the place in the spectrum between "optimal" and "control". In some applications, you're basically doing optimal decisionmaking over a long horizon of a defined scenario, the objective function corresponds to tangible quantities (such as cost, or mass, or time), and sometimes you don't even care about stability because it's either the concern of another engineer (in other words, you're doing more "optimal guidance" than optimal control) or the system itself is already stable. In other applications, you're basically using "optimal control" to design a controller, the scenario is sometimes just representative, and the objective function (e.g. the Q and R matrices in LQR) is kind of arbitrary or just a different tuning parameter for the engineer to tweak. Again, using the LQR example, rarely do we actually care about "the square of the control or control deviation" except as a vague proxy for output behavior; in many instances, the actual cost is linear on the integral of the control magnitude - but the associated control problem is then harder (switching between saturated control arcs), more brittle against uncertainty and does not allow "solve once" solutions as in LQR. Of course, the more "optimal" you go, the more optimization pathologies you will encounter.

Lalitha Sankar's avatar

Max: I really enjoyed this piece. It reminded me of stories I've heard of interactions between control theorists and power systems engineers in the 70s-90s -- suffices to say there was a lot of sneer from one side of the other. Felix Wu tried to fix it, but the other side can also be deeply rooted in not changing things unless absolutely needed. Lead-lag compensators are so closely related to primary frequency control in power systems. Feels a bit like black magic but they do have differential equations ;). I am often amazed by how LQR and LQG systems were largely enough to help us fly spacecrafts! But I wonder if one shoule compare such seemingly "model-able" dynamical systems to where RL is presently used. On the other hand, every time I see a Waymo (which in AZ is everyday and many times a day), I have to wonder if there is any RL at all in it or if it is just Kalman at its best.

No posts

Ready for more?