Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Receding Horizon Games

Avatar for Florian Dörfler Florian Dörfler
June 27, 2025
2

Receding Horizon Games

2025 slides by my doctoral student Sophie Hall

Avatar for Florian Dörfler

Florian Dörfler

June 27, 2025
Tweet

Transcript

  1. Receding Horizon Games For safe and fair resource allocation in

    dynamic settings Sophie Hall, PhD student, ETH Zurich June 2025
  2. 2 Short intro to my research Overview of research areas

    Game-theoretic control Receding Horizon Games Model Predictive Control RHG Stability analysis Social choice theory Dynamic resource allocation (energy, water, thermal load…) Economic MPC Optimal job scheduling in computing centers Optimal economic battery operation
  3. 3 Communication network https://bit.ly/3ALJslC Thermal control https://bit.ly/3iaEs2N Traffic https://bit.ly/2ZqoIC0 Water

    network https://bit.ly/3ocqlxp A world of finite resources which need to be shared amongst selfish agents. Ever more allocation decisions are automated. Controllers decide who gets how much when. Game theory models cooperation between self- interested decision makers. → Everyone's decision is optimal given what others do. J. F. Nash, “Equilibrium points in n-person games”, 1950 Solution concept: Nash Equilibrium Predictive Control in Competitive & Cooperative Environments
  4. 4 Drone coordination Truck platooning Autonomous driving Robotics applications Many

    problems can be cast as Dynamic Resource Allocation Bioresources Fish Forests Water A world of finite resources which need to be shared amongst selfish agents. Ever more allocation decisions are automated. Controllers decide who gets how much when. Traffic control Smart grid Thermal building control Infrastructure ▪ Everyone wants the resource, heterogeneous needs → M conflicting objectives ▪ Limited resource is available → Coupling constraint ▪ System follows dynamics ▪ Need to predict into future (to avoid depletion, collision) Model predictive control Game Theory Receding Horizon Games
  5. 5 Receding Horizon Games Various application examples S. L. Cleac’h

    et al., “ALGAMES: A Fast Augmented Lagrangian Solver for Constrained Dynamic Games, 2021 Autonomous driving A. Liniger & J. Lygeros, “A Noncooperative Game Approach to Autonomous Racing, 2020 Car racing R. Spica et al., “A Real-Time Game Theoretic Planner for Autonomous Two-Player Drone Racing,” 2020 Drone racing E. R. Stephens et al., “Game Theoretic Model Predictive Control for Distributed Energy Demand-Side Management,” 2015 Demand-Side Management A. D. Paola et al., “Distributed Coordination of Price-Responsive Electric Loads: A Receding Horizon Approach,” 2018 Electric load scheduling
  6. MPC Paradigms: Overview ▪ 1 objective function ▪ 1 optimization

    problem ▪ centrally solved Exemplary applications: Process, Aerospace, Automotive industry….. Exemplary applications: Mobile robot coordination, Sensor coverage, Microgrids … Centralised MPC Distributed MPC Game-theoretic MPC (Receding Horizon Games) Exemplary applications: Autonomous driving/ racing, Resource allocation problems….. ▪ M conflicting objectives of heterogeneous agents ▪ 1 generalized game ▪ distributed computation ▪ 1 higher level objective function, split into Multiple ▪ M optimization problems ▪ distributed computation
  7. MPC Paradigms: Overview Distributed MPC Game-theoretic MPC (Receding Horizon Games)

    MPC Agent 1 MPC 2 MPC 3 MPC 4 Coupled Coupled Coupled 1 Global MPC Split into M separate controllers ▪ M MPC problems inherently coupled ▪ Can not be combined into one
  8. Receding Horizon Games Sources of coupling 10 A group of

    self-interested agents aim to control a dynamical system. Three sources of coupling arise: 1) Coupled global (local) dynamics: → shared battery system, aquifer water level 2) Coupled stage cost: → energy price increases with consumption 3) Local and coupling constraints: → joint battery state constraints Control input of agent v: Control input of other agents:
  9. At each iteration, agents solve the following coupled optimization problems

    11 Receding Horizon Games The optimization problem Generalized Nash equilibrium (GNE) problem Stage cost Dynamics Constraints Initial condition
  10. A short Introduction Receding Horizon Games RHG Generalized (constrained) Game

    MPC Optimization Problem is an optimal control problem (OCP) is a generalized game
  11. A short Introduction Receding Horizon Games RHG MPC → 1

    Optimal Control Problem (OCP) → M coupled OCPs
  12. 15 Introduction to Generalized Nash Equilibrium problems Belgioioso et al.,

    IEEE Control System Magazine, 2022. | Facchinei et al., 40R, 2009. We need a “fixed point” concept, equivalent to a minimizer in optimization problems → In competitive games, this is an equilibrium point, specifically a Nash equilibrium
  13. 16 GNE solution set Two characterizations → The set of

    GNEs is large and possibly infinite & hard to solve for → Need to select one!
  14. 17 GNE solution set Two characterizations A set of prices

    for which there is no advantage to the agents in being allowed to pay for constraint violations. → Uniform “Shadow price”
  15. variational GNEs in Different Applications Variational GNEs are used in

    a wide variety of applications as the solution concept! 18
  16. variational GNEs Reasons why it is a desirable solution concept

    Why select v-GNE from the entire set? 1) Existence & uniqueness results under broad set of assumptions 2) Fast algorithms solving for v-GNEs based on different communication structures 3) Explicit bounds & guarantees on efficiency in terms of utilitarian sum of costs 4) Fairness in terms of equal penalty, or common “shadow price” with access to resource Set of all GNEs is large and potentially infinite BUT for real-time implementation MUST select one Generalized Normalized Variational 20
  17. ▪ Do I curtail the wind farm or the solar

    panel if the power line cannot feed both? ▪ How to decide the charging rate of multiple electric vehicles charging in parallel? ▪ Who gets water in a water distribution network? ▪ Which drivers deserve to use the short path? ▪ How much bandwidth do I allocate to each server? The world of comparability Comparisons in optimization and control
  18. Interpersonal comparisons are implicit in a lot of real-world situations

    A B “Person B values time more than person A” “Person A values renewable energy 2 times more than person B” “It would be better to allocate water to person A than to person B” Comparability Intrapersonal & interpersonal comparisons
  19. Arrow’s Impossibility Theorem Often unwittingly assumed in the design of

    multi-agent engineering systems Comparability Types of comparability (ONC) Ordinal non-comparability (OLC) Ordinal-level Comparability (CNC) Cardinal Non-comparability (CUC) Cardinal unit comparability (CFC) Cardinal Full Comparability I. Shilov, E. Elokda, S. Hall, H. H. Nax, and S. Bolognani, “Welfare and Cost Aggregation for Multi-Agent Control: When to Choose Which Social Cost Function, and Why?
  20. Comparability Generalized vs Variational Equilibrium S. Hall, F. Dörfler, H.

    H. Nax, and S. Bolognani, “The Limits of “Fairness” of the Variational Generalized Nash Equilibrium,” For uniform “shadow price” to be meaningful require exact information on gradients of cost functions! Selecting the v-GNE imposes stronger comparability requirements than required for the set of GNEs.
  21. variational GNEs Fairness measures relevant in control S. Hall, F.

    Dörfler, H. H. Nax, and S. Bolognani, “The Limits of “Fairness” of the Variational Generalized Nash Equilibrium,”
  22. variational GNEs Fair GNE Generalized Normalized Variational We want to

    select a “fair” GNE Step 1: Pick appropriate comparability notion Step 2: Select desired fairness notion Step 3: Select GNE maximizing fairness BUT very difficult to solve!! Only in 1D can characterize set of GNEs S. Hall, F. Dörfler, H. H. Nax, and S. Bolognani, “The Limits of “Fairness” of the Variational Generalized Nash Equilibrium,”
  23. 36 Stability results for game-theoretic MPC Two approaches Result 1:

    Potential games in receding-horizon Pro: ▪ Potential game can be cast as 1 OCP → Apply economic MPC results ▪ State and input constraints Con: ▪ Requires potential structure of game (smaller class) → coupling in cost to be symmetric → restricts “selfishness” of agents Pro: ▪ Treats the M interdependent OCPs, the” full game” ▪ Certificates numerically verifiable, even in a distributed manner Con: ▪ Results currently only hold for input constraints & stable systems ▪ Insights for cost function design are limited Result 2: General games in receding-horizon S. Hall, G. Belgioioso, D. Liao-McPherson, and F. Dorfler, “Receding Horizon Games with Coupling Constraints for Demand-Side Management” S. Hall, D. Liao-McPherson, G. Belgioioso, and F. Dörfler, “Stability Certificates for Receding Horizon Games,”
  24. 37 Receding Horizon Games Stability Certificates Problem set-up What happens

    to the closed loop system if we play the GNE feedback policy in receding-horizon? Question: Stability in RHG
  25. 38 The full RHG problem per agent: In compact form:

    As a generalized equation: Problem Setup Compact form Pseudo-gradient:
  26. 39 Problem Setup RHG feedback policy Standing Assumptions Ensures uniqueness

    of v-GNE , particularly important in receding-horizon implementation! RHG feedback policy: uniqueness Closed-loop system:
  27. 40 Receding Horizon Games as Feedback Interconnection System 1: System

    2: Stable LTI system Cocoercive, Lipschitz static nonlinearity
  28. 41 Receding Horizon Games as Main Stability Theorem (i) There

    exists a globally asymptotically stable equilibrium point , (ii) the OCP is recursively feasible; (iii) constraints are satisfied for all times. We get an LMI condition that can be verified numerically!
  29. 42 For decoupled dynamics of the form LMI condition can

    be verified locally Distributed LMI check Decoupled Dynamics Particularly relevant in large-scale multi-agent settings: (1) Allows for plug-and-play: Agents can join & leave the game any time (e.g. disconnect from grid, leave allocation mechanism …) (2) Verification can be performed locally, reduces communication requirements & computational burden
  30. 43 Large-scale LMI conditions are “cryptic” we would like some

    insights! Use Schur complements for a 1D example The equilibrium is asymptotically stable if Feasibility of LMI Intuition from a 1D example
  31. 45 Receding Horizon Games in Dynamic Resource Allocation problems Demand

    Side Management Electric vehicle charging Groundwater allocation Supply chains S. Hall, G. Belgioioso, D. Liao-McPherson, and F. Dorfler, “Receding Horizon Games with Coupling Constraints for Demand-Side Management,” S. Hall, L. Guerrini, F. Dörfler, and D. Liao- McPherson, “Receding Horizon Games for Modeling Competitive Supply Chains,”
  32. 46 Can charge and discharge battery and shift consumption to:

    ▪ Reduce cost (energy prices vary over day), buy from grid when energy cheap, discharge battery or shift consumption when energy is expensive ▪ Store energy for times when solar supply is low or blackouts happen (safety backup) Kolter et al., 2011. A single household (smart home) wants to manage their energy consumption optimally. The individual control problem encompasses several components: ▪ Thermal control, temperature & humidity in rooms ▪ EV charging ▪ Electricity consumption of large appliances ▪ Solar panel control ▪ Battery charging Receding Horizon Gams for Demand-Side Management
  33. Solution: Explicitly take into account coupling in the problem! 1.

    Cost function: My energy price depends on aggregate demand, i.e., on how much everyone else is buying from the grid 47 Energy Management For a collection of households 2. Coupling constraint: All players together can only buy a max amount of energy Problem: Aggregate demand peaks Production peaks but coupled through same grid! Hours of the day Aggregate energy load Shift peaks!
  34. Simulation data: Real household consumption, mostly single family homes, state

    of New York, collected in 2019 Peak shaving RHG for Demand-Side Management Simulation results Disturbance rejection Advantages of Receding Horizon Games 1. Better approximation of infinite horizon cost; 2. Avoids undesired “end-of-day” effects; 3. Able to reject disturbances, fulfil joint constraints. S. Hall, G. Belgioioso, D. Liao-McPherson, and F. Dorfler, “Receding Horizon Games with Coupling Constraints for Demand-Side Management,”
  35. Stability ▪ Closed-loop stability for potential case ▪ Closed-loop certificate

    for non-potential case ▪ Stability for time-varying RHGs ▪ Real-time iterations in RHGs ▪ Turnpike and dissipativity in general RHGs Application ▪ RHG for Demand-side Management ▪ RHG for competitive supply chains ▪ RHG for ground water management ▪ Full-scale DSM ◦ Large Scale Dataset, Distributed algorithm, IEEE 37-bus network, Physical line constraints, other price function, uncertain forecasts etc… 50 Summary & Future Work Receding Horizon Games
  36. 51 A word of gratitude and credits to my academic

    advisors & collaborators Giuseppe Belgioioso KTH Stockholm Dominic Liao-McPherson UBC Vancouver Heinrich Nax University of Zürich Saverio Bolognani ETH Zürich Florian Dörfler ETH Zürich And my industry collaborators at Timm Faulwasser TU Hamburg