Receding Horizon Games

Receding Horizon Games For safe and fair resource allocation in
dynamic settings Sophie Hall, PhD student, ETH Zurich June 2025

2 Short intro to my research Overview of research areas
Game-theoretic control Receding Horizon Games Model Predictive Control RHG Stability analysis Social choice theory Dynamic resource allocation (energy, water, thermal load…) Economic MPC Optimal job scheduling in computing centers Optimal economic battery operation

3 Communication network https://bit.ly/3ALJslC Thermal control https://bit.ly/3iaEs2N Traffic https://bit.ly/2ZqoIC0 Water
network https://bit.ly/3ocqlxp A world of finite resources which need to be shared amongst selfish agents. Ever more allocation decisions are automated. Controllers decide who gets how much when. Game theory models cooperation between self- interested decision makers. → Everyone's decision is optimal given what others do. J. F. Nash, “Equilibrium points in n-person games”, 1950 Solution concept: Nash Equilibrium Predictive Control in Competitive & Cooperative Environments

4 Drone coordination Truck platooning Autonomous driving Robotics applications Many
problems can be cast as Dynamic Resource Allocation Bioresources Fish Forests Water A world of finite resources which need to be shared amongst selfish agents. Ever more allocation decisions are automated. Controllers decide who gets how much when. Traffic control Smart grid Thermal building control Infrastructure ▪ Everyone wants the resource, heterogeneous needs → M conflicting objectives ▪ Limited resource is available → Coupling constraint ▪ System follows dynamics ▪ Need to predict into future (to avoid depletion, collision) Model predictive control Game Theory Receding Horizon Games

5 Receding Horizon Games Various application examples S. L. Cleac’h
et al., “ALGAMES: A Fast Augmented Lagrangian Solver for Constrained Dynamic Games, 2021 Autonomous driving A. Liniger & J. Lygeros, “A Noncooperative Game Approach to Autonomous Racing, 2020 Car racing R. Spica et al., “A Real-Time Game Theoretic Planner for Autonomous Two-Player Drone Racing,” 2020 Drone racing E. R. Stephens et al., “Game Theoretic Model Predictive Control for Distributed Energy Demand-Side Management,” 2015 Demand-Side Management A. D. Paola et al., “Distributed Coordination of Price-Responsive Electric Loads: A Receding Horizon Approach,” 2018 Electric load scheduling

6 There is lots of interest but not much theory!

RECEDING HORIZON GAMES 7

MPC Paradigms: Overview ▪ 1 objective function ▪ 1 optimization
problem ▪ centrally solved Exemplary applications: Process, Aerospace, Automotive industry….. Exemplary applications: Mobile robot coordination, Sensor coverage, Microgrids … Centralised MPC Distributed MPC Game-theoretic MPC (Receding Horizon Games) Exemplary applications: Autonomous driving/ racing, Resource allocation problems….. ▪ M conflicting objectives of heterogeneous agents ▪ 1 generalized game ▪ distributed computation ▪ 1 higher level objective function, split into Multiple ▪ M optimization problems ▪ distributed computation

MPC Paradigms: Overview Distributed MPC Game-theoretic MPC (Receding Horizon Games)
MPC Agent 1 MPC 2 MPC 3 MPC 4 Coupled Coupled Coupled 1 Global MPC Split into M separate controllers ▪ M MPC problems inherently coupled ▪ Can not be combined into one

Receding Horizon Games Sources of coupling 10 A group of
self-interested agents aim to control a dynamical system. Three sources of coupling arise: 1) Coupled global (local) dynamics: → shared battery system, aquifer water level 2) Coupled stage cost: → energy price increases with consumption 3) Local and coupling constraints: → joint battery state constraints Control input of agent v: Control input of other agents:

At each iteration, agents solve the following coupled optimization problems
11 Receding Horizon Games The optimization problem Generalized Nash equilibrium (GNE) problem Stage cost Dynamics Constraints Initial condition

A short Introduction Receding Horizon Games RHG Generalized (constrained) Game
MPC Optimization Problem is an optimal control problem (OCP) is a generalized game

A short Introduction Receding Horizon Games RHG MPC → 1
Optimal Control Problem (OCP) → M coupled OCPs

SOLUTION CONCEPT GENERALIZED NASH EQUILIBRIUM 14

15 Introduction to Generalized Nash Equilibrium problems Belgioioso et al.,
IEEE Control System Magazine, 2022. | Facchinei et al., 40R, 2009. We need a “fixed point” concept, equivalent to a minimizer in optimization problems → In competitive games, this is an equilibrium point, specifically a Nash equilibrium

16 GNE solution set Two characterizations → The set of
GNEs is large and possibly infinite & hard to solve for → Need to select one!

17 GNE solution set Two characterizations A set of prices
for which there is no advantage to the agents in being allowed to pay for constraint violations. → Uniform “Shadow price”

variational GNEs in Different Applications Variational GNEs are used in
a wide variety of applications as the solution concept! 18

“FAIRNESS” IN VARIATIONAL GNES 19

variational GNEs Reasons why it is a desirable solution concept
Why select v-GNE from the entire set? 1) Existence & uniqueness results under broad set of assumptions 2) Fast algorithms solving for v-GNEs based on different communication structures 3) Explicit bounds & guarantees on efficiency in terms of utilitarian sum of costs 4) Fairness in terms of equal penalty, or common “shadow price” with access to resource Set of all GNEs is large and potentially infinite BUT for real-time implementation MUST select one Generalized Normalized Variational 20

variational GNEs Fairness measures relevant in control 21

ISSUE # 1 “FAIRNESS” REQUIRES COMPARABILITY 22

▪ Do I curtail the wind farm or the solar
panel if the power line cannot feed both? ▪ How to decide the charging rate of multiple electric vehicles charging in parallel? ▪ Who gets water in a water distribution network? ▪ Which drivers deserve to use the short path? ▪ How much bandwidth do I allocate to each server? The world of comparability Comparisons in optimization and control

Interpersonal comparisons are implicit in a lot of real-world situations
A B “Person B values time more than person A” “Person A values renewable energy 2 times more than person B” “It would be better to allocate water to person A than to person B” Comparability Intrapersonal & interpersonal comparisons

Arrow’s Impossibility Theorem Often unwittingly assumed in the design of
multi-agent engineering systems Comparability Types of comparability (ONC) Ordinal non-comparability (OLC) Ordinal-level Comparability (CNC) Cardinal Non-comparability (CUC) Cardinal unit comparability (CFC) Cardinal Full Comparability I. Shilov, E. Elokda, S. Hall, H. H. Nax, and S. Bolognani, “Welfare and Cost Aggregation for Multi-Agent Control: When to Choose Which Social Cost Function, and Why?

Comparability Generalized vs Variational Equilibrium S. Hall, F. Dörfler, H.
H. Nax, and S. Bolognani, “The Limits of “Fairness” of the Variational Generalized Nash Equilibrium,” For uniform “shadow price” to be meaningful require exact information on gradients of cost functions! Selecting the v-GNE imposes stronger comparability requirements than required for the set of GNEs.

ISSUE # 2 V-GNE “FAIRNESS” NOT RELATED TO EXISTING NOTIONS
32

variational GNEs Fairness measures relevant in control S. Hall, F.
Dörfler, H. H. Nax, and S. Bolognani, “The Limits of “Fairness” of the Variational Generalized Nash Equilibrium,”

variational GNEs Fair GNE Generalized Normalized Variational We want to
select a “fair” GNE Step 1: Pick appropriate comparability notion Step 2: Select desired fairness notion Step 3: Select GNE maximizing fairness BUT very difficult to solve!! Only in 1D can characterize set of GNEs S. Hall, F. Dörfler, H. H. Nax, and S. Bolognani, “The Limits of “Fairness” of the Variational Generalized Nash Equilibrium,”

STABILITY CERTIFICATES FOR RECEDING HORIZON GAMES 35

36 Stability results for game-theoretic MPC Two approaches Result 1:
Potential games in receding-horizon Pro: ▪ Potential game can be cast as 1 OCP → Apply economic MPC results ▪ State and input constraints Con: ▪ Requires potential structure of game (smaller class) → coupling in cost to be symmetric → restricts “selfishness” of agents Pro: ▪ Treats the M interdependent OCPs, the” full game” ▪ Certificates numerically verifiable, even in a distributed manner Con: ▪ Results currently only hold for input constraints & stable systems ▪ Insights for cost function design are limited Result 2: General games in receding-horizon S. Hall, G. Belgioioso, D. Liao-McPherson, and F. Dorfler, “Receding Horizon Games with Coupling Constraints for Demand-Side Management” S. Hall, D. Liao-McPherson, G. Belgioioso, and F. Dörfler, “Stability Certificates for Receding Horizon Games,”

37 Receding Horizon Games Stability Certificates Problem set-up What happens
to the closed loop system if we play the GNE feedback policy in receding-horizon? Question: Stability in RHG

38 The full RHG problem per agent: In compact form:
As a generalized equation: Problem Setup Compact form Pseudo-gradient:

39 Problem Setup RHG feedback policy Standing Assumptions Ensures uniqueness
of v-GNE , particularly important in receding-horizon implementation! RHG feedback policy: uniqueness Closed-loop system:

40 Receding Horizon Games as Feedback Interconnection System 1: System
2: Stable LTI system Cocoercive, Lipschitz static nonlinearity

41 Receding Horizon Games as Main Stability Theorem (i) There
exists a globally asymptotically stable equilibrium point , (ii) the OCP is recursively feasible; (iii) constraints are satisfied for all times. We get an LMI condition that can be verified numerically!

42 For decoupled dynamics of the form LMI condition can
be verified locally Distributed LMI check Decoupled Dynamics Particularly relevant in large-scale multi-agent settings: (1) Allows for plug-and-play: Agents can join & leave the game any time (e.g. disconnect from grid, leave allocation mechanism …) (2) Verification can be performed locally, reduces communication requirements & computational burden

43 Large-scale LMI conditions are “cryptic” we would like some
insights! Use Schur complements for a 1D example The equilibrium is asymptotically stable if Feasibility of LMI Intuition from a 1D example

RESOURCE ALLOCATION PROBLEMS 44

45 Receding Horizon Games in Dynamic Resource Allocation problems Demand
Side Management Electric vehicle charging Groundwater allocation Supply chains S. Hall, G. Belgioioso, D. Liao-McPherson, and F. Dorfler, “Receding Horizon Games with Coupling Constraints for Demand-Side Management,” S. Hall, L. Guerrini, F. Dörfler, and D. Liao- McPherson, “Receding Horizon Games for Modeling Competitive Supply Chains,”

46 Can charge and discharge battery and shift consumption to:
▪ Reduce cost (energy prices vary over day), buy from grid when energy cheap, discharge battery or shift consumption when energy is expensive ▪ Store energy for times when solar supply is low or blackouts happen (safety backup) Kolter et al., 2011. A single household (smart home) wants to manage their energy consumption optimally. The individual control problem encompasses several components: ▪ Thermal control, temperature & humidity in rooms ▪ EV charging ▪ Electricity consumption of large appliances ▪ Solar panel control ▪ Battery charging Receding Horizon Gams for Demand-Side Management

Solution: Explicitly take into account coupling in the problem! 1.
Cost function: My energy price depends on aggregate demand, i.e., on how much everyone else is buying from the grid 47 Energy Management For a collection of households 2. Coupling constraint: All players together can only buy a max amount of energy Problem: Aggregate demand peaks Production peaks but coupled through same grid! Hours of the day Aggregate energy load Shift peaks!

Simulation data: Real household consumption, mostly single family homes, state
of New York, collected in 2019 Peak shaving RHG for Demand-Side Management Simulation results Disturbance rejection Advantages of Receding Horizon Games 1. Better approximation of infinite horizon cost; 2. Avoids undesired “end-of-day” effects; 3. Able to reject disturbances, fulfil joint constraints. S. Hall, G. Belgioioso, D. Liao-McPherson, and F. Dorfler, “Receding Horizon Games with Coupling Constraints for Demand-Side Management,”

CONCLUSION & OUTLOOK 49

Stability ▪ Closed-loop stability for potential case ▪ Closed-loop certificate
for non-potential case ▪ Stability for time-varying RHGs ▪ Real-time iterations in RHGs ▪ Turnpike and dissipativity in general RHGs Application ▪ RHG for Demand-side Management ▪ RHG for competitive supply chains ▪ RHG for ground water management ▪ Full-scale DSM ◦ Large Scale Dataset, Distributed algorithm, IEEE 37-bus network, Physical line constraints, other price function, uncertain forecasts etc… 50 Summary & Future Work Receding Horizon Games

51 A word of gratitude and credits to my academic
advisors & collaborators Giuseppe Belgioioso KTH Stockholm Dominic Liao-McPherson UBC Vancouver Heinrich Nax University of Zürich Saverio Bolognani ETH Zürich Florian Dörfler ETH Zürich And my industry collaborators at Timm Faulwasser TU Hamburg

52 Thank you for your attention! Questions? Sophie Hall PhD
student , ETH Zürich [email protected]

Receding Horizon Games

Receding Horizon Games

More Decks by Florian Dörfler

Featured

Transcript