Enrique Mallada's Home Page

Associate Professor • Electrical & Computer Engineering • (he/him/his)

Barton Hall 312 • 3400 N Charles St • Baltimore • MD 21218
phone: 410-516-7018 • fax: 410-516-5566 • mallada [at] jhu [dot] edu

I have been an associate professor in Electrical and Computer Engineering (ECE) at Johns Hopkins University (JHU) since July 2022. I earned my Ph.D. in ECE with a minor in Applied Mathematics from Cornell University in Jan 2014 under the supervision of an awesome advisor and person, Prof. A. Kevin Tang. Before joining JHU as an assistant professor in Jan 2016, I was a postdoctoral scholar at the Center for the Mathematics of Information (CMI) in the Computational and Mathematical Sciences (CMS) department at Caltech from 2013 to 2015, where I had the pleasure of being mentored by Prof. Steven Low and Prof. Adam Wierman. Originally from Montevideo, Uruguay, I completed my B.S. in Telecommunications Engineering at ORT University, where I was also a research assistant in the MATE group led by Prof. Fernando Paganini, the person who introduced me to the joy of research.

Research Interests

Networked Systems: coupled oscillators, clock synchronization, saddle-flows, network coherence, distributed coordination, consensus
Power Systems: frequency control, inverter-based control, real-time congestion management, electricity markets, reduced-order models
Optimization: time-varying optimization, primal-dual algorithms, semidefinite programming, sum-of-squares optimization
Machine Learning: reinforcement learning, sparse recovery, subspace preserving recovery, network tomography, multi-armed bandits

Recent Talks

A complete list of talks can be found here.

2026-01-27: Reliability Challenges in IBR-Rich Power Grids, Newcastle.

The rapid integration of inverter-based resources (IBRs) is transforming power system dynamics, eroding traditional sources of inertia and voltage support and introducing new forms of instability. In IBR-rich grids, uncertainty in network conditions, operating points, and proprietary inverter controls has led to an increasing number of oscillatory events, including sub-synchronous oscillations driven by inverter–grid interactions. This talk examines the mechanisms underlying these instabilities and argues that their onset can be understood through small-signal models and impedance-based analysis, where critical transitions arise via Hopf bifurcations. Building on this perspective, we present robust, decentralized stability criteria that account for inverter heterogeneity and operating-point dependence while requiring only local measurements and testing. These results expose a fundamental trade-off between robustness and efficiency: expanding the admissible dispatch region necessitates tighter constraints on inverter dynamic behavior. Together, these insights provide a foundation for stability-aware planning, control, and operation of future low-inertia power systems dominated by inverter-based resources.

@talk{EPICS26-AU,
  abstract = {The rapid integration of inverter-based resources (IBRs) is transforming power system dynamics, eroding traditional sources of inertia and voltage support and introducing new forms of instability. In IBR-rich grids, uncertainty in network conditions, operating points, and proprietary inverter controls has led to an increasing number of oscillatory events, including sub-synchronous oscillations driven by inverter--grid interactions.

This talk examines the mechanisms underlying these instabilities and argues that their onset can be understood through small-signal models and impedance-based analysis, where critical transitions arise via Hopf bifurcations. Building on this perspective, we present robust, decentralized stability criteria that account for inverter heterogeneity and operating-point dependence while requiring only local measurements and testing. These results expose a fundamental trade-off between robustness and efficiency: expanding the admissible dispatch region necessitates tighter constraints on inverter dynamic behavior.

Together, these insights provide a foundation for stability-aware planning, control, and operation of future low-inertia power systems dominated by inverter-based resources.},
  annote = {Enrique Mallada is an Associate Professor of Electrical and Computer Engineering at Johns Hopkins University, where he has been a faculty member since 2016. He received his Ph.D. in Electrical and Computer Engineering with a minor in Applied Mathematics from Cornell University and a Telecommunications Engineering degree from ORT University, Uruguay. Before joining Hopkins, he was a Postdoctoral Fellow at Caltech's Center for the Mathematics of Information. His honors include the Johns Hopkins Alumni Association Teaching Award (2021), NSF CAREER Award (2018), Caltech's CMI Fellowship (2014), and Cornell ECE Director's Thesis Award (2014). His research spans control and dynamical systems, machine learning, and optimization, with applications to safety-critical systems, networks, and power grids.},
  date = {01/27/2026},
  day = {27},
  event = {Newcastle},
  host = {Behrooz Bahrani},
  month = {01},
  role = {Speaker},
  title = {Reliability Challenges in IBR-Rich Power Grids},
  url = {https://mallada.ece.jhu.edu/talks/202601-EPICS-AU.pdf},
  year = {2026}
}

2025-12-05: Nonparametric Analysis and Control of Dynamical Systems, Control Workshop @ Uruguay.

This talk presents a novel nonparametric framework for analyzing dynamical systems and synthesizing control policies that relies purely on trajectory data and is designed to exploit GPU parallelization for scalability. The key insight behind this work is to relax strict objectives, such as invariance and optimality, and replace them with weaker conditions that enable a flexible trade-off between accuracy, computational complexity, and sample efficiency. First, we introduce the concept of recurrence, a relaxation of invariance that allows trajectories to leave a set temporarily before returning within a finite time. This relaxed condition serves as a functional substitute for invariance and provides an alternative foundation for analyzing dynamical systems. By leveraging recurrence, we develop integral Lyapunov and barrier function conditions, where function values are required to be eventually monotonic over a finite time window rather than strictly increasing or decreasing. This relaxation offers a more flexible framework for stability and safety verification, enabling a trade-off between verification accuracy and computational complexity. Next, we turn to the policy optimization problem and introduce a class of nonparametric policies designed for continuous action spaces. These policies rely purely on (expert) trajectory data to construct a nonparametric lower bound, Q_lb, on the optimal action-value function Q^⋆. Crucially, we show that this policy representation admits a policy improvement theorem, overcoming a key limitation faced by function approximation methods in continuous action spaces. Building on this result, we develop a practical algorithm that drives continual policy improvement by selectively incorporating new expert demonstrations, ensuring efficient data use while achieving monotonic performance gains.

@talk{controluy25,
  abstract = {This talk presents a novel nonparametric framework for analyzing dynamical systems and synthesizing control policies that relies purely on trajectory data and is designed to exploit GPU parallelization for scalability. The key insight behind this work is to relax strict objectives, such as invariance and optimality, and replace them with weaker conditions that enable a flexible trade-off between accuracy, computational complexity, and sample efficiency.

First, we introduce the concept of recurrence, a relaxation of invariance that allows trajectories to leave a set temporarily before returning within a finite time. This relaxed condition serves as a functional substitute for invariance and provides an alternative foundation for analyzing dynamical systems. By leveraging recurrence, we develop integral Lyapunov and barrier function conditions, where function values are required to be eventually monotonic over a finite time window rather than strictly increasing or decreasing. This relaxation offers a more flexible framework for stability and safety verification, enabling a trade-off between verification accuracy and computational complexity.

Next, we turn to the policy optimization problem and introduce a class of nonparametric policies designed for continuous action spaces. These policies rely purely on (expert) trajectory data to construct a nonparametric lower bound, Q_lb, on the optimal action-value function Q^⋆. Crucially, we show that this policy representation admits a policy improvement theorem, overcoming a key limitation faced by function approximation methods in continuous action spaces. Building on this result, we develop a practical algorithm that drives continual policy improvement by selectively incorporating new expert demonstrations, ensuring efficient data use while achieving monotonic performance gains.},
  annote = {Enrique Mallada is an Associate Professor of Electrical and Computer Engineering at Johns Hopkins University, where he has been a faculty member since 2016. He received his Ph.D. in Electrical and Computer Engineering with a minor in Applied Mathematics from Cornell University and a Telecommunications Engineering degree from ORT University, Uruguay. Before joining Hopkins, he was a Postdoctoral Fellow at Caltech's Center for the Mathematics of Information. His honors include the Johns Hopkins Alumni Association Teaching Award (2021), NSF CAREER Award (2018), Caltech's CMI Fellowship (2014), and Cornell ECE Director's Thesis Award (2014). His research spans control and dynamical systems, machine learning, and optimization, with applications to safety-critical systems, networks, and power grids.},
  date = {12/05/2025},
  day = {05},
  event = {Control Workshop @ Uruguay},
  host = {Andres Ferragut, Enrique Mallada},
  month = {12},
  role = {Lecturer},
  title = {Nonparametric Analysis and Control of Dynamical Systems},
  url = {https://mallada.ece.jhu.edu/talks/202512-Control-UY.pdf},
  year = {2025}
}

2025-12-05: Nonparametric Analysis and Control of Dynamical Systems, Neurocomputing and Dynamics Workshop.

This talk presents a novel nonparametric framework for analyzing dynamical systems and synthesizing control policies that relies purely on trajectory data and is designed to exploit GPU parallelization for scalability. The key insight behind this work is to relax strict objectives, such as invariance and optimality, and replace them with weaker conditions that enable a flexible trade-off between accuracy, computational complexity, and sample efficiency. First, we introduce the concept of recurrence, a relaxation of invariance that allows trajectories to leave a set temporarily before returning within a finite time. This relaxed condition serves as a functional substitute for invariance and provides an alternative foundation for analyzing dynamical systems. By leveraging recurrence, we develop integral Lyapunov and barrier function conditions, where function values are required to be eventually monotonic over a finite time window rather than strictly increasing or decreasing. This relaxation offers a more flexible framework for stability and safety verification, enabling a trade-off between verification accuracy and computational complexity. Next, we turn to the policy optimization problem and introduce a class of nonparametric policies designed for continuous action spaces. These policies rely purely on (expert) trajectory data to construct a nonparametric lower bound, Q_lb, on the optimal action-value function Q^⋆. Crucially, we show that this policy representation admits a policy improvement theorem, overcoming a key limitation faced by function approximation methods in continuous action spaces. Building on this result, we develop a practical algorithm that drives continual policy improvement by selectively incorporating new expert demonstrations, ensuring efficient data use while achieving monotonic performance gains.

@talk{cdcworkshop25,
  abstract = {This talk presents a novel nonparametric framework for analyzing dynamical systems and synthesizing control policies that relies purely on trajectory data and is designed to exploit GPU parallelization for scalability. The key insight behind this work is to relax strict objectives, such as invariance and optimality, and replace them with weaker conditions that enable a flexible trade-off between accuracy, computational complexity, and sample efficiency.

First, we introduce the concept of recurrence, a relaxation of invariance that allows trajectories to leave a set temporarily before returning within a finite time. This relaxed condition serves as a functional substitute for invariance and provides an alternative foundation for analyzing dynamical systems. By leveraging recurrence, we develop integral Lyapunov and barrier function conditions, where function values are required to be eventually monotonic over a finite time window rather than strictly increasing or decreasing. This relaxation offers a more flexible framework for stability and safety verification, enabling a trade-off between verification accuracy and computational complexity.

Next, we turn to the policy optimization problem and introduce a class of nonparametric policies designed for continuous action spaces. These policies rely purely on (expert) trajectory data to construct a nonparametric lower bound, Q_lb, on the optimal action-value function Q^⋆. Crucially, we show that this policy representation admits a policy improvement theorem, overcoming a key limitation faced by function approximation methods in continuous action spaces. Building on this result, we develop a practical algorithm that drives continual policy improvement by selectively incorporating new expert demonstrations, ensuring efficient data use while achieving monotonic performance gains.},
  annote = {Enrique Mallada is an Associate Professor of Electrical and Computer Engineering at Johns Hopkins University, where he has been a faculty member since 2016. He received his Ph.D. in Electrical and Computer Engineering with a minor in Applied Mathematics from Cornell University and a Telecommunications Engineering degree from ORT University, Uruguay. Before joining Hopkins, he was a Postdoctoral Fellow at Caltech's Center for the Mathematics of Information. His honors include the Johns Hopkins Alumni Association Teaching Award (2021), NSF CAREER Award (2018), Caltech's CMI Fellowship (2014), and Cornell ECE Director's Thesis Award (2014). His research spans control and dynamical systems, machine learning, and optimization, with applications to safety-critical systems, networks, and power grids.},
  date = {12/05/2025},
  day = {05},
  event = {Neurocomputing and Dynamics Workshop},
  host = {Francesco Bullo, Adilson Motter, Arthur Montanari},
  month = {12},
  role = {Lecturer},
  title = {Nonparametric Analysis and Control of Dynamical Systems},
  url = {https://mallada.ece.jhu.edu/talks/202512-CDC-Workshop.pdf},
  year = {2025}
}

2025-10-31: Reliability Challenges in IBR-Rich Power Grids, 50 Hertz.

The rapid integration of inverter-based resources (IBRs) is transforming power system dynamics, eroding traditional sources of inertia and voltage support and introducing new forms of instability. In IBR-rich grids, uncertainty in network conditions, operating points, and proprietary inverter controls has led to an increasing number of oscillatory events, including sub-synchronous oscillations driven by inverter–grid interactions. This talk examines the mechanisms underlying these instabilities and argues that their onset can be understood through small-signal models and impedance-based analysis, where critical transitions arise via Hopf bifurcations. Building on this perspective, we present robust, decentralized stability criteria that account for inverter heterogeneity and operating-point dependence while requiring only local measurements and testing. These results expose a fundamental trade-off between robustness and efficiency: expanding the admissible dispatch region necessitates tighter constraints on inverter dynamic behavior. Together, these insights provide a foundation for stability-aware planning, control, and operation of future low-inertia power systems dominated by inverter-based resources.

@talk{EPICS25,
  abstract = {The rapid integration of inverter-based resources (IBRs) is transforming power system dynamics, eroding traditional sources of inertia and voltage support and introducing new forms of instability. In IBR-rich grids, uncertainty in network conditions, operating points, and proprietary inverter controls has led to an increasing number of oscillatory events, including sub-synchronous oscillations driven by inverter--grid interactions.

This talk examines the mechanisms underlying these instabilities and argues that their onset can be understood through small-signal models and impedance-based analysis, where critical transitions arise via Hopf bifurcations. Building on this perspective, we present robust, decentralized stability criteria that account for inverter heterogeneity and operating-point dependence while requiring only local measurements and testing. These results expose a fundamental trade-off between robustness and efficiency: expanding the admissible dispatch region necessitates tighter constraints on inverter dynamic behavior.

Together, these insights provide a foundation for stability-aware planning, control, and operation of future low-inertia power systems dominated by inverter-based resources.},
  annote = {Enrique Mallada is an Associate Professor of Electrical and Computer Engineering at Johns Hopkins University, where he has been a faculty member since 2016. He received his Ph.D. in Electrical and Computer Engineering with a minor in Applied Mathematics from Cornell University and a Telecommunications Engineering degree from ORT University, Uruguay. Before joining Hopkins, he was a Postdoctoral Fellow at Caltech's Center for the Mathematics of Information. His honors include the Johns Hopkins Alumni Association Teaching Award (2021), NSF CAREER Award (2018), Caltech's CMI Fellowship (2014), and Cornell ECE Director's Thesis Award (2014). His research spans control and dynamical systems, machine learning, and optimization, with applications to safety-critical systems, networks, and power grids.},
  date = {10/31/2025},
  day = {31},
  event = {50 Hertz},
  host = {Ben Hobbs},
  month = {10},
  role = {Speaker},
  title = {Reliability Challenges in IBR-Rich Power Grids},
  url = {https://mallada.ece.jhu.edu/talks/202510-EPICS.pdf},
  year = {2025}
}

2025-10-26: Nonparametric Policy Improvement in Continuous Action Spaces via Expert Demonstrations, INFORMS.

The policy improvement theorem is a fundamental building block of classical reinforcement learning for discrete action spaces. Unfortunately, the lack of an analogous result for continuous action spaces with function approximation has historically limited theoretical guarantees of policy optimization algorithms, undermining their reliability. Here, we introduce a novel nonparametric policy that relies purely on data to take actions and that admits a policy improvement theorem for deterministic Markov Decision Processes (MDPs). By imposing mild regularity assumptions on the optimal policy, we show that, when data come from expert demonstrations, one can construct a nonparametric lower bound on the value of the policy, thus enabling its robust evaluation. The constructed lower bound naturally leads to a simple improvement mechanism based on adding more demonstrations. We also provide conditions to identify regions of the state space where additional demonstrations are needed to meet specific performance goals. Finally, we propose a policy optimization algorithm that ensures a monotonic improvement of the lower bound and leads to high-probability performance guarantees. These contributions provide a foundational step toward establishing a rigorous framework for policy improvement in continuous action spaces.

@talk{informs25,
  abstract = {The policy improvement theorem is a fundamental building block of classical reinforcement learning for discrete action spaces. Unfortunately, the lack of an analogous result for continuous action spaces with function approximation has historically limited theoretical guarantees of policy optimization algorithms, undermining their reliability. Here, we introduce a novel nonparametric policy that relies purely on data to take actions and that admits a policy improvement theorem for deterministic Markov Decision Processes (MDPs). By imposing mild regularity assumptions on the optimal policy, we show that, when data come from expert demonstrations, one can construct a nonparametric lower bound on the value of the policy, thus enabling its robust evaluation. The constructed lower bound naturally leads to a simple improvement mechanism based on adding more demonstrations. We also provide conditions to identify regions of the state space where additional demonstrations are needed to meet specific performance goals. Finally, we propose a policy optimization algorithm that ensures a monotonic improvement of the lower bound and leads to high-probability performance guarantees. These contributions provide a foundational step toward establishing a rigorous framework for policy improvement in continuous action spaces.},
  annote = {Enrique Mallada is an Associate Professor of Electrical and Computer Engineering at Johns Hopkins University, where he has been a faculty member since 2016. He received his Ph.D. in Electrical and Computer Engineering with a minor in Applied Mathematics from Cornell University and a Telecommunications Engineering degree from ORT University, Uruguay. Before joining Hopkins, he was a Postdoctoral Fellow at Caltech's Center for the Mathematics of Information. His honors include the Johns Hopkins Alumni Association Teaching Award (2021), NSF CAREER Award (2018), Caltech's CMI Fellowship (2014), and Cornell ECE Director's Thesis Award (2014). His research spans control and dynamical systems, machine learning, and optimization, with applications to safety-critical systems, networks, and power grids.},
  date = {10/26/2025},
  day = {26},
  event = {INFORMS},
  host = {Laixi Shi (JHU)},
  month = {10},
  role = {Speaker},
  title = {Nonparametric Policy Improvement in Continuous Action Spaces via Expert Demonstrations},
  url = {https://mallada.ece.jhu.edu/talks/202510-Informs.pdf},
  year = {2025}
}

2025-09-26: Interconnection Compliance in High-IBR Grids, 50 Hertz.

As inverter-based resources (IBRs) become dominant in modern power grids, interconnection compliance faces growing challenges driven by reduced inertia, weak grid conditions, and limited transparency of proprietary inverter controls. A particularly critical concern is the emergence of sub-synchronous oscillations (SSOs), which have been observed across a wide range of systems and operating conditions. Existing compliance practices, largely based on static screening metrics and ad hoc dynamic studies, struggle to account for inverter heterogeneity, operating-point dependence, and uncertainty while remaining technology-agnostic. This talk revisits the mechanisms underlying IBR-induced SSOs and argues that their onset can be reliably characterized using linearized small-signal and impedance-based models. Building on this foundation, we present a robust, decentralized stability analysis framework that enables certifiable stability margins using black-box inverter models and local testing. The resulting criteria explicitly expose a trade-off between robustness and operational efficiency, clarifying how conservative compliance requirements can restrict dispatch flexibility, while permissive rules risk instability. These insights point toward stability-aware, scalable approaches for interconnection compliance in high-IBR power systems.

@talk{50hertz25,
  abstract = {As inverter-based resources (IBRs) become dominant in modern power grids, interconnection compliance faces growing challenges driven by reduced inertia, weak grid conditions, and limited transparency of proprietary inverter controls. A particularly critical concern is the emergence of sub-synchronous oscillations (SSOs), which have been observed across a wide range of systems and operating conditions. Existing compliance practices---largely based on static screening metrics and ad hoc dynamic studies---struggle to account for inverter heterogeneity, operating-point dependence, and uncertainty while remaining technology-agnostic.

This talk revisits the mechanisms underlying IBR-induced SSOs and argues that their onset can be reliably characterized using linearized small-signal and impedance-based models. Building on this foundation, we present a robust, decentralized stability analysis framework that enables certifiable stability margins using black-box inverter models and local testing. The resulting criteria explicitly expose a trade-off between robustness and operational efficiency, clarifying how conservative compliance requirements can restrict dispatch flexibility, while permissive rules risk instability. These insights point toward stability-aware, scalable approaches for interconnection compliance in high-IBR power systems.},
  annote = {Enrique Mallada is an Associate Professor of Electrical and Computer Engineering at Johns Hopkins University, where he has been a faculty member since 2016. He received his Ph.D. in Electrical and Computer Engineering with a minor in Applied Mathematics from Cornell University and a Telecommunications Engineering degree from ORT University, Uruguay. Before joining Hopkins, he was a Postdoctoral Fellow at Caltech's Center for the Mathematics of Information. His honors include the Johns Hopkins Alumni Association Teaching Award (2021), NSF CAREER Award (2018), Caltech's CMI Fellowship (2014), and Cornell ECE Director's Thesis Award (2014). His research spans control and dynamical systems, machine learning, and optimization, with applications to safety-critical systems, networks, and power grids.},
  date = {09/26/2025},
  day = {26},
  event = {50 Hertz},
  host = {Mark O'Malley},
  month = {09},
  role = {Speaker},
  title = {Interconnection Compliance in High-IBR Grids},
  url = {https://mallada.ece.jhu.edu/talks/202509-50Hertz.pdf},
  year = {2025}
}

2025-09-19: Nonparametric Analysis and Control of Dynamical Systems: Stability, Safety and Policy Improvement, Texas A&M.

This talk presents a novel nonparametric framework for analyzing dynamical systems and synthesizing control policies that relies purely on trajectory data and is designed to exploit GPU parallelization for scalability. The key insight behind this work is to relax strict objectives, such as invariance and optimality, and replace them with weaker conditions that enable a flexible trade-off between accuracy, computational complexity, and sample efficiency. First, we introduce the concept of recurrence, a relaxation of invariance that allows trajectories to leave a set temporarily before returning within a finite time. This relaxed condition serves as a functional substitute for invariance and provides an alternative foundation for analyzing dynamical systems. By leveraging recurrence, we develop integral Lyapunov and barrier function conditions, where function values are required to be eventually monotonic over a finite time window rather than strictly increasing or decreasing. This relaxation offers a more flexible framework for stability and safety verification, enabling a trade-off between verification accuracy and computational complexity. Next, we turn to the policy optimization problem and introduce a class of nonparametric policies designed for continuous action spaces. These policies rely purely on (expert) trajectory data to construct a nonparametric lower bound, Q_lb, on the optimal action-value function Q^⋆. Crucially, we show that this policy representation admits a policy improvement theorem, overcoming a key limitation faced by function approximation methods in continuous action spaces. Building on this result, we develop a practical algorithm that drives continual policy improvement by selectively incorporating new expert demonstrations, ensuring efficient data use while achieving monotonic performance gains.

@talk{texas-am25,
  abstract = {This talk presents a novel nonparametric framework for analyzing dynamical systems and synthesizing control policies that relies purely on trajectory data and is designed to exploit GPU parallelization for scalability. The key insight behind this work is to relax strict objectives, such as invariance and optimality, and replace them with weaker conditions that enable a flexible trade-off between accuracy, computational complexity, and sample efficiency.

First, we introduce the concept of recurrence, a relaxation of invariance that allows trajectories to leave a set temporarily before returning within a finite time. This relaxed condition serves as a functional substitute for invariance and provides an alternative foundation for analyzing dynamical systems. By leveraging recurrence, we develop integral Lyapunov and barrier function conditions, where function values are required to be eventually monotonic over a finite time window rather than strictly increasing or decreasing. This relaxation offers a more flexible framework for stability and safety verification, enabling a trade-off between verification accuracy and computational complexity.

Next, we turn to the policy optimization problem and introduce a class of nonparametric policies designed for continuous action spaces. These policies rely purely on (expert) trajectory data to construct a nonparametric lower bound, Q_lb, on the optimal action-value function Q^⋆. Crucially, we show that this policy representation admits a policy improvement theorem, overcoming a key limitation faced by function approximation methods in continuous action spaces. Building on this result, we develop a practical algorithm that drives continual policy improvement by selectively incorporating new expert demonstrations, ensuring efficient data use while achieving monotonic performance gains.},
  annote = {Enrique Mallada is an Associate Professor of Electrical and Computer Engineering at Johns Hopkins University, where he has been a faculty member since 2016. He received his Ph.D. in Electrical and Computer Engineering with a minor in Applied Mathematics from Cornell University and a Telecommunications Engineering degree from ORT University, Uruguay. Before joining Hopkins, he was a Postdoctoral Fellow at Caltech's Center for the Mathematics of Information. His honors include the Johns Hopkins Alumni Association Teaching Award (2021), NSF CAREER Award (2018), Caltech's CMI Fellowship (2014), and Cornell ECE Director's Thesis Award (2014). His research spans control and dynamical systems, machine learning, and optimization, with applications to safety-critical systems, networks, and power grids.},
  date = {09/19/2025},
  day = {19},
  event = {Texas A\&M},
  host = {Alfredo Garcia},
  month = {09},
  role = {Lecturer},
  title = {Nonparametric Analysis and Control of Dynamical Systems: Stability, Safety and Policy Improvement},
  url = {https://mallada.ece.jhu.edu/talks/202509-Texas-AM.pdf},
  year = {2025}
}

2025-07-09: Nonparametric Analysis and Control of Dynamical Systems: Stability, Safety and Policy Improvement, Chinese University of Hong Kong @ Shenzhen.
[BibTeX] [Abstract] [Download PDF]

This talk presents a novel nonparametric framework for analyzing dynamical systems and synthesizing control policies that relies purely on trajectory data and is designed to exploit GPU parallelization for scalability. The key insight behind this work is to relax strict objectives, such as invariance and optimality, and replace them with weaker conditions that enable a flexible trade-off between accuracy, computational complexity, and sample efficiency. First, we introduce the concept of recurrence, a relaxation of invariance that allows trajectories to leave a set temporarily before returning within a finite time. This relaxed condition serves as a functional substitute for invariance and provides an alternative foundation for analyzing dynamical systems. By leveraging recurrence, we develop integral Lyapunov and barrier function conditions, where function values are required to be eventually monotonic over a finite time window rather than strictly increasing or decreasing. This relaxation offers a more flexible framework for stability and safety verification, enabling a trade-off between verification accuracy and computational complexity. Next, we turn to the policy optimization problem and introduce a class of nonparametric policies designed for continuous action spaces. These policies rely purely on (expert) trajectory data to construct a nonparametric lower bound, Q_lb, on the optimal action-value function Q^⋆. Crucially, we show that this policy representation admits a policy improvement theorem, overcoming a key limitation faced by function approximation methods in continuous action spaces. Building on this result, we develop a practical algorithm that drives continual policy improvement by selectively incorporating new expert demonstrations, ensuring efficient data use while achieving monotonic performance gains.

@talk{cuhk-sz25,
  abstract = {This talk presents a novel nonparametric framework for analyzing dynamical systems and synthesizing control policies that relies purely on trajectory data and is designed to exploit GPU parallelization for scalability. The key insight behind this work is to relax strict objectives, such as invariance and optimality, and replace them with weaker conditions that enable a flexible trade-off between accuracy, computational complexity, and sample efficiency.

First, we introduce the concept of recurrence, a relaxation of invariance that allows trajectories to leave a set temporarily before returning within a finite time. This relaxed condition serves as a functional substitute for invariance and provides an alternative foundation for analyzing dynamical systems. By leveraging recurrence, we develop integral Lyapunov and barrier function conditions, where function values are required to be eventually monotonic over a finite time window rather than strictly increasing or decreasing. This relaxation offers a more flexible framework for stability and safety verification, enabling a trade-off between verification accuracy and computational complexity.

Next, we turn to the policy optimization problem and introduce a class of nonparametric policies designed for continuous action spaces. These policies rely purely on (expert) trajectory data to construct a nonparametric lower bound, Q_lb, on the optimal action-value function Q^⋆. Crucially, we show that this policy representation admits a policy improvement theorem, overcoming a key limitation faced by function approximation methods in continuous action spaces. Building on this result, we develop a practical algorithm that drives continual policy improvement by selectively incorporating new expert demonstrations, ensuring efficient data use while achieving monotonic performance gains.},
  annote = {Enrique Mallada is an Associate Professor of Electrical and Computer Engineering at Johns Hopkins University, where he has been a faculty member since 2016. He received his Ph.D. in Electrical and Computer Engineering with a minor in Applied Mathematics from Cornell University and a Telecommunications Engineering degree from ORT University, Uruguay. Before joining Hopkins, he was a Postdoctoral Fellow at Caltech's Center for the Mathematics of Information. His honors include the Johns Hopkins Alumni Association Teaching Award (2021), NSF CAREER Award (2018), Caltech's CMI Fellowship (2014), and Cornell ECE Director's Thesis Award (2014). His research spans control and dynamical systems, machine learning, and optimization, with applications to safety-critical systems, networks, and power grids.},
  date = {07/09/2025},
  day = {09},
  event = {Chinese University of Hong Kong @ Shenzhen},
  host = {Yan Jiang},
  month = {07},
  role = {Lecturer},
  title = {Nonparametric Analysis and Control of Dynamical Systems: Stability, Safety and Policy Improvement},
  url = {https://mallada.ece.jhu.edu/talks/202507-CUHK-SZ.pdf},
  year = {2025}
}

Publications Snapshot

A complete list of publications can be found here or on my Google Scholar profile.

Preprints

A. Castellano, S. Pan, and E. Mallada, Data-driven Acceleration of MPC with Guarantees, 2025, submitted.
[BibTeX] [Abstract] [Download PDF]

Model Predictive Control (MPC) is a powerful framework for optimal control but can be too slow for low-latency applications. We present a data-driven framework to accelerate MPC by replacing online optimization with a nonparametric policy constructed from offline MPC solutions. Our policy is greedy with respect to a constructed upper bound on the optimal cost-to-go, and can be implemented as a nonparametric lookup rule that is orders of magnitude faster than solving MPC online. Our analysis shows that, under a sufficient coverage condition on the offline data, the policy is recursively feasible and admits a provable, bounded optimality gap. These conditions establish an explicit trade-off between the amount of data collected and the tightness of the bounds. Our experiments show that this policy is between $100$ and $1000$ times faster than standard MPC, with only a modest loss in optimality, showing potential for real-time control tasks.

@unpublished{cpm2025a-preprint,
  abstract = {Model Predictive Control (MPC) is a powerful framework for optimal control but can be too slow for low-latency applications. We present a data-driven framework to accelerate MPC by replacing online optimization with a nonparametric policy constructed from offline MPC solutions. Our policy is greedy with respect to a constructed upper bound on the optimal cost-to-go, and can be implemented as a nonparametric lookup rule that is orders of magnitude faster than solving MPC online. Our analysis shows that, under a sufficient coverage condition on the offline data, the policy is recursively feasible and admits a provable, bounded optimality gap. These conditions establish an explicit trade-off between the amount of data collected and the tightness of the bounds. Our experiments show that this policy is between $100$ and $1000$ times faster than standard MPC, with only a modest loss in optimality, showing potential for real-time control tasks.},
  author = {Castellano, Agustin and Pan, Shijie and Mallada, Enrique},
  grants = {CPS-2136324; Global-Centers-2330450},
  month = {11},
  organization = {PMLR},
  title = {Data-driven Acceleration of MPC with Guarantees},
  url = {https://mallada.ece.jhu.edu/pubs/2025-Preprint-CPM.pdf},
  year = {2025, submitted}
}

G. H. Oral, K. Prabakar, D. Anand, S. Ganguly, and E. Mallada, Delayed-Stability Evaluation and Experimental Validation of Grid-Forming Inverter Dynamic Power-Hardware-in-the-Loop Tests, 2025, submitted.
[BibTeX] [Abstract] [Download PDF]

Grid-forming (GFM) inverters can sustain a large envelope of grid dynamics and provide resilience to outages and contingencies. Sophisticated control algorithms and the expanded functional use cases of GFM inverters do not have standardized testing protocols, which makes their power hardware-in-the-loop (PHIL) validation particularly valuable. However, ensuring stable GFM PHIL tests is challenging, as various experimental artifacts can be sources of destabilizing excitations for a GFM control loop. In this paper, we present methodologies to address challenges in performing GFM PHIL tests, and provide numerical analyses and experimental results that validate our approach. We first discuss the choice of empirical parameters and the tuning of closed-loop controllers that improve the stability and tracking performance of GFM PHIL experiments. We then provide analytical and numerical calculations evaluating the robustness of the closed-loop setup to experimental artifacts, particularly delays, while accounting for destabilizing dynamic modes of the physical PHIL interconnection. Our experimental results validate closed-loop stability and tracking performance for PHIL tests of various operational modes of GFMs.

@unpublished{opagm2025a-preprint,
  abstract = {Grid-forming (GFM) inverters can sustain a large envelope of grid dynamics and provide resilience to outages and contingencies. Sophisticated control algorithms and the expanded functional use cases of GFM inverters do not have standardized testing protocols, which makes their power hardware-in-the-loop (PHIL) validation particularly valuable. However, ensuring stable GFM PHIL tests is challenging, as various experimental artifacts can be sources of destabilizing excitations for a GFM control loop. In this paper, we present methodologies to address challenges in performing GFM PHIL tests, and provide numerical analyses and experimental results that validate our approach. We first discuss the choice of empirical parameters and the tuning of closed-loop controllers that improve the stability and tracking performance of GFM PHIL experiments. We then provide analytical and numerical calculations evaluating the robustness of the closed-loop setup to experimental artifacts, particularly delays, while accounting for destabilizing dynamic modes of the physical PHIL interconnection. Our experimental results validate closed-loop stability and tracking performance for PHIL tests of various operational modes of GFMs.},
  author = {Oral, H. Giray and Prabakar, Kumaraguru and Anand, Dhananjay and Ganguly, Subhankar and Mallada, Enrique},
  grants = {Global-Centers-2330450},
  month = {11},
  pages = {1--13},
  record = {submitted Nov 2025},
  title = {Delayed-Stability Evaluation and Experimental Validation of Grid-Forming Inverter Dynamic Power-Hardware-in-the-Loop Tests},
  url = {https://mallada.ece.jhu.edu/pubs/2025-Preprint-OPAGM.pdf},
  year = {2025, submitted}
}

R. Siegelmann, Y. Shen, F. Paganini, and E. Mallada, Stability Analysis and Data-driven Verification via Recurrent Lyapunov Functions, 2025, submitted.
[BibTeX] [Abstract] [Download PDF]

Lyapunov’s direct method is an instrumental tool that provides a rigorous framework for stability analysis and control design for dynamical systems. A critical step that enables the application of the method is the availability of a Lyapunov function $V$, a function whose value monotonically decreases along the trajectories of the dynamical system. Unfortunately, finding a Lyapunov function is often tricky and requires ingenuity, domain knowledge, or significant computational power. At the core of this challenge is the fact that the method requires every sub-level set of $V$ ($V_{\leq c}$) to be forward invariant, thus implicitly coupling the geometry of $V_{\leq c}$ and the trajectories of the system. In this paper, we seek to disentangle this dependence by developing a direct method that substitutes the concept of invariance with the more flexible notion of recurrence. A set is ($τ$-)recurrent if every trajectory that starts in the set returns to it (within $τ$ seconds). We show that, under mild conditions, the recurrence of sub-level sets $V_{\leq c}$ is sufficient to guarantee stability and introduce the appropriate stronger notions to obtain asymptotic stability and exponential stability. Most notably, we provide norm-agnostic converse theorems showing that, under mild conditions, any norm satisfies our relaxed stability conditions, provided one is willing to certify a slightly weaker stability condition. We further develop GPU-based algorithms that can verify (practical) stability notions using purely trajectory data, and without the need to compute a Lyapunov function. Our analysis and methods further highlight an intrinsic trade-off between the sample/computational complexity and the certified performance that our algorithms navigate.

@unpublished{sspm2025a-preprint,
  abstract = {Lyapunov's direct method is an instrumental tool that provides a rigorous framework for stability analysis and control design for dynamical systems. A critical step that enables the application of the method is the availability of a Lyapunov function $V$, a function whose value monotonically decreases along the trajectories of the dynamical system. Unfortunately, finding a Lyapunov function is often tricky and requires ingenuity, domain knowledge, or significant computational power. At the core of this challenge is the fact that the method requires every sub-level set of $V$ ($V_{\leq c}$) to be forward invariant, thus implicitly coupling the geometry of $V_{\leq c}$ and the trajectories of the system. In this paper, we seek to disentangle this dependence by developing a direct method that substitutes the concept of invariance with the more flexible notion of recurrence. A set is ($τ$-)recurrent if every trajectory that starts in the set returns to it (within $τ$ seconds). We show that, under mild conditions, the recurrence of sub-level sets $V_{\leq c}$ is sufficient to guarantee stability and introduce the appropriate stronger notions to obtain asymptotic stability and exponential stability.
Most notably, we provide norm-agnostic converse theorems showing that, under mild conditions, any norm satisfies our relaxed stability conditions, provided one is willing to certify a slightly weaker stability condition.
We further develop GPU-based algorithms that can verify (practical) stability notions using purely trajectory data, and without the need to compute a Lyapunov function.
Our analysis and methods further highlight an intrinsic trade-off between the sample/computational complexity and the certified performance that our algorithms navigate.},
  author = {Siegelmann, Roy and Shen, Yue and Paganini, Fernando and Mallada, Enrique},
  grants = {Global-Centers-2330450},
  month = {07},
  title = {Stability Analysis and Data-driven Verification via Recurrent Lyapunov Functions},
  url = {https://mallada.ece.jhu.edu/pubs/2025-Preprint-SSPM.pdf},
  year = {2025, submitted}
}

R. K. Bansal, E. Mallada, and P. Hidalgo-Gonzalez, A Market Mechanism for a Two‐stage Settlement Electricity Market with Energy Storage, 2025, submitted.
[BibTeX] [Abstract] [Download PDF]

The main goal of a sequential two-stage electricity market (e.g., day-ahead and real-time markets) is to operate efficiently. However, the price difference across stages due to inadequate competition and unforeseen circumstances leads to undesirable price manipulation. To mitigate this, some Independent System Operators (ISOs) proposed system-level market power mitigation (MPM) policies in addition to existing local policies. These policies aim to substitute noncompetitive bids with a default bid based on estimated generator costs. However, these policies may lead to unintended consequences when implemented without accounting for the conflicting interests of participants. In this paper, we model the competition between generators (bidding supply functions) and loads (bidding quantity) in a two-stage market with a stage-wise MPM policy. An equilibrium analysis shows that a real-time MPM policy leads to equilibrium loss, meaning no stable market outcome (Nash equilibrium) exists. A day-ahead MPM policy, by contrast, leads to a Stackelberg-Nash game with loads acting as leaders and generators as followers. In this setting, loads become winners, i.e., their aggregate payment is always less than competitive payments. Moreover, comparison with standard market equilibrium highlights that markets are better off without such policies. Finally, numerical studies highlight the impact of heterogeneity and load size on market equilibrium.

@unpublished{bmh2025a-preprint,
  abstract = {The main goal of a sequential two-stage electricity market (e.g., day-ahead and real-time markets) is to operate efficiently. However, the price difference across stages due to inadequate competition and unforeseen circumstances leads to undesirable price manipulation. To mitigate this, some Independent System Operators (ISOs) proposed system-level market power mitigation (MPM) policies in addition to existing local policies. These policies aim to substitute noncompetitive bids with a default bid based on estimated generator costs. However, these policies may lead to unintended consequences when implemented without accounting for the conflicting interests of participants. In this paper, we model the competition between generators (bidding supply functions) and loads (bidding quantity) in a two-stage market with a stage-wise MPM policy. An equilibrium analysis shows that a real-time MPM policy leads to equilibrium loss, meaning no stable market outcome (Nash equilibrium) exists. A day-ahead MPM policy, by contrast, leads to a Stackelberg-Nash game with loads acting as leaders and generators as followers. In this setting, loads become winners, i.e., their aggregate payment is always less than competitive payments. Moreover, comparison with standard market equilibrium highlights that markets are better off without such policies. Finally, numerical studies highlight the impact of heterogeneity and load size on market equilibrium.},
  author = {Bansal, Rajni Kant and Mallada, Enrique and Hidalgo-Gonzalez, Patricia},
  bdsk-url-3 = {https://doi.org/10.1109/TEMPR.2023.3318149},
  grants = {CPS-2136324, EPICS-2330450},
  month = {7},
  pages = {1-10},
  record = {submitted Jul 2025},
  title = {A Market Mechanism for a Two‐stage Settlement Electricity Market with Energy Storage},
  url = {https://mallada.ece.jhu.edu/pubs/2025-Preprint-BMH.pdf},
  year = {2025, submitted}
}

J. Liu and E. Mallada, Safety-Critical Control via Recurrent Tracking Functions, 2025, submitted.
[BibTeX] [Abstract] [Download PDF]

This paper addresses the challenge of synthesizing safety-critical controllers for high-order nonlinear systems, where constructing valid Control Barrier Functions (CBFs) remains computationally intractable. Leveraging layered control, we design CBFs in reduced-order models (RoMs) while regulating full-order models’ (FoMs) dynamics at the same time. Traditional Lyapunov tracking functions are required to decrease monotonically, but systematic synthesis methods for such functions exist only for fully-actuated systems. To overcome this limitation, we introduce Recurrent Tracking Functions (RTFs), which replace the monotonic decay requirement with a weaker finite-time recurrence condition. This relaxation permits transient deviations of tracking errors while ensuring safety. By augmenting CBFs for RoMs with RTFs, we construct recurrent CBFs (RCBFs) whose zero-superlevel set is control $τ$-recurrent, and guarantee safety for all initial states in such a set when RTFs are satisfied. We establish theoretical safety guarantees and validate the approach through numerical experiments, demonstrating RTFs’ effectiveness and the safety of FoMs.

@unpublished{lm2025a-preprint,
  abstract = {This paper addresses the challenge of synthesizing safety-critical controllers for high-order nonlinear systems, where constructing valid Control Barrier Functions (CBFs) remains computationally intractable. Leveraging layered control, we design CBFs in reduced-order models (RoMs) while regulating full-order models' (FoMs) dynamics at the same time. Traditional Lyapunov tracking functions are required to decrease monotonically, but systematic synthesis methods for such functions exist only for fully-actuated systems. To overcome this limitation, we introduce Recurrent Tracking Functions (RTFs), which replace the monotonic decay requirement with a weaker finite-time recurrence condition. This relaxation permits transient deviations of tracking errors while ensuring safety. By augmenting CBFs for RoMs with RTFs, we construct recurrent CBFs (RCBFs) whose zero-superlevel set is control $τ$-recurrent, and guarantee safety for all initial states in such a set when RTFs are satisfied. We establish theoretical safety guarantees and validate the approach through numerical experiments, demonstrating RTFs' effectiveness and the safety of FoMs.},
  author = {Liu, Jixian and Mallada, Enrique},
  bdsk-url-3 = {https://doi.org/10.23919/ACC55779.2023.10156212},
  grants = {Global-Centers-2330450},
  month = {9},
  pages = {1-7},
  record = {submitted Sep 2025},
  title = {Safety-Critical Control via Recurrent Tracking Functions},
  url = {https://mallada.ece.jhu.edu/pubs/2025-Preprint-LM.pdf},
  year = {2025, submitted}
}

R. Siegelmann and E. Mallada, Data-driven Practical Stabilization of Nonlinear Systems via Chain Policies: Sample Complexity and Incremental Learning, 2025, submitted.
[BibTeX] [Abstract] [Download PDF]

We propose a method for data-driven practical stabilization of nonlinear systems with provable guarantees, based on the concept of Nonparametric Chain Policies (NCPs). The approach employs a normalized nearest-neighbor rule to assign, at each state, a finite-duration control signal derived from stored data, after which the process repeats. Unlike recent works that model the system as linear, polynomial, or polynomial fraction, we only assume the system to be locally Lipschitz. Our analysis builds on the framework of Recurrent Lyapunov Functions (RLFs), which enable data-driven certification of (practical) stability using standard norm functions instead of requiring the explicit construction of a classical Lyapunov function. To extend this framework, we introduce the concept of Recurrent Control Lyapunov Functions (R-CLFs), which can certify the existence of an NCP that practically stabilizes an arbitrarily small $c$-neighborhood of an equilibrium point. We also provide an explicit sample complexity guarantee of $\mathcal{O}\!\left((3/\rho)^d \log(R/c)\right)$ trajectories, where $R$ is the domain radius, $d$ the state dimension, and $\rho$ a system-dependent constant. The proposed Chain Policies are nonparametric, thus allowing new verified data to be readily incorporated into the policy to either improve convergence rate or enlarge the certified region. Numerical experiments illustrate and validate these properties.

@unpublished{sm2025a-preprint,
  abstract = {We propose a method for data-driven practical stabilization of nonlinear systems with provable guarantees, based on the concept of \emph{Nonparametric Chain Policies} (NCPs). The approach employs a normalized nearest-neighbor rule to assign, at each state, a finite-duration control signal derived from stored data, after which the process repeats.
Unlike recent works that model the system as linear, polynomial, or polynomial fraction, we only assume the system to be locally Lipschitz.
Our analysis builds on the framework of Recurrent Lyapunov Functions (RLFs), which enable data-driven certification of (practical) stability using standard norm functions instead of requiring the explicit construction of a classical Lyapunov function. To extend this framework, we introduce the concept of Recurrent Control Lyapunov Functions (R-CLFs), which can certify the existence of an NCP that practically stabilizes an arbitrarily small $c$-neighborhood of an equilibrium point.
We also provide an explicit sample complexity guarantee of $\mathcal{O}\!\left((3/\rho)^d \log(R/c)\right)$ trajectories, where $R$ is the domain radius, $d$ the state dimension, and $\rho$ a system-dependent constant. The proposed Chain Policies are nonparametric, thus allowing new verified data to be readily incorporated into the policy to either improve convergence rate or enlarge the certified region. Numerical experiments illustrate and validate these properties.},
  author = {Siegelmann, Roy and Mallada, Enrique},
  bdsk-url-3 = {https://doi.org/10.23919/ACC55779.2023.10156212},
  grants = {Global-Centers-2330450},
  month = {9},
  pages = {1-8},
  record = {submitted Sep 2025},
  title = {Data-driven Practical Stabilization of Nonlinear Systems via Chain Policies: Sample Complexity and Incremental Learning},
  url = {https://mallada.ece.jhu.edu/pubs/2025-Preprint-SgM.pdf},
  year = {2025, submitted}
}

P. You, Y. Liu, and E. Mallada, A Unified Analysis of Saddle Flow Dynamics: Stability and Algorithm Design, 2024, submitted.
[BibTeX] [Abstract] [Download PDF]

This work examines the conditions for asymptotic and exponential convergence of saddle flow dynamics of convex-concave functions. First, we propose an observability-based certificate for asymptotic convergence, directly bridging the gap between the invariant set in a LaSalle argument and the equilibrium set of saddle flows. This certificate generalizes conventional conditions for convergence, e.g., strict convexity-concavity, and leads to a novel state-augmentation method that requires minimal assumptions for asymptotic convergence. We also show that global exponential stability follows from strong convexity-strong concavity, providing a lower-bound estimate of the convergence rate. This insight also explains the convergence of proximal saddle flows for strongly convex-concave objective functions. Our results generalize to dynamics with projections on the vector field and have applications in solving constrained convex optimization via primal-dual methods. Based on these insights, we study four algorithms built upon different Lagrangian function transformations. We validate our work by applying these methods to solve a network flow optimization and a Lasso regression problem.

@unpublished{ylm2024a-preprint,
  abstract = {This work examines the conditions for asymptotic and exponential convergence of saddle flow dynamics of convex-concave functions. First, we propose an observability-based certificate for asymptotic convergence, directly bridging the gap between the invariant set in a LaSalle argument and the equilibrium set of saddle flows. This certificate generalizes conventional conditions for convergence, e.g., strict convexity-concavity, and leads to a novel state-augmentation method that requires minimal assumptions for asymptotic convergence. We also show that global exponential stability follows from strong convexity-strong concavity, providing a lower-bound estimate of the convergence rate. This insight also explains the convergence of proximal saddle flows for strongly convex-concave objective functions. Our results generalize to dynamics with projections on the vector field and have applications in solving constrained convex optimization via primal-dual methods. Based on these insights, we study four algorithms built upon different Lagrangian function transformations. We validate our work by applying these methods to solve a network flow optimization and a Lasso regression problem.},
  author = {You, Pengcheng and Liu, Yingzhu and Mallada, Enrique},
  bdsk-url-3 = {https://mallada.ece.jhu.edu/pubs/2024-Preprint-YLM.pdf},
  grants = {CAREER-1752362, CPS-2136324, Global-Centers-2330450},
  month = {9},
  pages = {1-16},
  title = {A Unified Analysis of Saddle Flow Dynamics: Stability and Algorithm Design},
  url = {https://mallada.ece.jhu.edu/pubs/2024-Preprint-YLM.pdf},
  year = {2024, submitted}
}

P. You, M. Fernandez, D. F. Gayme, and E. Mallada, Mixed Supply Function and Quantity Bidding in Two-Stage Settlement Markets, 2023, under revision, submitted Mar 2023.
[BibTeX] [Abstract] [Download PDF]

Motivated by electricity markets, we study the incentives of heterogeneous participants (firms and consumers) in a two-stage settlement market with a mixed bidding mechanism, in which firms participate using supply function bids and consumers use quantity bids. We carry out an equilibrium analysis of the market outcome and obtain closed-form solutions. The characterization of the equilibria allows us to gain insights into the market-power implications of mixed bidding and uncover the importance of accounting for consumers’ strategic behavior in a two-stage market, even when their demand is completely inelastic with respect to price. We show that strategic consumers are able to exploit firms’ strategic behavior to maintain a systematic difference between the forward and spot prices, with the latter being higher. Notably, such a strategy does bring down consumer payment and undermines the supply-side market power. However, it is only effective when firms are behaving strategically. We also observe situations where firms lose profit by behaving strategically, a sign that the conventional supply-side market power has been overturned. Our results further suggest that market competition has a heterogeneous impact across consumer sizes, particularly benefiting small consumers. Our analysis can accommodate other market policies, and we demonstrate this versatility by examining the impact of some example policies, including virtual bidding, on the market outcome.

@unpublished{yfgm2023a-preprint,
  abstract = {Motivated by electricity markets, we study the incentives of heterogeneous participants (firms and consumers) in a two-stage settlement market with a mixed bidding mechanism, in which firms participate using supply function bids and consumers use quantity bids. We carry out an equilibrium analysis of the market outcome and obtain closed-form solutions. The characterization of the equilibria allows us to gain insights into the market-power implications of mixed bidding and uncover the importance of accounting for consumers' strategic behavior in a two-stage market, even when their demand is completely inelastic with respect to price. We show that strategic consumers are able to exploit firms' strategic behavior to maintain a systematic difference between the forward and spot prices, with the latter being higher. Notably, such a strategy does bring down consumer payment and undermines the supply-side market power. However, it is only effective when firms are behaving strategically. We also observe situations where firms lose profit by behaving strategically, a sign that the conventional supply-side market power has been overturned. Our results further suggest that market competition has a heterogeneous impact across consumer sizes, particularly benefiting small consumers. Our analysis can accommodate other market policies, and we demonstrate this versatility by examining the impact of some example policies, including virtual bidding, on the market outcome.},
  author = {You, Pengcheng and Fernandez, Marcelo and Gayme, Dennice F. and Mallada, Enrique},
  bdsk-url-3 = {https://mallada.ece.jhu.edu/pubs/2023-Preprint-YFGM.pdf},
  grants = {CAREER-1752362;TRIPODS-1934979;CPS-2136324},
  month = {8},
  pages = {1-45},
  title = {Mixed Supply Function and Quantity Bidding in Two-Stage Settlement Markets},
  url = {https://mallada.ece.jhu.edu/pubs/2023-Preprint-YFGM.pdf},
  year = {2023, under revision, submitted Mar 2023}
}

T. Zheng, J. W. Simpson-Porco, and E. Mallada, Closed-Loop Motion Planning for Differentially Flat Systems: A Time-Varying Optimization Framework, 2023, submitted.
[BibTeX] [Abstract] [Download PDF]

Motion planning and control are two core components of the robotic systems autonomy stack. The standard approach to combine these methodologies comprises an offline/open-loop stage, planning, that designs a feasible and safe trajectory to follow, and an online/closed-loop stage, tracking, that corrects for unmodeled dynamics and disturbances. Such an approach generally introduces conservativeness into the planning stage, which becomes difficult to overcome as the model complexity increases and real-time decisions need to be made in a changing environment. This work addresses these challenges for the class of differentially flat nonlinear systems by integrating planning and control into a cohesive closed-loop task. Precisely, we develop an optimization-based framework that aims to steer a differentially flat system to a trajectory implicitly defined via a constrained time-varying optimization problem. To that end, we generalize the notion of feedback linearization, which makes non-linear systems behave as linear systems, and develop controllers that effectively transform a differentially flat system into an optimization algorithm that seeks to find the optimal solution of a (possibly time-varying) optimization problem. Under sufficient regularity assumptions, we prove global asymptotic convergence for the optimization dynamics to the minimizer of the time-varying optimization problem. We illustrate the effectiveness of our method with two numerical examples: a multi-robot tracking problem and a robot obstacle avoidance problem.

@unpublished{zsm2023a-preprint,
  abstract = {Motion planning and control are two core components of the robotic systems autonomy stack. The standard approach to combine these methodologies comprises an offline/open-loop stage, planning, that designs a feasible and safe trajectory to follow, and an online/closed-loop stage, tracking, that corrects for unmodeled dynamics and disturbances. Such an approach generally introduces conservativeness into the planning stage, which becomes difficult to overcome as the model complexity increases and real-time decisions need to be made in a changing environment. This work addresses these challenges for the class of differentially flat nonlinear systems by integrating planning and control into a cohesive closed-loop task. Precisely, we develop an optimization-based framework that aims to steer a differentially flat system to a trajectory implicitly defined via a constrained time-varying optimization problem. To that end, we generalize the notion of feedback linearization, which makes non-linear systems behave as linear systems, and develop controllers that effectively transform a differentially flat system into an optimization algorithm that seeks to find the optimal solution of a (possibly time-varying) optimization problem. Under sufficient regularity assumptions, we prove global asymptotic convergence for the optimization dynamics to the minimizer of the time-varying optimization problem. We illustrate the effectiveness of our method with two numerical examples: a multi-robot tracking problem and a robot obstacle avoidance problem.},
  author = {Zheng, Tianqi and Simpson-Porco, John W. and Mallada, Enrique},
  bdsk-url-3 = {https://mallada.ece.jhu.edu/pubs/2023-Preprint-ZSM.pdf},
  grants = {CAREER-1752362,CPS-2136324,EPICS-2330450},
  month = {10},
  pages = {1-14},
  title = {Closed-Loop Motion Planning for Differentially Flat Systems: A Time-Varying Optimization Framework},
  url = {https://mallada.ece.jhu.edu/pubs/2023-Preprint-ZSM.pdf},
  year = {2023, submitted}
}

Recent Journals

H. Sibai and E. Mallada, “Recurrence of Nonlinear Control Systems: Entropy, Bit Rates, and Finite Alphabets,” Nonlinear Analysis: Hybrid Systems, vol. 59, iss. 101649, 2026. doi:https://doi.org/10.1016/j.nahs.2025.101649
[BibTeX] [Abstract] [Download PDF]

In this paper, we introduce the notion of recurrence entropy in the context of nonlinear control systems. A set is said to be ($τ$-)recurrent if every trajectory that starts in the set returns to it (within at most $τ$ units of time). The recurrence entropy of a control system quantifies the complexity of making a set $τ$-recurrent, measured by the average rate of growth, as time increases, of the number of control signals required to achieve this goal. Our analysis reveals that, compared to invariance, recurrence is quantitatively less complex, meaning that the recurrence entropy of a set is no larger than, and often strictly smaller than, the invariance entropy. We provide upper and lower bounds on recurrence entropy and show that they converge to the bounds on invariance entropy as $τ$ decreases to zero. Further, our results show that recurrence entropy lower bounds the minimum data rate between the sensor and controller required for achieving recurrence. We present an algorithm according to which the sensor can send state estimates to the controller over a limited-bandwidth channel to achieve recurrence asymptotically at an exponential rate. Finally, we show that, under mildly stricter conditions on the set and dynamics, the control signals that enforce the $τ$-recurrence of a set can be generated by a finite alphabet of control signals of durations of at most $τ$ units of time, which allows us to store them for quick online execution.

@article{sm2026nahs,
  abstract = {In this paper, we introduce the notion of recurrence entropy in the context of nonlinear control systems. A set is said to be ($τ$-)recurrent if every trajectory that starts in the set returns to it (within at most $τ$ units of time). The recurrence entropy of a control system quantifies the complexity of making a set $τ$-recurrent, measured by the average rate of growth, as time increases, of the number of control signals required to achieve this goal. Our analysis reveals that, compared to invariance, recurrence is quantitatively less complex, meaning that the recurrence entropy of a set is no larger than, and often strictly smaller than, the invariance entropy. We provide upper and lower bounds on recurrence entropy and show that they converge to the bounds on invariance entropy as $τ$ decreases to zero. Further, our results show that recurrence entropy lower bounds the minimum data rate between the sensor and controller required for achieving recurrence. We present an algorithm according to which the sensor can send state estimates to the controller over a limited-bandwidth channel to achieve recurrence asymptotically at an exponential rate. Finally, we show that, under mildly stricter conditions on the set and dynamics, the control signals that enforce the $τ$-recurrence of a set can be generated by a finite alphabet of control signals of durations of at most $τ$ units of time, which allows us to store them for quick online execution.},
  author = {Sibai, Hussein and Mallada, Enrique},
  journal = {Nonlinear Analysis: Hybrid Systems},
  doi = {https://doi.org/10.1016/j.nahs.2025.101649},
  grants = {CPS-2136324; Global-Centers-2330450; CAREER-1752362},
  month = {2},
  number = {101649},
  record = {published Feb 2026, online Oct 2025, accepted Oct 2025, submitted Feb 2025},
  title = {Recurrence of Nonlinear Control Systems: Entropy, Bit Rates, and Finite Alphabets},
  url = {https://mallada.ece.jhu.edu/pubs/2026-NAHS-SM.pdf},
  volume = {59},
  year = {2026}
}

R. K. Bansal, P. You, Y. Chen, and E. Mallada, “Counterfactual analysis of default bid market power mitigation strategies in two-stage electricity markets,” European Journal of Operational Research, pp. 1-18, 2025. doi:https://doi.org/10.1016/j.ejor.2025.12.030
[BibTeX] [Abstract] [Download PDF]

Market power remains a persistent challenge in many liberalized electricity markets worldwide, driving the adoption of ex-ante and ex-post mitigation measures. Despite locational mitigation tools (e.g., cost-based reference levels or default energy bids), evidence of price manipulation has motivated system-level market power mitigation (MPM) policies. However, the full implications of these rules are not well understood, and limited insight into participant behavior can lead to unintended consequences, including increased market power and welfare losses. We study sequentially cleared electricity markets and analyze a two-stage settlement structure commonly used by system operators (e.g., day-ahead and real-time markets in North America). Our focus is on MPM policies that replace noncompetitive generator offers with operator-estimated default bids, and we model competition between generators and loads with inelastic energy requirements who act strategically in allocating demand across stages under real-time, day-ahead, and simultaneous applications of MPM policies. Motivated by the loss of Nash equilibrium under conventional supply-function bidding, we adopt an alternative mechanism in which generators bid the intercept of an affine supply function. Under real-time MPM, strategic interaction in the day-ahead market drives all demand to real time, producing an undesirable outcome. To test robustness, we incorporate demand uncertainty using a variance-penalized expectation framework. Low risk aversion still leads to substantial real-time clearing, while imbalances in risk preferences further amplify market power. Overall, intercept-function bidding combined with day-ahead and simultaneous MPM policies mitigates generator market power more effectively than real-time substitution alone, although these policies shift some market power toward loads.

@article{bcym2025ejor,
  abstract = {Market power remains a persistent challenge in many liberalized electricity markets worldwide, driving the adoption of ex-ante and ex-post mitigation measures. Despite locational mitigation tools (e.g., cost-based reference levels or default energy bids), evidence of price manipulation has motivated system-level market power mitigation (MPM) policies. However, the full implications of these rules are not well understood, and limited insight into participant behavior can lead to unintended consequences, including increased market power and welfare losses. We study sequentially cleared electricity markets and analyze a two-stage settlement structure commonly used by system operators (e.g., day-ahead and real-time markets in North America). Our focus is on MPM policies that replace noncompetitive generator offers with operator-estimated default bids, and we model competition between generators and loads with inelastic energy requirements who act strategically in allocating demand across stages under real-time, day-ahead, and simultaneous applications of MPM policies. Motivated by the loss of Nash equilibrium under conventional supply-function bidding, we adopt an alternative mechanism in which generators bid the intercept of an affine supply function. Under real-time MPM, strategic interaction in the day-ahead market drives all demand to real time, producing an undesirable outcome. To test robustness, we incorporate demand uncertainty using a variance-penalized expectation framework. Low risk aversion still leads to substantial real-time clearing, while imbalances in risk preferences further amplify market power. Overall, intercept-function bidding combined with day-ahead and simultaneous MPM policies mitigates generator market power more effectively than real-time substitution alone, although these policies shift some market power toward loads.},
  author = {Bansal, Rajni Kant and You, Pengcheng and Chen, Yue and Mallada, Enrique},
  doi = {https://doi.org/10.1016/j.ejor.2025.12.030},
  grants = {Global-Centers-2330450},
  issn = {0377-2217},
  journal = {European Journal of Operational Research},
  month = {12},
  pages = {1-18},
  record = {online Dec 2025, accepted Dec 2025, under revision Jan 2024, submitted Aug 2023},
  title = {Counterfactual analysis of default bid market power mitigation strategies in two-stage electricity markets},
  url = {https://mallada.ece.jhu.edu/pubs/2025-EJOR-BCYM.pdf},
  year = {2025}
}

P. You, Y. Jiang, E. Yeung, D. Gayme, and E. Mallada, “On the Stability, Economic Efficiency and Incentive Compatibility of Electricity Market Dynamics,” IEEE Transactions on Automatic Control, vol. 70, iss. 10, pp. 6815-6830, 2025. doi:10.1109/TAC.2025.3589447
[BibTeX] [Abstract] [Download PDF]

This paper focuses on the operation of an electricity market that accounts for participants that bid at a sub-minute timescale. To that end, we model the market-clearing process as a dynamical system, called market dynamics, which is temporally coupled with the grid frequency dynamics and is thus required to guarantee system-wide stability while meeting the system operational constraints. We characterize participants as price-takers who rationally update their bids to maximize their utility in response to real-time schedules of prices and dispatch. For two common bidding mechanisms, based on quantity and price, we identify a notion of alignment between participants’ behavior and planners’ goals that leads to a saddle-based design of the market that guarantees convergence to a point meeting all operational constraints. We further explore cases where this alignment property does not hold and observe that misaligned participants’ bidding can destabilize the closed-loop system. We thus design a regularized version of the market dynamics that recovers all the desirable stability and steady-state performance guarantees. Numerical tests validate our results on the IEEE 39-bus system.

@article{yjygm2025tac,
  abstract = {This paper focuses on the operation of an electricity market that accounts for participants that bid at a sub-minute timescale. To that end, we model the market-clearing process as a dynamical system, called market dynamics, which is temporally coupled with the grid frequency dynamics and is thus required to guarantee system-wide stability while meeting the system operational constraints. We characterize participants as price-takers who rationally update their bids to maximize their utility in response to real-time schedules of prices and dispatch. For two common bidding mechanisms, based on quantity and price, we identify a notion of alignment between participants' behavior and planners' goals that leads to a saddle-based design of the market that guarantees convergence to a point meeting all operational constraints. We further explore cases where this alignment property does not hold and observe that misaligned participants' bidding can destabilize the closed-loop system.  We thus design a regularized version of the market dynamics that recovers all the desirable stability and steady-state performance guarantees. Numerical tests validate our results on the IEEE 39-bus system.},
  author = {You, Pengcheng and Jiang, Yan and Yeung, Enoch and Gayme, Dennice and Mallada, Enrique},
  bdsk-url-3 = {https://mallada.ece.jhu.edu/pubs/2024-TAC-YJYGM.pdf},
  doi = {10.1109/TAC.2025.3589447},
  grants = {CPS-2136324, Global-Centers-2330450},
  journal = {IEEE Transactions on Automatic Control},
  month = {10},
  number = {10},
  pages = {6815-6830},
  record = {published Oct 2025, accepted Aug 2024, revised Dec 2023, submitted Dec 2021},
  title = {On the Stability, Economic Efficiency and Incentive Compatibility of Electricity Market Dynamics},
  url = {https://mallada.ece.jhu.edu/pubs/2025-TAC-YJYGM.pdf},
  volume = {70},
  year = {2025}
}

H. Min, R. Pates, and E. Mallada, “A Frequency Domain Analysis of Slow Coherency in Networked Systems,” Automatica, vol. 74, pp. 1-13, 2025. doi:https://doi.org/10.1016/j.automatica.2025.112184
[BibTeX] [Abstract] [Download PDF]

Network coherence generally refers to the emergence of simple aggregated dynamical behaviors, despite heterogeneity in the dynamics of the network’s subsystems. In this paper, we develop a general frequency domain framework to analyze and quantify the level of network coherence that a system exhibits by relating coherence with a low-rank property of the system’s input-output response. More precisely, for a networked system with linear dynamics and coupling, we show that, as the network’s effective algebraic connectivity grows, the system transfer matrix converges to a rank-one transfer matrix representing the coherent behavior. Interestingly, the non-zero eigenvalue of such a rank-one matrix is given by the harmonic mean of individual nodal dynamics, and we refer to it as coherent dynamics. Our analysis unveils the frequency-dependent nature of coherence and a non-trivial interplay between dynamics and network topology. We further show that many networked systems can exhibit similar coherent behavior by establishing a concentration result in a setting with randomly chosen individual nodal dynamics.

@article{mpm2025automatica,
  abstract = {Network coherence generally refers to the emergence of simple aggregated dynamical behaviors, despite heterogeneity in the dynamics of the network's subsystems. In this paper, we develop a general frequency domain framework to analyze and quantify the level of network coherence that a system exhibits by relating coherence with a low-rank property of the system's input-output response. More precisely, for a networked system with linear dynamics and coupling, we show that, as the network's effective algebraic connectivity grows, the system transfer matrix converges to a rank-one transfer matrix representing the coherent behavior. Interestingly, the non-zero eigenvalue of such a rank-one matrix is given by the harmonic mean of individual nodal dynamics, and we refer to it as coherent dynamics. Our analysis unveils the frequency-dependent nature of coherence and a non-trivial interplay between dynamics and network topology. We further show that many networked systems can exhibit similar coherent behavior by establishing a concentration result in a setting with randomly chosen individual nodal dynamics.},
  author = {Min, Hancheng and Pates, Richard and Mallada, Enrique},
  bdsk-url-3 = {https://mallada.ece.jhu.edu/pubs/2025-Automatica-MPM.pdf},
  bdsk-url-4 = {https://doi.org/10.1016/j.automatica.2025.112184},
  doi = {https://doi.org/10.1016/j.automatica.2025.112184},
  grants = {CAREER-1752362, TRIPODS-1934979, CPS-2136324},
  journal = {Automatica},
  month = {2},
  pages = {1-13},
  record = {published, available online Dec 2024, accepted Oct 2024, revised Feb 2024, submitted Feb 2022},
  title = {A Frequency Domain Analysis of Slow Coherency in Networked Systems},
  url = {https://mallada.ece.jhu.edu/pubs/2025-Automatica-MPM.pdf},
  volume = {74},
  year = {2025}
}

Z. Xu, H. Min, S. Tarmoun, E. Mallada, and R. Vidal, “A Local Polyak-Łojasiewicz and Descent Lemma of Gradient Descent For Overparametrized Linear Models,” Transactions on Machine Learning Research (TMLR), 2025.
[BibTeX] [Download PDF]

@article{xmtmv2025tmlr,
  author = {Xu, Ziqing and Min, Hancheng and Tarmoun, Salma and Mallada, Enrique and Vidal, Rene},
  grants = {Global-Centers-2330450},
  issn = {2835-8856},
  journal = {Transactions on Machine Learning Research (TMLR)},
  month = {5},
  record = {accepted May 2025, submitted Feb 2025},
  title = {A Local Polyak-Łojasiewicz and Descent Lemma of Gradient Descent For Overparametrized Linear Models},
  url = {https://mallada.ece.jhu.edu/pubs/2025-TMLR-XMTMV.pdf},
  year = {2025}
}

T. Zheng, N. Loizou, P. You, and E. Mallada, “Dissipative Gradient Descent Ascent Method: A Control Theory Inspired Algorithm for Min-max Optimization,” IEEE Control Systems Letters (L-CSS), vol. 8, pp. 2009-2014, 2024. doi:10.1109/LCSYS.2024.3413004
[BibTeX] [Abstract] [Download PDF]

Gradient Descent Ascent (GDA) methods for min-max optimization problems typically produce oscillatory behavior that can lead to instability, e.g., in bilinear settings. To address this problem, we introduce a dissipation term into the GDA updates to dampen these oscillations. The proposed Dissipative GDA (DGDA) method can be seen as performing standard GDA on a state-augmented and regularized saddle function that does not strictly introduce additional convexity/concavity. We theoretically show the linear convergence of DGDA in the bilinear and strongly convex-strongly concave settings and assess its performance by comparing DGDA with other methods such as GDA, Extra-Gradient (EG), and Optimistic GDA. Our findings demonstrate that DGDA surpasses these methods, achieving superior convergence rates. We support our claims with two numerical examples that showcase DGDA’s effectiveness in solving saddle point problems.

@article{zlym2024lcss,
  abstract = {Gradient Descent Ascent (GDA) methods for min-max optimization problems typically produce oscillatory behavior that can lead to instability, e.g., in bilinear settings.
To address this problem, we introduce a dissipation term into the GDA updates to dampen these oscillations. The proposed Dissipative GDA (DGDA) method can be seen as performing standard GDA on a state-augmented and regularized saddle function that does not strictly introduce additional convexity/concavity. We theoretically show the linear convergence of DGDA in the bilinear and strongly convex-strongly concave settings and assess its performance by comparing DGDA with other methods such as GDA, Extra-Gradient (EG), and Optimistic GDA.
Our findings demonstrate that DGDA surpasses these methods, achieving superior convergence rates. We support our claims with two numerical examples that showcase DGDA's effectiveness in solving saddle point problems.},
  author = {Zheng, Tianqi and Loizou, Nicolas and You, Pengcheng and Mallada, Enrique},
  bdsk-url-3 = {https://doi.org/10.1109/LCSYS.2024.3413004},
  doi = {10.1109/LCSYS.2024.3413004},
  grants = {CPS-2136324, Global-Centers-2330450},
  journal = {IEEE Control Systems Letters (L-CSS)},
  month = {6},
  pages = {2009-2014},
  record = {published, accepted May 2024, submitted Mar 2024},
  title = {Dissipative Gradient Descent Ascent Method: A Control Theory Inspired Algorithm for Min-max Optimization},
  url = {https://mallada.ece.jhu.edu/pubs/2024-LCSS-ZLYM.pdf},
  volume = {8},
  year = {2024}
}

R. K. Bansal, Y. Chen, P. You, and E. Mallada, “Market Power Mitigation in Two-stage Electricity Market with Supply Function and Quantity Bidding,” IEEE Transactions on Energy Markets, Policy and Regulation, vol. 1, iss. 4, pp. 512-522, 2023. doi:10.1109/TEMPR.2023.3318149
[BibTeX] [Abstract] [Download PDF]

The main goal of a sequential two-stage electricity market (e.g., day-ahead and real-time markets) is to operate efficiently. However, the price difference across stages due to inadequate competition and unforeseen circumstances leads to undesirable price manipulation. To mitigate this, some Independent System Operators (ISOs) proposed system-level market power mitigation (MPM) policies in addition to existing local policies. These policies aim to substitute noncompetitive bids with a default bid based on estimated generator costs. However, these policies may lead to unintended consequences when implemented without accounting for the conflicting interests of participants. In this paper, we model the competition between generators (bidding supply functions) and loads (bidding quantity) in a two-stage market with a stage-wise MPM policy. An equilibrium analysis shows that a real-time MPM policy leads to equilibrium loss, meaning no stable market outcome (Nash equilibrium) exists. A day-ahead MPM policy, besides, leads to a Stackelberg-Nash game with loads acting as leaders and generators as followers. In this setting, loads become winners, i.e., their aggregate payment is always less than competitive payments. Moreover, comparison with standard market equilibrium highlights that markets are better off without such policies. Finally, numerical studies highlight the impact of heterogeneity and load size on market equilibrium.

@article{bcym2023tempr,
  abstract = {The main goal of a sequential two-stage electricity market (e.g., day-ahead and real-time markets) is to operate efficiently. However, the price difference across stages due to inadequate competition and unforeseen circumstances leads to undesirable price manipulation. To mitigate this, some Independent System Operators (ISOs) proposed system-level market power mitigation (MPM) policies in addition to existing local policies. These policies aim to substitute noncompetitive bids with a default bid based on estimated generator costs. However, these policies may lead to unintended consequences when implemented without accounting for the conflicting interests of participants. In this paper, we model the competition between generators (bidding supply functions) and loads (bidding quantity) in a two-stage market with a stage-wise MPM policy. An equilibrium analysis shows that a real-time MPM policy leads to equilibrium loss, meaning no stable market outcome (Nash equilibrium) exists. A day-ahead MPM policy, besides, leads to a Stackelberg-Nash game with loads acting as leaders and generators as followers. In this setting, loads become winners, i.e., their aggregate payment is always less than competitive payments. Moreover, comparison with standard market equilibrium highlights that markets are better off without such policies. Finally, numerical studies highlight the impact of heterogeneity and load size on market equilibrium.},
  author = {Bansal, Rajni Kant and Chen, Yue and You, Pengcheng and Mallada, Enrique},
  bdsk-url-3 = {https://doi.org/10.1109/TEMPR.2023.3318149},
  doi = {10.1109/TEMPR.2023.3318149},
  grants = {CAREER-1752362, CPS-2136324, EPICS-2330450},
  journal = {IEEE Transactions on Energy Markets, Policy and Regulation},
  month = {12},
  number = {4},
  pages = {512-522},
  record = {published, online Sep 2023, revised July 2023, under revision May 2023, submitted Jan 2023},
  title = {Market Power Mitigation in Two-stage Electricity Market with Supply Function and Quantity Bidding},
  url = {https://mallada.ece.jhu.edu/pubs/2023-TEMPR-BCYM.pdf},
  volume = {1},
  year = {2023}
}

A. Castellano, H. Min, J. Bazerque, and E. Mallada, “Learning to Act Safely with Limited Exposure and Almost Sure Certainty,” IEEE Transactions on Automatic Control, vol. 68, iss. 5, pp. 2979-2994, 2023. doi:10.1109/TAC.2023.3240925
[BibTeX] [Abstract] [Download PDF]

This paper aims to put forward the concept that learning to take safe actions in unknown environments, even with probability one guarantees, can be achieved without the need for an unbounded number of exploratory trials, provided that one is willing to navigate trade-offs between optimality, level of exposure to unsafe events, and the maximum detection time of unsafe actions. We illustrate this concept in two complementary settings. We first focus on the canonical multi-armed bandit problem and seek to study the intrinsic trade-offs of learning safety in the presence of uncertainty. Under mild assumptions on sufficient exploration, we provide an algorithm that provably detects all unsafe machines in an (expected) finite number of rounds. The analysis also unveils a trade-off between the number of rounds needed to secure the environment and the probability of discarding safe machines. We then consider the problem of finding optimal policies for a Markov Decision Process (MDP) with almost sure constraints. We show that the (action) value function satisfies a barrier-based decomposition which allows for the identification of feasible policies independently of the reward process. Using this decomposition, we develop a Barrier-learning algorithm, that identifies such unsafe state-action pairs in a finite expected number of steps. Our analysis further highlights a trade-off between the time lag for the underlying MDP necessary to detect unsafe actions, and the level of exposure to unsafe events. Simulations corroborate our theoretical findings, further illustrating the aforementioned trade-offs, and suggesting that safety constraints can further speed up the learning process.

@article{cmbm2023tac,
  abstract = {This paper aims to put forward the concept that learning to take safe actions in unknown environments, even with probability one guarantees, can be achieved without the need for an unbounded number of exploratory trials, provided that one is willing to navigate trade-offs between optimality, level of exposure to unsafe events, and the maximum detection time of unsafe actions. We illustrate this concept in two complementary settings. We first focus on the canonical multi-armed bandit problem and seek to study the intrinsic trade-offs of learning safety in the presence of uncertainty.  Under mild assumptions on sufficient exploration, we provide an algorithm that provably detects all unsafe machines in an (expected) finite number of rounds. The analysis also unveils a trade-off between the number of rounds needed to secure the environment and the probability of discarding safe machines.  We then consider the problem of finding optimal policies for a Markov Decision Process (MDP) with almost sure constraints. 
We show that the (action) value function satisfies a barrier-based decomposition which allows for the identification of feasible policies independently of the reward process. Using this decomposition, we develop a Barrier-learning algorithm, that identifies such unsafe state-action pairs in a finite expected number of steps. Our analysis further highlights a trade-off between the time lag for the underlying MDP necessary to detect unsafe actions, and the level of exposure to unsafe events. Simulations corroborate our theoretical findings, further illustrating the aforementioned trade-offs, and suggesting that safety constraints can further speed up the learning process.},
  author = {Castellano, Agustin and Min, Hancheng and Bazerque, Juan and Mallada, Enrique},
  bdsk-url-3 = {https://mallada.ece.jhu.edu/pubs/2023-TAC-CMBM.pdf},
  bdsk-url-4 = {http://dx.doi.org/10.1109/TAC.2023.3240925},
  doi = {10.1109/TAC.2023.3240925},
  grants = {CAREER-1752362, TRIPODS-1934979, CPS-2136324},
  journal = {IEEE Transactions on Automatic Control},
  month = {5},
  number = {5},
  pages = {2979-2994},
  record = {published, online May 2023, accepted Jan 2023, revised Oct 2022, submitted May 2021},
  title = {Learning to Act Safely with Limited Exposure and Almost Sure Certainty},
  url = {https://mallada.ece.jhu.edu/pubs/2023-TAC-CMBM.pdf},
  volume = {68},
  year = {2023}
}

Recent Conference Papers

J. Liu and E. Mallada, “Recurrent Control Barrier Functions: A Path Towards Nonparametric Safety Verification,” in 64th IEEE Conference on Decision and Control (CDC), 2025.
[BibTeX] [Abstract] [Download PDF]

Ensuring the safety of complex dynamical systems often relies on Hamilton-Jacobi (HJ) Reachability Analysis or Control Barrier Functions (CBFs). Both methods require computing a function that characterizes a safe set that can be made (control) invariant. However, the computational burden of solving high-dimensional partial differential equations (for HJ Reachability) or large-scale semidefinite programs (for CBFs) makes finding such functions challenging. In this paper, we introduce the notion of Recurrent Control Barrier Functions (RCBFs), a novel class of CBFs that leverages a recurrent property of the trajectories, i.e., coming back to a safe set, for safety verification. Under mild assumptions, we show that the RCBF condition holds for the signed-distance function, turning function design into set identification. Notably, the resulting set need not be invariant to certify safety. We further propose a data-driven nonparametric method to compute safe sets that is massively parallelizable and trades off conservativeness against computational cost.

@inproceedings{lm2025cdc,
  abstract = {Ensuring the safety of complex dynamical systems often relies on Hamilton-Jacobi (HJ) Reachability Analysis or Control Barrier Functions (CBFs). Both methods require computing a function that characterizes a safe set that can be made (control) invariant. However, the computational burden of solving high-dimensional partial differential equations (for HJ Reachability) or large-scale semidefinite programs (for CBFs) makes finding such functions challenging. In this paper, we introduce the notion of Recurrent Control Barrier Functions (RCBFs), a novel class of CBFs that leverages a recurrent property of the trajectories, i.e., coming back to a safe set, for safety verification. Under mild assumptions, we show that the RCBF condition holds for the signed-distance function, turning function design into set identification. Notably, the resulting set need not be invariant to certify safety. We further propose a data-driven nonparametric method to compute safe sets that is massively parallelizable and trades off conservativeness against computational cost.},
  author = {Liu, Jixian and Mallada, Enrique},
  booktitle = {64th IEEE Conference on Decision and Control (CDC)},
  grants = {CPS-2136324; Global-Centers-2330450},
  month = {12},
  pubstate = {to appear},
  record = {accepted Jul 2025, submitted Mar 2025},
  title = {Recurrent Control Barrier Functions: A Path Towards Nonparametric Safety Verification},
  url = {https://mallada.ece.jhu.edu/pubs/2025-CDC-LM.pdf},
  year = {2025}
}

H. M. Bui, E. Mallada, and A. Liu, “Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits,” in International Conference on Artificial Intelligence and Statistics (AISTATS), 2025.
[BibTeX] [Abstract] [Download PDF]

Despite the empirical success of Low-Rank Adaptation (LoRA) in fine-tuning pretrained models, there is little theoretical understanding of how first-order methods with carefully crafted initialization adapt models to new tasks. In this work, we take the first step towards bridging this gap by theoretically analyzing the learning dynamics of LoRA for matrix factorization (MF) under gradient flow (GF), emphasizing the crucial role of initialization. For small initialization, we theoretically show that GF converges to a neighborhood of the optimal solution, with smaller initialization leading to lower final error. Our analysis shows that the final error is affected by the misalignment between the singular spaces of the pre-trained model and the target matrix, and reducing the initialization scale improves alignment. To address this misalignment, we propose a spectral initialization for LoRA in MF and theoretically prove that GF with small spectral initialization converges to the fine-tuning task with arbitrary precision. Numerical experiments from MF and image classification validate our findings.

@inproceedings{bml2025aistats,
  abstract = {Despite the empirical success of Low-Rank Adaptation (LoRA) in fine-tuning pretrained models, there is little theoretical understanding of how first-order methods with carefully crafted initialization adapt models to new tasks. In this work, we take the first step towards bridging this gap by theoretically analyzing the learning dynamics of LoRA for matrix factorization (MF) under gradient flow (GF), emphasizing the crucial role of initialization. For small initialization, we theoretically show that GF converges to a neighborhood of the optimal solution, with smaller initialization leading to lower final error. Our analysis shows that the final error is affected by the misalignment between the singular spaces of the pre-trained model and the target matrix, and reducing the initialization scale improves alignment. To address this misalignment, we propose a spectral initialization for LoRA in MF and theoretically prove that GF with small spectral initialization converges to the fine-tuning task with arbitrary precision. Numerical experiments from MF and image classification validate our findings.},
  author = {Bui, Ha Manh and Mallada, Enrique and Liu, Anqi},
  booktitle = {International Conference on Artificial Intelligence and Statistics (AISTATS)},
  grants = {No Grant},
  month = {4},
  publisher = {PMLR},
  record = {accepted Jan 2025, submitted Oct 2024},
  series = {Proceedings of Machine Learning Research},
  title = {Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits},
  url = {https://mallada.ece.jhu.edu/pubs/2025-AISTATS-BML.pdf},
  year = {2025}
}

A. Castellano, S. Rezaei, J. Markovitz, and E. Mallada, “Nonparametric Policy Improvement for Continuous Action Spaces via Expert Trajectories,” in Reinforcement Learning Conference, 2025.
[BibTeX] [Abstract] [Download PDF]

The policy improvement theorem is a fundamental building block of classical reinforcement learning for discrete action spaces. Unfortunately, the lack of an analogous result for continuous action spaces with function approximation has historically limited theoretical guarantees of policy optimization algorithms, undermining their reliability. Here, we introduce a novel nonparametric policy that relies purely on data to take actions and that admits a policy improvement theorem for deterministic Markov Decision Processes (MDPs). By imposing mild regularity assumptions on the optimal policy, we show that, when data come from expert demonstrations, one can construct a nonparametric lower bound on the value of the policy, thus enabling its robust evaluation. The constructed lower bound naturally leads to a simple improvement mechanism based on adding more demonstrations. We also provide conditions to identify regions of the state space where additional demonstrations are needed to meet specific performance goals. Finally, we propose a policy optimization algorithm that ensures a monotonic improvement of the lower bound and leads to high probability performance guarantees. These contributions provide a foundational step toward establishing a rigorous framework for policy improvement in continuous action spaces.

@inproceedings{crmm2025rlc,
  abstract = {The policy improvement theorem is a fundamental building block of classical reinforcement learning for discrete action spaces. Unfortunately, the lack of an analogous result for continuous action spaces with function approximation has historically limited theoretical guarantees of policy optimization algorithms, undermining their reliability. Here, we introduce a novel nonparametric policy that relies purely on data to take actions and that admits a policy improvement theorem for deterministic Markov Decision Processes (MDPs). By imposing mild regularity assumptions on the optimal policy, we show that, when data come from expert demonstrations, one can construct a nonparametric lower bound on the value of the policy, thus enabling its robust evaluation. The constructed lower bound naturally leads to a simple improvement mechanism based on adding more demonstrations. We also provide conditions to identify regions of the state space where additional demonstrations are needed to meet specific performance goals. Finally, we propose a policy optimization algorithm that ensures a monotonic improvement of the lower bound and leads to high probability performance guarantees. These contributions provide a foundational step toward establishing a rigorous framework for policy improvement in continuous action spaces.},
  author = {Castellano, Agustin and Rezaei, Sohrab and Markovitz, Jared and Mallada, Enrique},
  booktitle = {Reinforcement Learning Conference},
  month = {7},
  pubstate = {to appear},
  record = {accepted May 2025, submitted Feb 2025},
  title = {Nonparametric Policy Improvement for Continuous Action Spaces via Expert Trajectories},
  url = {https://mallada.ece.jhu.edu/pubs/2025-RLC-CRMM.pdf},
  year = {2025}
}

J. Drgona, T. X. Nghiem, T. Beckers, M. Fazlyab, E. Mallada, C. Jones, D. Vrabie, S. L. Brunton, and R. Findeisen, “Safe Physics-informed Machine Learning for Dynamics and Control,” in American Control Conference (ACC), 2025, pp. 591-606. doi:10.23919/ACC63710.2025.11107836
[BibTeX] [Abstract] [Download PDF]

This tutorial paper focuses on safe physics-informed machine learning in the context of dynamics and control, providing a comprehensive overview of how to integrate physical models and safety guarantees. As machine learning techniques enhance the modeling and control of complex dynamical systems, ensuring safety and stability remains a critical challenge, especially in safety-critical applications like autonomous vehicles, robotics, medical decision-making, and energy systems. We explore various approaches for embedding and ensuring safety constraints, including structural priors, Lyapunov and Control Barrier Functions, predictive control, projections, and robust optimization techniques. Additionally, we delve into methods for uncertainty quantification and safety verification, including reachability analysis and neural network verification tools, which help validate that control policies remain within safe operating bounds even in uncertain environments. The paper includes illustrative examples demonstrating the implementation aspects of safe learning frameworks that combine the strengths of data-driven approaches with the rigor of physical principles, offering a path toward the safe control of complex dynamical systems.

@inproceedings{dnbetal2025acc,
  abstract = {This tutorial paper focuses on safe physics-informed machine learning in the context of dynamics and control, providing a comprehensive overview of how to integrate physical models and safety guarantees. As machine learning techniques enhance the modeling and control of complex dynamical systems, ensuring safety and stability remains a critical challenge, especially in safety-critical applications like autonomous vehicles, robotics, medical decision-making, and energy systems. We explore various approaches for embedding and ensuring safety constraints, including structural priors, Lyapunov and Control Barrier Functions, predictive control, projections, and robust optimization techniques. Additionally, we delve into methods for uncertainty quantification and safety verification, including reachability analysis and neural network verification tools, which help validate that control policies remain within safe operating bounds even in uncertain environments. The paper includes illustrative examples demonstrating the implementation aspects of safe learning frameworks that combine the strengths of data-driven approaches with the rigor of physical principles, offering a path toward the safe control of complex dynamical systems.},
  author = {Drgona, Jan and Nghiem, Truong X. and Beckers, Thomas and Fazlyab, Mahyar and Mallada, Enrique and Jones, Colin and Vrabie, Draguna and Brunton, Steven L. and Findeisen, Rolf},
  booktitle = {American Control Conference (ACC)},
  doi = {10.23919/ACC63710.2025.11107836},
  month = {7},
  pages = {591-606},
  record = {submitted Mar 2025, accepted May 2025, presented July 2025},
  title = {Safe Physics-informed Machine Learning for Dynamics and Control},
  url = {https://mallada.ece.jhu.edu/pubs/2025-ACC-Tutorial-DNBetal.pdf},
  year = {2025}
}

Z. Xu, H. Min, L. E. MacDonald, J. Luo, S. Tarmoun, E. Mallada, and R. Vidal, “Understanding the Learning Dynamics of LoRA: A Gradient Flow Perspective on Low-Rank Adaptation in Matrix Factorization,” in International Conference on Artificial Intelligence and Statistics (AISTATS), 2025.
[BibTeX] [Abstract] [Download PDF]

Despite the empirical success of Low-Rank Adaptation (LoRA) in fine-tuning pretrained models, there is little theoretical understanding of how first-order methods with carefully crafted initialization adapt models to new tasks. In this work, we take the first step towards bridging this gap by theoretically analyzing the learning dynamics of LoRA for matrix factorization (MF) under gradient flow (GF), emphasizing the crucial role of initialization. For small initialization, we theoretically show that GF converges to a neighborhood of the optimal solution, with smaller initialization leading to lower final error. Our analysis shows that the final error is affected by the misalignment between the singular spaces of the pre-trained model and the target matrix, and reducing the initialization scale improves alignment. To address this misalignment, we propose a spectral initialization for LoRA in MF and theoretically prove that GF with small spectral initialization converges to the fine-tuning task with arbitrary precision. Numerical experiments from MF and image classification validate our findings.

@inproceedings{xmmltmv2025aistats,
  abstract = {Despite the empirical success of Low-Rank Adaptation (LoRA) in fine-tuning pretrained models, there is little theoretical understanding of how first-order methods with carefully crafted initialization adapt models to new tasks. In this work, we take the first step towards bridging this gap by theoretically analyzing the learning dynamics of LoRA for matrix factorization (MF) under gradient flow (GF), emphasizing the crucial role of initialization. For small initialization, we theoretically show that GF converges to a neighborhood of the optimal solution, with smaller initialization leading to lower final error. Our analysis shows that the final error is affected by the misalignment between the singular spaces of the pre-trained model and the target matrix, and reducing the initialization scale improves alignment. To address this misalignment, we propose a spectral initialization for LoRA in MF and theoretically prove that GF with small spectral initialization converges to the fine-tuning task with arbitrary precision. Numerical experiments from MF and image classification validate our findings.},
  author = {Xu, Ziqing and Min, Hancheng and MacDonald, Lachlan Ewen and Luo, Jinqi and Tarmoun, Salma and Mallada, Enrique and Vidal, Rene},
  booktitle = {International Conference on Artificial Intelligence and Statistics (AISTATS)},
  grants = {Global Centers},
  month = {4},
  publisher = {PMLR},
  record = {accepted Jan 2025, submitted Oct 2024},
  series = {Proceedings of Machine Learning Research},
  title = {Understanding the Learning Dynamics of LoRA: A Gradient Flow Perspective on Low-Rank Adaptation in Matrix Factorization},
  url = {https://mallada.ece.jhu.edu/pubs/2025-AISTATS-XMMLTMV.pdf},
  year = {2025}
}

K. Poe, E. Mallada, and R. Vidal, “Invertibility of Discrete-Time Linear Systems with Sparse Inputs,” in 63rd IEEE Conference on Decision and Control (CDC), 2024. doi:10.1109/CDC56724.2024.10886207
[BibTeX] [Abstract] [Download PDF]

One of the fundamental problems of interest for discrete-time linear systems is whether its input sequence may be recovered given its output sequence, a.k.a. the left inversion problem. Many conditions on the state space geometry, dynamics, and spectral structure of a system have been used to characterize the well-posedness of this problem, without assumptions on the inputs. However, certain structural assumptions, such as input sparsity, have been shown to translate to practical gains in the performance of inversion algorithms, surpassing classical guarantees. Establishing necessary and sufficient conditions for left invertibility of systems with sparse inputs is therefore a crucial step toward understanding the performance limits of system inversion under structured input assumptions. In this work, we provide the first necessary and sufficient characterizations of left invertibility for linear systems with sparse inputs, echoing classic characterizations for standard linear systems. The key insight in deriving these results is in establishing the existence of two novel geometric invariants unique to the sparse-input setting, the weakly unobservable and strongly reachable subspace arrangements. By means of a concrete example, we demonstrate the utility of these characterizations. We conclude by discussing extensions and applications of this framework to several related problems in sparse control.

@inproceedings{pmv2024cdc,
  abstract = {One of the fundamental problems of interest for discrete-time linear systems is whether its input sequence may be recovered given its output sequence, a.k.a. the left inversion problem. Many conditions on the state space geometry, dynamics, and spectral structure of a system have been used to characterize the well-posedness of this problem, without assumptions on the inputs. However, certain structural assumptions, such as input sparsity, have been shown to translate to practical gains in the performance of inversion algorithms, surpassing classical guarantees. Establishing necessary and sufficient conditions for left invertibility of systems with sparse inputs is therefore a crucial step toward understanding the performance limits of system inversion under structured input assumptions. In this work, we provide the first necessary and sufficient characterizations of left invertibility for linear systems with sparse inputs,  echoing classic characterizations for standard linear systems. The key insight in deriving these results is in establishing the existence of two novel geometric invariants unique to the sparse-input setting, the weakly unobservable and strongly reachable subspace arrangements. By means of a concrete example, we demonstrate the utility of these characterizations. We conclude by discussing extensions and applications of this framework to several related problems in sparse control.},
  author = {Poe, Kyle and Mallada, Enrique and Vidal, Rene},
  booktitle = {63rd IEEE Conference on Decision and Control (CDC)},
  doi = {10.1109/CDC56724.2024.10886207},
  grants = {CPS-2136324; Global-Centers-2330450},
  month = {12},
  record = {presented Dec 2024, accepted Jul 2024, submitted Mar 2024},
  title = {Invertibility of Discrete-Time Linear Systems with Sparse Inputs},
  url = {https://mallada.ece.jhu.edu/pubs/2024-CDC-PMV.pdf},
  year = {2024}
}

T. Zheng, N. Loizou, P. You, and E. Mallada, “Dissipative Gradient Descent Ascent Method: A Control Theory Inspired Algorithm for Min-max Optimization,” in 63rd IEEE Conference on Decision and Control (CDC), 2024. doi:10.1109/LCSYS.2024.3413004
[BibTeX] [Abstract] [Download PDF]

Gradient Descent Ascent (GDA) methods for min-max optimization problems typically produce oscillatory behavior that can lead to instability, e.g., in bilinear settings. To address this problem, we introduce a dissipation term into the GDA updates to dampen these oscillations. The proposed Dissipative GDA (DGDA) method can be seen as performing standard GDA on a state-augmented and regularized saddle function that does not strictly introduce additional convexity/concavity. We theoretically show the linear convergence of DGDA in the bilinear and strongly convex-strongly concave settings and assess its performance by comparing DGDA with other methods such as GDA, Extra-Gradient (EG), and Optimistic GDA. Our findings demonstrate that DGDA surpasses these methods, achieving superior convergence rates. We support our claims with two numerical examples that showcase DGDA’s effectiveness in solving saddle point problems.

@inproceedings{zlym2024cdc,
  abstract = {Gradient Descent Ascent (GDA) methods for min-max optimization problems typically produce oscillatory behavior that can lead to instability, e.g., in bilinear settings. To address this problem, we introduce a dissipation term into the GDA updates to dampen these oscillations. The proposed Dissipative GDA (DGDA) method can be seen as performing standard GDA on a state-augmented and regularized saddle function that does not strictly introduce additional convexity/concavity. We theoretically show the linear convergence of DGDA in the bilinear and strongly convex-strongly concave settings and assess its performance by comparing DGDA with other methods such as GDA, Extra-Gradient (EG), and Optimistic GDA. Our findings demonstrate that DGDA surpasses these methods, achieving superior convergence rates. We support our claims with two numerical examples that showcase DGDA's effectiveness in solving saddle point problems.},
  author = {Zheng, Tianqi and Loizou, Nicolas and You, Pengcheng and Mallada, Enrique},
  booktitle = {63rd IEEE Conference on Decision and Control (CDC)},
  doi = {10.1109/LCSYS.2024.3413004},
  grants = {CPS-2136324; Global-Centers-2330450},
  month = {12},
  note = {also in L-CSS},
  record = {presented Dec 2024, accepted Jul 2024, submitted Mar 2024},
  title = {Dissipative Gradient Descent Ascent Method: A Control Theory Inspired Algorithm for Min-max Optimization},
  url = {https://mallada.ece.jhu.edu/pubs/2024-CDC-ZLYM.pdf},
  year = {2024}
}
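
The damping idea in the abstract above can be illustrated on the bilinear problem f(x, y) = x y. The following is a minimal sketch, not the paper's implementation: the augmented function F = (rho/2)(x - xh)^2 + x y - (rho/2)(y - yh)^2, the auxiliary states `xh` and `yh`, and the values of `eta` and `rho` are all assumptions chosen for illustration. Plain simultaneous GDA spirals outward on this problem, while GDA on the augmented function contracts toward the saddle point at the origin.

```python
import numpy as np


def gda_bilinear(eta, steps, x=1.0, y=1.0):
    """Plain simultaneous GDA on f(x, y) = x * y; returns the final distance to (0, 0)."""
    for _ in range(steps):
        x, y = x - eta * y, y + eta * x
    return float(np.hypot(x, y))


def damped_gda_bilinear(eta, rho, steps, x=1.0, y=1.0):
    """Simultaneous GDA on an ASSUMED augmented saddle function
        F = rho/2 * (x - xh)**2 + x * y - rho/2 * (y - yh)**2,
    minimizing over (x, xh) and maximizing over (y, yh)."""
    xh, yh = x, y  # hypothetical auxiliary states, initialized at the primal/dual iterates
    for _ in range(steps):
        gx = rho * (x - xh) + y    # dF/dx
        gxh = -rho * (x - xh)      # dF/dxh
        gy = x - rho * (y - yh)    # dF/dy
        gyh = rho * (y - yh)       # dF/dyh
        # Descent on (x, xh), ascent on (y, yh), all updated simultaneously.
        x, xh, y, yh = x - eta * gx, xh - eta * gxh, y + eta * gy, yh + eta * gyh
    return float(np.hypot(x, y))


gda_norm = gda_bilinear(eta=0.1, steps=2000)
damped_norm = damped_gda_bilinear(eta=0.1, rho=1.0, steps=2000)
print("plain GDA distance to saddle: ", gda_norm)
print("damped GDA distance to saddle:", damped_norm)
```

In this sketch the regularization terms vanish whenever `xh = x` and `yh = y`, so the saddle point of the original problem is preserved while the mismatch between the iterates and their auxiliary copies dissipates the rotation.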

Y. Shen, H. Sibai, and E. Mallada, “Generalized Barrier Functions: Integral Conditions & Recurrent Relaxations,” in 60th Allerton Conference on Communication, Control, and Computing, 2024, pp. 1-8.
[BibTeX] [Abstract] [Download PDF]

Barrier functions constitute an effective tool for assessing and enforcing safety-critical constraints on dynamical systems. To this end, one is required to find a function $h$ that satisfies a Lyapunov-like differential condition, thereby ensuring the invariance of its zero super-level set $h_{\ge 0}$. This methodology, however, does not prescribe a general method for finding the function $h$ that satisfies such differential conditions, which, in general, can be a daunting task. In this paper, we seek to overcome this limitation by developing a generalized barrier condition that makes the search for $h$ easier. We do this in two steps. First, we develop integral barrier conditions that reveal equivalent asymptotic behavior to the differential ones, but without requiring differentiability of $h$. Subsequently, we further replace the stringent invariance requirement on $h_{\ge 0}$ with a more flexible concept known as recurrence. A set is ($τ$-)recurrent if every trajectory that starts in the set returns to it (within $τ$ seconds) infinitely often. We show that, under mild conditions, a simple signed distance function can satisfy our relaxed condition and that the ($τ$-)recurrence of the super-level set $h_{\ge 0}$ is sufficient to guarantee the system’s safety.

@inproceedings{ssm2024allerton,
  abstract = {Barrier functions constitute an effective tool for assessing and enforcing safety-critical constraints on dynamical systems. To this end, one is required to find a function $h$ that satisfies a Lyapunov-like differential condition, thereby ensuring the invariance of its zero super-level set $h_{\ge 0}$. This methodology, however, does not prescribe a general method for finding the function $h$ that satisfies such differential conditions, which, in general, can be a daunting task. In this paper, we seek to overcome this limitation by developing a generalized barrier condition that makes the search for $h$ easier. We do this in two steps. First, we develop integral barrier conditions that reveal equivalent asymptotic behavior to the differential ones, but without requiring differentiability of $h$. Subsequently, we further replace the stringent invariance requirement on $h_{\ge 0}$ with a more flexible concept known as recurrence. A set is ($τ$-)recurrent if every trajectory that starts in the set returns to it (within $τ$ seconds) infinitely often. We show that, under mild conditions, a simple signed distance function can satisfy our relaxed condition and that the ($τ$-)recurrence of the super-level set $h_{\ge 0}$ is sufficient to guarantee the system's safety.},
  author = {Shen, Yue and Sibai, Hussein and Mallada, Enrique},
  booktitle = {60th Allerton Conference on Communication, Control, and Computing},
  grants = {CPS-2136324; Global-Centers-2330450},
  keywords = {Barrier Functions},
  month = {9},
  pages = {1-8},
  pubstate = {presented},
  record = {accepted Jul 2024, submitted Jul 2024},
  title = {Generalized Barrier Functions: Integral Conditions \& Recurrent Relaxations},
  url = {https://mallada.ece.jhu.edu/pubs/2024-Allerton-SSM.pdf},
  year = {2024}
}
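
The recurrence notion defined in the abstract above can be checked numerically on a toy example. The sketch below is illustrative only and is not taken from the paper: the discrete-time system (a rotation by 45 degrees scaled by 0.95), the unit box used as the candidate set, the sample grid, and the helper names `in_box` and `first_return_time` are all assumptions, and time is counted in steps rather than seconds. The box is not invariant, since its corner leaves after one step, yet every sampled trajectory is back inside within two steps.

```python
import numpy as np

# Hypothetical system: rotate by 45 degrees and contract by 0.95 at every step.
theta = np.pi / 4
A = 0.95 * np.array([[np.cos(theta), -np.sin(theta)],
                     [np.sin(theta),  np.cos(theta)]])


def in_box(x):
    """Membership test for the candidate set S = [-1, 1]^2."""
    return bool(np.max(np.abs(x)) <= 1.0)


def first_return_time(x0, tau):
    """First step k in 1..tau at which the trajectory from x0 is back in S, else None."""
    x = np.asarray(x0, dtype=float)
    for k in range(1, tau + 1):
        x = A @ x
        if in_box(x):
            return k
    return None


# Sample initial conditions on a grid covering S (a numerical check, not a proof).
grid = np.linspace(-1.0, 1.0, 21)
return_times = [first_return_time((a, b), tau=5) for a in grid for b in grid]

leaves_box = not in_box(A @ np.array([1.0, 1.0]))    # the corner exits S after one step
all_return = all(t is not None for t in return_times)
max_return = max(t for t in return_times if t is not None)

print("S is invariant:", not leaves_box)
print("all samples return within 5 steps:", all_return)
print("largest first-return time:", max_return)
```

Because each sampled trajectory re-enters the box, the same check applies again from the re-entry point, which is the sense in which returns happen infinitely often; a grid of samples only suggests this property and does not certify it for the whole set.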