Koopman-Operator-based Reinforcement Learning Control of Partial Differential Equations
This project aims to enhance reinforcement learning for large-scale engineering systems by developing performance-guaranteed controllers, addressing safety in energy-efficient technologies.
Projectdetails
Introduction
An unprecedented energy crisis is looming over us. In order to transition to a greener and more energy-efficient society, existing technologies need to be improved and novel techniques such as nuclear fusion developed. This requires the stabilization of aerodynamics, heat transfer, or combustion and fusion processes and thus, the development of efficient control strategies for large-scale dynamical systems.
Challenges with Reinforcement Learning
In recent years, reinforcement learning (RL) has emerged as a highly promising data-driven technique. Unfortunately, we cannot trust RL to handle our most important and complex systems, since the resulting controllers do not possess performance guarantees.
Limitations of Current Approaches
Certifiable RL approaches such as linear or kernel methods tend to scale poorly, such that their applicability is limited to toy examples. In contrast to other application areas, this is a complete show-stopper for safety-critical engineering. Moreover, the training is extremely data-hungry and costly, due to which RL itself contributes to the energy crisis.
Project Vision
The vision of this project is to develop new foundational methods to equip RL controllers for large-scale engineering systems with performance guarantees by exploiting system knowledge and systematically reducing the complexity.
Major Breakthroughs
To achieve this, I will target three major breakthroughs:
- Global linearization of the dynamics via the Koopman operator framework.
- The extension of certified Q-learning to continuous action spaces via control quantization.
- The detection and exploitation of symmetries in the system dynamics.
Required Advancements
The project requires significant joint advancements in several challenging areas such as control, approximation theory, and machine learning.
Potential Impact
In the case of success, the resulting controllers will provide a massive advancement of RL towards safety-critical engineering applications and significantly contribute to the challenge of meeting the future energy demands of our society.
Financiële details & Tijdlijn
Financiële details
Subsidiebedrag | € 1.499.000 |
Totale projectbegroting | € 1.499.000 |
Tijdlijn
Startdatum | 1-1-2025 |
Einddatum | 31-12-2029 |
Subsidiejaar | 2025 |
Partners & Locaties
Projectpartners
- TECHNISCHE UNIVERSITAT DORTMUNDpenvoerder
Land(en)
Vergelijkbare projecten binnen European Research Council
Project | Regeling | Bedrag | Jaar | Actie |
---|---|---|---|---|
MANUNKIND: Determinants and Dynamics of Collaborative ExploitationThis project aims to develop a game theoretic framework to analyze the psychological and strategic dynamics of collaborative exploitation, informing policies to combat modern slavery. | ERC STG | € 1.497.749 | 2022 | Details |
Elucidating the phenotypic convergence of proliferation reduction under growth-induced pressureThe UnderPressure project aims to investigate how mechanical constraints from 3D crowding affect cell proliferation and signaling in various organisms, with potential applications in reducing cancer chemoresistance. | ERC STG | € 1.498.280 | 2022 | Details |
Uncovering the mechanisms of action of an antiviral bacteriumThis project aims to uncover the mechanisms behind Wolbachia's antiviral protection in insects and develop tools for studying symbiont gene function. | ERC STG | € 1.500.000 | 2023 | Details |
The Ethics of Loneliness and SociabilityThis project aims to develop a normative theory of loneliness by analyzing ethical responsibilities of individuals and societies to prevent and alleviate loneliness, establishing a new philosophical sub-field. | ERC STG | € 1.025.860 | 2023 | Details |
MANUNKIND: Determinants and Dynamics of Collaborative Exploitation
This project aims to develop a game theoretic framework to analyze the psychological and strategic dynamics of collaborative exploitation, informing policies to combat modern slavery.
Elucidating the phenotypic convergence of proliferation reduction under growth-induced pressure
The UnderPressure project aims to investigate how mechanical constraints from 3D crowding affect cell proliferation and signaling in various organisms, with potential applications in reducing cancer chemoresistance.
Uncovering the mechanisms of action of an antiviral bacterium
This project aims to uncover the mechanisms behind Wolbachia's antiviral protection in insects and develop tools for studying symbiont gene function.
The Ethics of Loneliness and Sociability
This project aims to develop a normative theory of loneliness by analyzing ethical responsibilities of individuals and societies to prevent and alleviate loneliness, establishing a new philosophical sub-field.
Vergelijkbare projecten uit andere regelingen
Project | Regeling | Bedrag | Jaar | Actie |
---|---|---|---|---|
Model-based Reinforcement Learning for Versatile Robots in the Real WorldREAL-RL aims to create versatile autonomous robots that learn from experience using a model-based approach for efficient task adaptation and behavior planning. | ERC COG | € 1.998.500 | 2023 | Details |
Projection-based Control: A Novel Paradigm for High-performance SystemsPROACTHIS aims to develop a novel projection-based control paradigm to enhance performance in future engineering systems through innovative design and optimization techniques. | ERC ADG | € 2.498.516 | 2022 | Details |
Control for Deep and Federated LearningCoDeFeL aims to enhance machine learning methods through control theory, developing efficient ResNet architectures and federated learning techniques for applications in digital medicine and recommendations. | ERC ADG | € 2.499.224 | 2024 | Details |
Reinforcement Learning & Solver Racing in simulatieversnellingenHet project onderzoekt het gebruik van reinforcement learning en solver racing om de efficiëntie van computersimulaties te verbeteren. | MIT Haalbaarheid | € 20.000 | 2024 | Details |
Model-based Reinforcement Learning for Versatile Robots in the Real World
REAL-RL aims to create versatile autonomous robots that learn from experience using a model-based approach for efficient task adaptation and behavior planning.
Projection-based Control: A Novel Paradigm for High-performance Systems
PROACTHIS aims to develop a novel projection-based control paradigm to enhance performance in future engineering systems through innovative design and optimization techniques.
Control for Deep and Federated Learning
CoDeFeL aims to enhance machine learning methods through control theory, developing efficient ResNet architectures and federated learning techniques for applications in digital medicine and recommendations.
Reinforcement Learning & Solver Racing in simulatieversnellingen
Het project onderzoekt het gebruik van reinforcement learning en solver racing om de efficiëntie van computersimulaties te verbeteren.