Data-Driven Verification and Learning Under Uncertainty
The DEUCE project aims to enhance reinforcement learning by developing novel verification methods that ensure safety and correctness in complex, uncertain environments through data-driven abstractions.
Project details
Introduction
Reinforcement learning (RL) agents learn to behave optimally via trial and error, without the need to encode complicated behavior explicitly. However, RL generally lacks mechanisms that ensure correct behavior at all times with respect to sophisticated task and safety specifications.
Formal Verification
Formal verification (FV), and in particular model checking, provides formal guarantees on a system's correctness based on rigorous methods and precise specifications. Despite active development by researchers from all over the world, fundamental challenges have so far obstructed the application of FV to RL.
Key Challenges
We identify three key challenges that frame the objectives of this proposal:
- Curse of Dimensionality: Complex environments with large degrees of freedom induce large state and feature spaces. This curse of dimensionality poses a longstanding problem for verification.
- Idealized State Spaces: Common approaches for the correctness of RL systems employ idealized discrete state spaces. However, realistic problems are often continuous.
- Uncertainty in Real-World Environments: Knowledge about real-world environments is inherently uncertain. To ensure safety, correctness guarantees need to be robust against such imprecise knowledge about the environment.
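To make "imprecise knowledge about the environment" concrete, one common option is to replace a point estimate of a transition probability with a confidence interval computed from observed samples. The sketch below uses a two-sided Hoeffding bound. It is only an illustration under that assumption, not a description of the project's own method, and the function name `hoeffding_interval` is hypothetical.

```python
import math


def hoeffding_interval(successes: int, trials: int, delta: float = 0.05) -> tuple[float, float]:
    """Two-sided Hoeffding confidence interval for a Bernoulli probability.

    With probability at least 1 - delta, the true probability lies in the
    returned interval. `successes` counts how often a transition was
    observed in `trials` independent samples (illustrative setup).
    """
    if trials <= 0:
        raise ValueError("trials must be positive")
    if not 0 <= successes <= trials:
        raise ValueError("successes must lie between 0 and trials")
    p_hat = successes / trials
    # From P(|p_hat - p| >= eps) <= 2 * exp(-2 * trials * eps**2) = delta.
    eps = math.sqrt(math.log(2.0 / delta) / (2.0 * trials))
    # Clip to [0, 1] because the bounds are probabilities.
    return max(0.0, p_hat - eps), min(1.0, p_hat + eps)


if __name__ == "__main__":
    # The interval narrows as more data is collected.
    print(hoeffding_interval(80, 100))
    print(hoeffding_interval(8000, 10000))
```

A guarantee that holds for every probability inside such an interval is robust against the estimation error, at the price of being more conservative when data is scarce.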
Project Objectives
The main objective of the DEUCE project is to develop novel and data-driven verification methods that tightly integrate with RL.
Approach
To cope with the curse of dimensionality, we devise learning-based abstraction schemes that distill the system parts that are relevant for correctness. We employ and define models whose expressiveness captures various types of uncertainty. These models are the basis for formal and data-driven abstractions of continuous spaces.
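As an illustration of a model "whose expressiveness captures uncertainty", the sketch below assumes an interval Markov decision process (interval MDP), where each transition probability is only known to lie in an interval. Whether the project uses this particular model class is an assumption. The code computes the best worst-case probability of reaching a goal state with robust value iteration. All names (`worst_case_expectation`, `robust_reachability`, `EXAMPLE_IMDP`) are hypothetical.

```python
def worst_case_expectation(intervals, values):
    """Minimize the expected value over all distributions within the intervals.

    intervals: dict successor -> (lower, upper) probability bounds,
               assumed to satisfy sum(lower) <= 1 <= sum(upper).
    values:    dict state -> current value estimate.
    """
    # Start from the lower bounds, then push the remaining probability
    # mass onto the successors with the lowest values first.
    probs = {s: lo for s, (lo, _) in intervals.items()}
    budget = 1.0 - sum(probs.values())
    for s in sorted(intervals, key=lambda t: values[t]):
        lo, hi = intervals[s]
        extra = min(hi - lo, budget)
        probs[s] += extra
        budget -= extra
    return sum(probs[s] * values[s] for s in intervals)


def robust_reachability(imdp, goal, iterations=1000, tol=1e-10):
    """Maximal worst-case probability of eventually reaching `goal`.

    imdp: dict state -> dict action -> dict successor -> (lower, upper).
    States without actions are treated as absorbing.
    """
    values = {s: (1.0 if s in goal else 0.0) for s in imdp}
    for _ in range(iterations):
        new_values = {}
        for s, actions in imdp.items():
            if s in goal or not actions:
                new_values[s] = values[s]
            else:
                new_values[s] = max(
                    worst_case_expectation(iv, values) for iv in actions.values()
                )
        change = max(abs(new_values[s] - values[s]) for s in imdp)
        values = new_values
        if change < tol:
            break
    return values


# Tiny illustrative model: action "a" has tighter intervals than action "b".
EXAMPLE_IMDP = {
    "s0": {
        "a": {"goal": (0.6, 0.8), "trap": (0.2, 0.4)},
        "b": {"goal": (0.5, 0.9), "trap": (0.1, 0.5)},
    },
    "goal": {},
    "trap": {},
}

if __name__ == "__main__":
    print(robust_reachability(EXAMPLE_IMDP, {"goal"}))
```

In this toy model the adversarial choice of probabilities gives action "a" a worst-case success probability of 0.6 and action "b" only 0.5, so the robust value of "s0" is 0.6 even though "b" looks better under its most optimistic probabilities.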
Mechanisms
We provide model-based FV mechanisms that ensure safe and correct exploration for RL agents.
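One way such a mechanism can be pictured is a "shield" that filters the agent's action choices during exploration, using risk values obtained beforehand from a verified model. The sketch below assumes that a table `risk[state][action]` with worst-case probabilities of violating safety is already available, for example from a model checker. The table, the threshold, and the function names are illustrative assumptions, not the project's actual design.

```python
import random


def allowed_actions(state, risk, threshold):
    """Return the actions whose precomputed risk stays within the threshold."""
    return [a for a, r in risk[state].items() if r <= threshold]


def shielded_choice(state, q_values, risk, threshold, epsilon, rng):
    """Epsilon-greedy action selection restricted to shielded actions.

    q_values: dict (state, action) -> estimated return; missing entries count as 0.
    risk:     dict state -> dict action -> worst-case probability of a violation.
    rng:      a random.Random instance, passed in for reproducibility.
    """
    safe = allowed_actions(state, risk, threshold)
    if not safe:
        # No action meets the threshold: fall back to the least risky one.
        safe = [min(risk[state], key=risk[state].get)]
    if rng.random() < epsilon:
        return rng.choice(safe)  # explore, but only among shielded actions
    return max(safe, key=lambda a: q_values.get((state, a), 0.0))  # exploit


if __name__ == "__main__":
    risk = {"s0": {"left": 0.02, "right": 0.5, "stay": 0.0}}
    q_values = {("s0", "right"): 10.0, ("s0", "left"): 1.0}
    rng = random.Random(0)
    # "right" has the highest estimated return but is blocked by the shield.
    print(shielded_choice("s0", q_values, risk, threshold=0.05, epsilon=0.0, rng=rng))
```

The agent still learns by trial and error, but every trial is drawn from the set of actions that the model deems acceptable, so exploration never picks an action above the chosen risk threshold unless no alternative exists.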
Conclusion
DEUCE will elevate the scalability and expressiveness of verification towards real-world deployment of reinforcement learning.
Financial details & Timeline
Financial details
Grant amount | € 1,500,000 |
Total project budget | € 1,500,000 |
Timeline
Start date | 1 January 2023 |
End date | 31 December 2027 |
Grant year | 2023 |
Partners & Locations
Project partners
- RUHR-UNIVERSITAET BOCHUM (coordinator)
- STICHTING RADBOUD UNIVERSITEIT
Country/countries
Similar projects within the European Research Council
Project | Description | Scheme | Amount | Year |
---|---|---|---|---|
MANUNKIND: Determinants and Dynamics of Collaborative Exploitation | This project aims to develop a game theoretic framework to analyze the psychological and strategic dynamics of collaborative exploitation, informing policies to combat modern slavery. | ERC STG | € 1,497,749 | 2022 |
Elucidating the phenotypic convergence of proliferation reduction under growth-induced pressure | The UnderPressure project aims to investigate how mechanical constraints from 3D crowding affect cell proliferation and signaling in various organisms, with potential applications in reducing cancer chemoresistance. | ERC STG | € 1,498,280 | 2022 |
Uncovering the mechanisms of action of an antiviral bacterium | This project aims to uncover the mechanisms behind Wolbachia's antiviral protection in insects and develop tools for studying symbiont gene function. | ERC STG | € 1,500,000 | 2023 |
The Ethics of Loneliness and Sociability | This project aims to develop a normative theory of loneliness by analyzing ethical responsibilities of individuals and societies to prevent and alleviate loneliness, establishing a new philosophical sub-field. | ERC STG | € 1,025,860 | 2023 |
Similar projects from other schemes
Project | Description | Scheme | Amount | Year |
---|---|---|---|---|
Model-based Reinforcement Learning for Versatile Robots in the Real World | REAL-RL aims to create versatile autonomous robots that learn from experience using a model-based approach for efficient task adaptation and behavior planning. | ERC COG | € 1,998,500 | 2023 |
Control for Deep and Federated Learning | CoDeFeL aims to enhance machine learning methods through control theory, developing efficient ResNet architectures and federated learning techniques for applications in digital medicine and recommendations. | ERC ADG | € 2,499,224 | 2024 |
Dynamics underlying learning in complex environments | DULCE aims to establish a unified framework to understand learning in complex environments by analyzing neural dynamics and co-occurring learning processes across multiple brain regions. | ERC COG | € 1,925,875 | 2025 |