Reinventing the Theory of Machine Learning on Graphs

Project MALAGA aims to establish a foundational theory for Graph Machine Learning to enhance the performance and reliability of Graph Neural Networks across diverse domains like biology and social sciences.

Subsidie
€ 1.479.643
2025

Projectdetails

Introduction

In many scientific domains, graphs are the objects of choice to represent structured data: from molecules to social networks, power grids, the internet, and so on. The exploitation of graph data represents a major scientific and industrial challenge.

Graph Machine Learning

Graph Machine Learning (GML) is thus a fast-growing field, with so-called Graph Neural Networks (GNN) at the forefront. However, in sharp contrast with traditional ML, the field of GML has somewhat jumped from early methods to deep learning, without the decades-long development of well-established notions to compare, analyze, and improve algorithms.

Limitations of GNNs

As a result, there are significant limitations:

  1. GNNs, all based on the so-called message-passing paradigm, have significant limitations both practical and theoretical, and it is not clear how to address them.
  2. GNNs do not take into account the specificities of graphs coming from domains as different as biology or the social sciences.

Thus, practical results may vary wildly from one case to the other, with no guidelines on how to design reliable GNNs in each case. Overall, these are the symptoms of an overlooked major issue: GML is hitting a glass ceiling due to its severe lack of a grand, foundational theory.

Project MALAGA

The ambition of project MALAGA is to develop such a theory. Solving the crucial limitations of the current theory is highly challenging for several reasons:

  • Current mathematical tools cannot analyze the learning capabilities of GML methods in a unified way.
  • Existing statistical graph models do not faithfully represent the many characteristics of modern graph data.
  • Computational complexity becomes problematic on large graphs.

MALAGA will develop a radically new understanding of GML problems and of the strengths and limitations of a large panel of algorithms. Our goal is to significantly boost the performance, reliability, and adaptivity of GNNs, with a significant impact on three types of graph data that exhibit very different but representative behaviors: biological networks, social networks, and online recommender systems.

Financiële details & Tijdlijn

Financiële details

Subsidiebedrag€ 1.479.643
Totale projectbegroting€ 1.479.643

Tijdlijn

Startdatum1-2-2025
Einddatum31-1-2030
Subsidiejaar2025

Partners & Locaties

Projectpartners

  • CENTRE NATIONAL DE LA RECHERCHE SCIENTIFIQUE CNRSpenvoerder

Land(en)

France

Vergelijkbare projecten binnen European Research Council

ERC Starting...

Higher-Order Hodge Laplacians for Processing of multi-way Signals

This project aims to enhance graph signal processing by developing methods for analyzing higher-order relations in complex systems using Hodge-Laplacians and algebraic topology.

€ 1.500.000
ERC Starting...

Optimizing for Generalization in Machine Learning

This project aims to unravel the mystery of generalization in machine learning by developing novel optimization algorithms to enhance the reliability and applicability of ML in critical domains.

€ 1.494.375
ERC Advanced...

Control for Deep and Federated Learning

CoDeFeL aims to enhance machine learning methods through control theory, developing efficient ResNet architectures and federated learning techniques for applications in digital medicine and recommendations.

€ 2.499.224
ERC Consolid...

Reconciling Classical and Modern (Deep) Machine Learning for Real-World Applications

APHELEIA aims to create robust, interpretable, and efficient machine learning models that require less data by integrating classical methods with modern deep learning, fostering interdisciplinary collaboration.

€ 1.999.375
ERC Starting...

Graphs without Labels: Multimodal Structure Learning without Human Supervision

The project aims to enhance multimodal learning by using graph-based representations to capture semantic structures and relationships in diverse data, improving data efficiency and fairness in label-free learning.

€ 1.499.438