Learning Isoform Fingerprints to Discover the Molecular Diversity of Life

This project aims to revolutionize proteomics by developing a novel data analysis strategy using deep learning to discover and quantify protein isoforms through their unique multi-dimensional fingerprints (ORIGINs).

Subsidie
€ 1.498.939
2023

Projectdetails

Introduction

Did you know that ~80% of all proteomic data is not utilized? Proteins play a vital role in all biological processes and organisms. We believe that different versions of a single gene product – protein isoforms – shape the molecular diversity of life. However, comprehensive evidence on a protein level is not available.

Current Challenges

Chromatography-coupled tandem mass spectrometry (LC-MS/MS) is the de-facto standard for measuring proteomes, but it is not good at identifying isoforms because at least 80% of the recorded information is never used.

Project Aim

I argue that isoforms leave a deterministic multi-dimensional fingerprint (ORIGINs) representing their physicochemical properties in each proteomic measurement. Therefore, the central aim of this project is to discover and quantify protein isoforms systematically by a novel MS-based proteomics data analysis strategy.

Methodology

  1. Data Utilization: By tapping into the wealth of data the proteomics community has already amassed, I will train deep neural networks that allow the prediction of ORIGINs.
  2. Innovative Analysis: I will implement an innovative data analysis strategy that utilizes ORIGINs to identify and quantify isoforms.
  3. Application: I will demonstrate that ORIGINs can be used to substantially broaden our understanding of the molecular diversity of life by showcasing its application on four emerging and challenging questions in proteome research of varying biological and technical complexity.

Research Significance

This will allow me to address a fundamental open question in biology: to what extent and prevalence are isoforms actually translated and what functional roles might they be associated with? ORIGINs will improve the sensitivity, biological resolution, and accuracy at which proteins and their isoforms can be identified and quantified.

Broader Impact

Beyond this, the concept of ORIGINs can be applied to and improve any proteomics experiments, and thus holds the potential to revolutionize MS-based proteomics as a technology and elevate the whole field of protein-based research.

Financiële details & Tijdlijn

Financiële details

Subsidiebedrag€ 1.498.939
Totale projectbegroting€ 1.498.939

Tijdlijn

Startdatum1-6-2023
Einddatum31-5-2028
Subsidiejaar2023

Partners & Locaties

Projectpartners

  • TECHNISCHE UNIVERSITAET MUENCHENpenvoerder

Land(en)

Germany

Vergelijkbare projecten binnen European Research Council

ERC STG

MANUNKIND: Determinants and Dynamics of Collaborative Exploitation

This project aims to develop a game theoretic framework to analyze the psychological and strategic dynamics of collaborative exploitation, informing policies to combat modern slavery.

€ 1.497.749
ERC STG

Elucidating the phenotypic convergence of proliferation reduction under growth-induced pressure

The UnderPressure project aims to investigate how mechanical constraints from 3D crowding affect cell proliferation and signaling in various organisms, with potential applications in reducing cancer chemoresistance.

€ 1.498.280
ERC STG

Uncovering the mechanisms of action of an antiviral bacterium

This project aims to uncover the mechanisms behind Wolbachia's antiviral protection in insects and develop tools for studying symbiont gene function.

€ 1.500.000
ERC STG

The Ethics of Loneliness and Sociability

This project aims to develop a normative theory of loneliness by analyzing ethical responsibilities of individuals and societies to prevent and alleviate loneliness, establishing a new philosophical sub-field.

€ 1.025.860

Vergelijkbare projecten uit andere regelingen

ERC COG

Proteome diversification in evolution

PROMISE aims to decode protein sequences and structures using AI to understand their interactions and evolution, ultimately transforming big data into actionable biological insights.

€ 1.952.762
ERC ADG

A Native Mass Spectrometry Systemic View of Cellular Structural Biology

This project aims to enhance native mass spectrometry for studying protein interactions and diversity in their natural cellular environments, advancing structural biology and related fields.

€ 2.954.167
ERC COG

Explainable Machine Learning for Identifying the Full Heterogeneity of Peptidoforms and Proteoforms

explAInProt aims to enhance proteomics by developing explainable, end-to-end machine learning models to identify undetected protein variants and improve clinical applications through advanced sequencing methods.

€ 1.992.500
ERC POC

Precise, Rapid and Scalable Proteomics Solutions for Archaeology, Ecology, Wildlife Forensics and Food-chain Authentication

The PReciSe project aims to develop a fast, cost-effective proteomics method for taxonomic identification to enhance archaeological, ecological, and food supply chain verification.

€ 150.000