Hybrid and Interpretable Deep neural audio machines
HI-Audio aims to develop hybrid deep learning models that integrate interpretable signal processing with neural architectures for enhanced audio analysis and synthesis applications.
Projectdetails
Introduction
Machine Listening, or AI for Sound, is defined as the general field of Artificial Intelligence applied to audio analysis, understanding, and synthesis by a machine. The access to ever-increasing super-computing facilities, combined with the availability of huge data repositories (although largely unannotated), has led to the emergence of a significant trend with pure data-driven machine learning approaches.
Current Trends
The field has rapidly moved towards end-to-end neural approaches which aim to directly solve the machine learning problem for raw acoustic signals. However, these approaches often only loosely take into account the nature and structure of the processed data.
Consequences of Current Approaches
The main consequences are that the models are:
- Overly complex, requiring massive amounts of data to be trained and extreme computing power to be efficient (in terms of task performance).
- Largely unexplainable and non-interpretable.
Proposed Solutions
To overcome these major shortcomings, we believe that our prior knowledge about the nature of the processed data, their generation process, and their perception by humans should be explicitly exploited in neural-based machine learning frameworks.
Project Aim
The aim of HI-Audio is to build such hybrid deep approaches combining:
- Parameter-efficient and interpretable signal models
- Musicological and physics-based models
- Highly tailored deep neural architectures
Research Directions
The research directions pursued in HI-Audio will exploit novel deterministic and statistical audio and sound environment models with dedicated neural auto-encoders and generative networks. The project will target specific applications including:
- Speech and audio scene analysis
- Music information retrieval
- Sound transformation and synthesis
Financiële details & Tijdlijn
Financiële details
Subsidiebedrag | € 2.482.317 |
Totale projectbegroting | € 2.482.317 |
Tijdlijn
Startdatum | 1-10-2022 |
Einddatum | 30-9-2027 |
Subsidiejaar | 2022 |
Partners & Locaties
Projectpartners
- INSTITUT MINES-TELECOMpenvoerder
Land(en)
Vergelijkbare projecten binnen European Research Council
Project | Regeling | Bedrag | Jaar | Actie |
---|---|---|---|---|
MANUNKIND: Determinants and Dynamics of Collaborative ExploitationThis project aims to develop a game theoretic framework to analyze the psychological and strategic dynamics of collaborative exploitation, informing policies to combat modern slavery. | ERC STG | € 1.497.749 | 2022 | Details |
Elucidating the phenotypic convergence of proliferation reduction under growth-induced pressureThe UnderPressure project aims to investigate how mechanical constraints from 3D crowding affect cell proliferation and signaling in various organisms, with potential applications in reducing cancer chemoresistance. | ERC STG | € 1.498.280 | 2022 | Details |
Uncovering the mechanisms of action of an antiviral bacteriumThis project aims to uncover the mechanisms behind Wolbachia's antiviral protection in insects and develop tools for studying symbiont gene function. | ERC STG | € 1.500.000 | 2023 | Details |
The Ethics of Loneliness and SociabilityThis project aims to develop a normative theory of loneliness by analyzing ethical responsibilities of individuals and societies to prevent and alleviate loneliness, establishing a new philosophical sub-field. | ERC STG | € 1.025.860 | 2023 | Details |
MANUNKIND: Determinants and Dynamics of Collaborative Exploitation
This project aims to develop a game theoretic framework to analyze the psychological and strategic dynamics of collaborative exploitation, informing policies to combat modern slavery.
Elucidating the phenotypic convergence of proliferation reduction under growth-induced pressure
The UnderPressure project aims to investigate how mechanical constraints from 3D crowding affect cell proliferation and signaling in various organisms, with potential applications in reducing cancer chemoresistance.
Uncovering the mechanisms of action of an antiviral bacterium
This project aims to uncover the mechanisms behind Wolbachia's antiviral protection in insects and develop tools for studying symbiont gene function.
The Ethics of Loneliness and Sociability
This project aims to develop a normative theory of loneliness by analyzing ethical responsibilities of individuals and societies to prevent and alleviate loneliness, establishing a new philosophical sub-field.
Vergelijkbare projecten uit andere regelingen
Project | Regeling | Bedrag | Jaar | Actie |
---|---|---|---|---|
Interactive and Explainable Human-Centered AutoMLixAutoML aims to enhance trust and interactivity in automated machine learning by integrating human insights and explanations, fostering democratization and efficiency in ML applications. | ERC STG | € 1.459.763 | 2022 | Details |
Reconciling Classical and Modern (Deep) Machine Learning for Real-World ApplicationsAPHELEIA aims to create robust, interpretable, and efficient machine learning models that require less data by integrating classical methods with modern deep learning, fostering interdisciplinary collaboration. | ERC COG | € 1.999.375 | 2023 | Details |
Natural Auditory SCEnes in Humans and Machines: Establishing the Neural Computations of Everyday HearingThe NASCE project aims to understand auditory scene analysis by developing the Semantic Segmentation Hypothesis, integrating neuroscience and AI to enhance comprehension and applications in machine hearing. | ERC SyG | € 8.622.811 | 2025 | Details |
RESONIKSHet project gebruikt Machine Learning en AI voor snelle, operatorloze akoestische kwaliteitsinspecties, wat de doorlooptijd en kwaliteit verbetert en de reputatie en omzet van bedrijven verhoogt. | MIT R&D Samenwerking | € 188.300 | 2023 | Details |
Interactive and Explainable Human-Centered AutoML
ixAutoML aims to enhance trust and interactivity in automated machine learning by integrating human insights and explanations, fostering democratization and efficiency in ML applications.
Reconciling Classical and Modern (Deep) Machine Learning for Real-World Applications
APHELEIA aims to create robust, interpretable, and efficient machine learning models that require less data by integrating classical methods with modern deep learning, fostering interdisciplinary collaboration.
Natural Auditory SCEnes in Humans and Machines: Establishing the Neural Computations of Everyday Hearing
The NASCE project aims to understand auditory scene analysis by developing the Semantic Segmentation Hypothesis, integrating neuroscience and AI to enhance comprehension and applications in machine hearing.
RESONIKS
Het project gebruikt Machine Learning en AI voor snelle, operatorloze akoestische kwaliteitsinspecties, wat de doorlooptijd en kwaliteit verbetert en de reputatie en omzet van bedrijven verhoogt.