Why do infants learn language so fast? A reverse engineering approach
This project develops a computational model to explore how infants efficiently learn language through statistical learning and three additional mechanisms, aiming to produce comparable outcomes to children's language acquisition.
Projectdetails
Introduction
How do infants learn their first language(s)? The popular yet controversial 'statistical learning hypothesis' posits that they learn by gradually collecting statistics over their language inputs. This is strikingly similar to how current AI's Large Language Models (LLMs) learn and shows that simple statistical mechanisms may be sufficient to attain adult-like language competence.
Language Input Estimates
But does it? Estimates of language inputs to children show that by age 3, they have received 2 or 3 orders of magnitude less data than LLMs of similar performance. The gap grows exponentially larger with children's age. Worse, when models are fed with speech instead of text, they learn even slower.
Research Question
How are infants such efficient learners? This project tests the hypothesis that in addition to statistical learning, infants benefit from three mechanisms that accelerate their learning rate:
- They are born with a vocal tract which helps them understand the link between abstract motor commands and speech sounds, and decode noisy speech inputs more efficiently.
- They have an episodic memory enabling them to learn from unique events, instead of gradually learning from thousands of repetitions.
- They start with an evolved learning architecture optimized for generalization from few and noisy inputs.
Methodology
Our approach is to build a computational model of the learner (an infant simulator), which when fed by realistic language input produces outcome measures comparable to children's (laboratory experiments, vocabulary estimates). This gives a quantitative estimate of the efficiency of each of the three mechanisms, as well as new testable predictions.
Language Focus
We start with English and French that have both accessible large annotated speech corpora and documented acquisition landmarks and focus on the first three years of life.
Community Building
We then help build similar resources across a larger set of languages by fostering a cross-disciplinary community that shares tools, data, and analysis methods.
Financiële details & Tijdlijn
Financiële details
Subsidiebedrag | € 2.494.625 |
Totale projectbegroting | € 2.494.625 |
Tijdlijn
Startdatum | 1-1-2025 |
Einddatum | 31-12-2029 |
Subsidiejaar | 2025 |
Partners & Locaties
Projectpartners
- ECOLE DES HAUTES ETUDES EN SCIENCES SOCIALESpenvoerder
- ECOLE NORMALE SUPERIEURE
Land(en)
Vergelijkbare projecten binnen European Research Council
Project | Regeling | Bedrag | Jaar | Actie |
---|---|---|---|---|
MANUNKIND: Determinants and Dynamics of Collaborative ExploitationThis project aims to develop a game theoretic framework to analyze the psychological and strategic dynamics of collaborative exploitation, informing policies to combat modern slavery. | ERC STG | € 1.497.749 | 2022 | Details |
Elucidating the phenotypic convergence of proliferation reduction under growth-induced pressureThe UnderPressure project aims to investigate how mechanical constraints from 3D crowding affect cell proliferation and signaling in various organisms, with potential applications in reducing cancer chemoresistance. | ERC STG | € 1.498.280 | 2022 | Details |
Uncovering the mechanisms of action of an antiviral bacteriumThis project aims to uncover the mechanisms behind Wolbachia's antiviral protection in insects and develop tools for studying symbiont gene function. | ERC STG | € 1.500.000 | 2023 | Details |
The Ethics of Loneliness and SociabilityThis project aims to develop a normative theory of loneliness by analyzing ethical responsibilities of individuals and societies to prevent and alleviate loneliness, establishing a new philosophical sub-field. | ERC STG | € 1.025.860 | 2023 | Details |
MANUNKIND: Determinants and Dynamics of Collaborative Exploitation
This project aims to develop a game theoretic framework to analyze the psychological and strategic dynamics of collaborative exploitation, informing policies to combat modern slavery.
Elucidating the phenotypic convergence of proliferation reduction under growth-induced pressure
The UnderPressure project aims to investigate how mechanical constraints from 3D crowding affect cell proliferation and signaling in various organisms, with potential applications in reducing cancer chemoresistance.
Uncovering the mechanisms of action of an antiviral bacterium
This project aims to uncover the mechanisms behind Wolbachia's antiviral protection in insects and develop tools for studying symbiont gene function.
The Ethics of Loneliness and Sociability
This project aims to develop a normative theory of loneliness by analyzing ethical responsibilities of individuals and societies to prevent and alleviate loneliness, establishing a new philosophical sub-field.
Vergelijkbare projecten uit andere regelingen
Project | Regeling | Bedrag | Jaar | Actie |
---|---|---|---|---|
Multiple routes to memory for a second language: Individual and situational factorsThis project investigates alternative routes to second language acquisition by applying memory research theories to understand individual and situational differences in learning processes. | ERC COG | € 2.000.000 | 2022 | Details |
Infant verbal Memory in Development: a window for understanding language constraints and brain plasticity from birthIN-MIND investigates the development of verbal memory in infants to understand its role in language learning, using innovative methods to identify memory capacities and intervention windows. | ERC STG | € 1.499.798 | 2022 | Details |
DEep COgnition Learning for LAnguage GEnerationThis project aims to enhance NLP models by integrating machine learning, cognitive science, and structured memory to improve out-of-domain generalization and contextual understanding in language generation tasks. | ERC COG | € 1.999.595 | 2023 | Details |
Gates to LanguageThe GALA project investigates the biological mechanisms of language acquisition in humans and nonhuman species to uncover why only humans can learn language. | ERC STG | € 1.490.057 | 2024 | Details |
Multiple routes to memory for a second language: Individual and situational factors
This project investigates alternative routes to second language acquisition by applying memory research theories to understand individual and situational differences in learning processes.
Infant verbal Memory in Development: a window for understanding language constraints and brain plasticity from birth
IN-MIND investigates the development of verbal memory in infants to understand its role in language learning, using innovative methods to identify memory capacities and intervention windows.
DEep COgnition Learning for LAnguage GEneration
This project aims to enhance NLP models by integrating machine learning, cognitive science, and structured memory to improve out-of-domain generalization and contextual understanding in language generation tasks.
Gates to Language
The GALA project investigates the biological mechanisms of language acquisition in humans and nonhuman species to uncover why only humans can learn language.