Flexible Statistical Inference

Develop a flexible statistical theory allowing post-hoc data collection and decision-making with error control, utilizing e-values for improved inference in small samples.

Subsidie
€ 2.499.461
2024

Projectdetails

Introduction

Most statistical methods require that all aspects of data collection and inference are determined in advance, independently of the data. These include when to stop collecting data, what decisions can be made (e.g., accept/reject hypothesis, classify new point), and how to measure their quality (e.g., loss function/significance level). This is wildly at odds with the flexibility required in practice! It makes it impossible to achieve error control in meta-analyses and contributes to the replication crisis in the applied sciences.

Project Overview

I will develop a novel statistical theory in which all data-collection and decision aspects may be unknown in advance, possibly imposed post-hoc, depending on the data itself in unknowable ways. Yet this new theory will provide small-sample frequentist error control, risk bounds, and confidence sets.

Theoretical Foundations

I base myself on far-reaching extensions of e-values/processes. These generalize likelihood ratios and replace p-values, capturing 'evidence' in a much cleaner fashion. As lead author of the first paper (2019) that gave e-values a name and demonstrated their enormous potential, I kicked off and then played an essential role in the extremely rapid development of anytime-valid inference, the one aspect of flexibility that is by now well-studied.

Research Gaps

Still, efficient e-value design principles for many standard problems (e.g., GLMs and other settings with covariates) are lacking, and I will provide them. I will also develop theory for full decision-task flexibility, about which currently almost nothing is known. A major innovation is the e-posterior, which behaves differently from the Bayesian one: if priors are chosen badly, e-posterior based confidence intervals get wide rather than wrong.

Conclusion

Both the existing Wald-Neyman-Pearson and Bayesian statistical theories will arise as special, extreme cases of the new theory, based on perfect (hence unrealistic) knowledge of the data-collection/decision problem or the underlying distribution(s), respectively.

Financiële details & Tijdlijn

Financiële details

Subsidiebedrag€ 2.499.461
Totale projectbegroting€ 2.499.461

Tijdlijn

Startdatum1-11-2024
Einddatum31-10-2029
Subsidiejaar2024

Partners & Locaties

Projectpartners

  • STICHTING NEDERLANDSE WETENSCHAPPELIJK ONDERZOEK INSTITUTENpenvoerder

Land(en)

Netherlands

Vergelijkbare projecten binnen European Research Council

ERC STG

MANUNKIND: Determinants and Dynamics of Collaborative Exploitation

This project aims to develop a game theoretic framework to analyze the psychological and strategic dynamics of collaborative exploitation, informing policies to combat modern slavery.

€ 1.497.749
ERC STG

Elucidating the phenotypic convergence of proliferation reduction under growth-induced pressure

The UnderPressure project aims to investigate how mechanical constraints from 3D crowding affect cell proliferation and signaling in various organisms, with potential applications in reducing cancer chemoresistance.

€ 1.498.280
ERC STG

Uncovering the mechanisms of action of an antiviral bacterium

This project aims to uncover the mechanisms behind Wolbachia's antiviral protection in insects and develop tools for studying symbiont gene function.

€ 1.500.000
ERC STG

The Ethics of Loneliness and Sociability

This project aims to develop a normative theory of loneliness by analyzing ethical responsibilities of individuals and societies to prevent and alleviate loneliness, establishing a new philosophical sub-field.

€ 1.025.860

Vergelijkbare projecten uit andere regelingen

ERC STG

The missing mathematical story of Bayesian uncertainty quantification for big data

This project aims to enhance scalable Bayesian methods through theoretical insights, improving their accuracy and acceptance in real-world applications like medicine and cosmology.

€ 1.492.750
ERC STG

High-dimensional nonparametric Bayesian causal inference

Develop Bayesian nonparametric methods for high-dimensional causal inference to enhance variable selection and uncertainty quantification, enabling reliable causal conclusions across various fields.

€ 1.499.770
ERC STG

Provable Scalability for high-dimensional Bayesian Learning

This project develops a mathematical theory for scalable Bayesian learning methods, integrating computational and statistical insights to enhance algorithm efficiency and applicability in high-dimensional models.

€ 1.488.673
ERC STG

Uniting Statistical Testing and Machine Learning for Safe Predictions

The project aims to enhance the interpretability and reliability of machine learning predictions by integrating statistical methods to establish robust error bounds and ensure safe deployment in real-world applications.

€ 1.500.000