SpatioTemporal Reconstruction of Interacting People for pErceiving Systems

The project aims to develop robust methods for inferring Human-Object Interactions from natural images/videos, enhancing intelligent systems to assist people in task completion.

Subsidie

€ 1.500.000

2025

Projectdetails

Introduction

People constantly interact with objects to perform tasks. To help people accomplish these, computers need to perceive Human-Object Interactions (HOI), and for this, they need to reconstruct HOI from whole-body color images of people interacting with objects or scenes.

Challenges in HOI Reconstruction

This is challenging due to several factors:

Occlusions between bodies and objects
Motion blur
Depth ambiguities
Low image resolution of hands and graspable object parts

There has been significant prior work on estimating 3D humans without considering objects, and estimating 3D objects without considering humans. Little prior work estimates these jointly, but, for tractability, focuses either on:

Interacting hands, ignoring the body
Interacting bodies, ignoring hands

Only recent work addresses dexterous interaction of whole bodies, but instruments bodies with intrusive markers or sensors, and uses non-standard cameras to capture video of interactions.

Limitations of Current Methods

Moreover, reconstruction lacks hand detail that is crucial for grasping, and videos are captured in constrained settings. Consequently, methods trained on these struggle to generalize.

Research Goals

My goal is to infer HOI from natural whole-body images/videos. To this end, I present an ambitious 5-year research agenda with novelties in four directions:

Developing strong generative 3D shape models for objects and humans for a novel HOI representation.
Developing methods that estimate 3D HOI from a color image with rich contact and proximal awareness.
Instilling spatiotemporal reasoning into the heart of these for estimating 4D HOI from color video.
Extending these methods to also infer their own confidence that will be correlated with the reconstruction quality.

Expected Outcomes

The outcome will be novel and robust methods for HOI reconstruction from natural images/videos. This will fill an important gap, enabling future intelligent systems to amplify people’s skills and help them accomplish tasks, e.g., for assistive robots or virtual 3D assistants or trainers.

Financiële details & Tijdlijn

Financiële details

Subsidiebedrag	€ 1.500.000
Totale projectbegroting	€ 1.500.000

Tijdlijn

Startdatum	1-2-2025
Einddatum	31-1-2030
Subsidiejaar	2025

Partners & Locaties

Projectpartners

UNIVERSITEIT VAN AMSTERDAMpenvoerder

Land(en)

Netherlands

Vergelijkbare projecten binnen European Research Council

Project	Regeling	Bedrag	Jaar	Actie
Learning Digital Humans in Motion The project aims to enhance immersive telepresence by using natural language to reconstruct and animate photo-realistic digital humans for interactive communication in AR and VR environments.	ERC Starting...	€ 1.500.000	2025	Details
Learning to synthesize interactive 3D models This project aims to automate the generation of interactive 3D models using deep learning to enhance virtual environments and applications in animation, robotics, and digital entertainment.	ERC Consolid...	€ 2.000.000	2024	Details
Federated foundational models for embodied perception The FRONTIER project aims to develop foundational models for embodied perception by integrating neural networks with physical simulations, enhancing learning efficiency and collaboration across intelligent systems.	ERC Advanced...	€ 2.499.825	2024	Details
A Robust, Real-time, and 3D Human Motion Capture System through Multi-Cameras and AI Real-Move aims to develop a marker-less, real-time 3D human motion tracking system using multi-camera views and AI to enhance workplace safety and ergonomics, reducing costs and improving quality of life.	ERC Proof of...	€ 150.000	2024	Details
Harmonising Observations and Underlying Principles for Visual Data Association Harmony aims to enhance visual data association by addressing global optimality, scalability, and interconnections in complex tasks like 3D shape matching and physics-based scene understanding.	ERC Starting...	€ 1.624.911	2025	Details

ERC Starting...

Learning Digital Humans in Motion

The project aims to enhance immersive telepresence by using natural language to reconstruct and animate photo-realistic digital humans for interactive communication in AR and VR environments.

ERC Starting Grant

€ 1.500.000

2025

Details

ERC Consolid...

Learning to synthesize interactive 3D models

This project aims to automate the generation of interactive 3D models using deep learning to enhance virtual environments and applications in animation, robotics, and digital entertainment.

ERC Consolidator Grant

€ 2.000.000

2024

Details

ERC Advanced...

Federated foundational models for embodied perception

The FRONTIER project aims to develop foundational models for embodied perception by integrating neural networks with physical simulations, enhancing learning efficiency and collaboration across intelligent systems.

ERC Advanced Grant

€ 2.499.825

2024

Details

ERC Proof of...

A Robust, Real-time, and 3D Human Motion Capture System through Multi-Cameras and AI

Real-Move aims to develop a marker-less, real-time 3D human motion tracking system using multi-camera views and AI to enhance workplace safety and ergonomics, reducing costs and improving quality of life.

ERC Proof of Concept

€ 150.000

2024

Details

ERC Starting...

Harmonising Observations and Underlying Principles for Visual Data Association

Harmony aims to enhance visual data association by addressing global optimality, scalability, and interconnections in complex tasks like 3D shape matching and physics-based scene understanding.

ERC Starting Grant

€ 1.624.911

2025

Details

Vergelijkbare projecten uit andere regelingen

Project	Regeling	Bedrag	Jaar	Actie
Ontwikkeling elektronische beeldketen voor chirurgische ingrepen. PS-Medtech en I-Med technology ontwikkelen een hoofd gedragen 3D digitale microscoop om real-time 3D beelden van preoperatieve scans te integreren tijdens chirurgische ingrepen, ter verbetering van precisie en efficiëntie.	Mkb-innovati...	€ 164.780	2018	Details

Mkb-innovati...

Ontwikkeling elektronische beeldketen voor chirurgische ingrepen.

PS-Medtech en I-Med technology ontwikkelen een hoofd gedragen 3D digitale microscoop om real-time 3D beelden van preoperatieve scans te integreren tijdens chirurgische ingrepen, ter verbetering van precisie en efficiëntie.

Mkb-innovatiestimulering Topsectoren R&D Samenwerking

€ 164.780

2018

Details