SpatioTemporal Reconstruction of Interacting People for pErceiving Systems

The project aims to develop robust methods for inferring Human-Object Interactions from natural images/videos, enhancing intelligent systems to assist people in task completion.

Subsidie
€ 1.500.000
2025

Projectdetails

Introduction

People constantly interact with objects to perform tasks. To help people accomplish these, computers need to perceive Human-Object Interactions (HOI), and for this, they need to reconstruct HOI from whole-body color images of people interacting with objects or scenes.

Challenges in HOI Reconstruction

This is challenging due to several factors:

  • Occlusions between bodies and objects
  • Motion blur
  • Depth ambiguities
  • Low image resolution of hands and graspable object parts

There has been significant prior work on estimating 3D humans without considering objects, and estimating 3D objects without considering humans. Little prior work estimates these jointly, but, for tractability, focuses either on:

  1. Interacting hands, ignoring the body
  2. Interacting bodies, ignoring hands

Only recent work addresses dexterous interaction of whole bodies, but instruments bodies with intrusive markers or sensors, and uses non-standard cameras to capture video of interactions.

Limitations of Current Methods

Moreover, reconstruction lacks hand detail that is crucial for grasping, and videos are captured in constrained settings. Consequently, methods trained on these struggle to generalize.

Research Goals

My goal is to infer HOI from natural whole-body images/videos. To this end, I present an ambitious 5-year research agenda with novelties in four directions:

  1. Developing strong generative 3D shape models for objects and humans for a novel HOI representation.
  2. Developing methods that estimate 3D HOI from a color image with rich contact and proximal awareness.
  3. Instilling spatiotemporal reasoning into the heart of these for estimating 4D HOI from color video.
  4. Extending these methods to also infer their own confidence that will be correlated with the reconstruction quality.

Expected Outcomes

The outcome will be novel and robust methods for HOI reconstruction from natural images/videos. This will fill an important gap, enabling future intelligent systems to amplify people’s skills and help them accomplish tasks, e.g., for assistive robots or virtual 3D assistants or trainers.

Financiële details & Tijdlijn

Financiële details

Subsidiebedrag€ 1.500.000
Totale projectbegroting€ 1.500.000

Tijdlijn

Startdatum1-2-2025
Einddatum31-1-2030
Subsidiejaar2025

Partners & Locaties

Projectpartners

  • UNIVERSITEIT VAN AMSTERDAMpenvoerder

Land(en)

Netherlands

Vergelijkbare projecten binnen European Research Council

ERC Starting...

Learning Digital Humans in Motion

The project aims to enhance immersive telepresence by using natural language to reconstruct and animate photo-realistic digital humans for interactive communication in AR and VR environments.

€ 1.500.000
ERC Consolid...

Learning to synthesize interactive 3D models

This project aims to automate the generation of interactive 3D models using deep learning to enhance virtual environments and applications in animation, robotics, and digital entertainment.

€ 2.000.000
ERC Advanced...

Federated foundational models for embodied perception

The FRONTIER project aims to develop foundational models for embodied perception by integrating neural networks with physical simulations, enhancing learning efficiency and collaboration across intelligent systems.

€ 2.499.825
ERC Proof of...

A Robust, Real-time, and 3D Human Motion Capture System through Multi-Cameras and AI

Real-Move aims to develop a marker-less, real-time 3D human motion tracking system using multi-camera views and AI to enhance workplace safety and ergonomics, reducing costs and improving quality of life.

€ 150.000
ERC Starting...

Harmonising Observations and Underlying Principles for Visual Data Association

Harmony aims to enhance visual data association by addressing global optimality, scalability, and interconnections in complex tasks like 3D shape matching and physics-based scene understanding.

€ 1.624.911

Vergelijkbare projecten uit andere regelingen

Mkb-innovati...

Ontwikkeling elektronische beeldketen voor chirurgische ingrepen.

PS-Medtech en I-Med technology ontwikkelen een hoofd gedragen 3D digitale microscoop om real-time 3D beelden van preoperatieve scans te integreren tijdens chirurgische ingrepen, ter verbetering van precisie en efficiëntie.

€ 164.780