SpatioTemporal Reconstruction of Interacting People for pErceiving Systems
The project aims to develop robust methods for inferring Human-Object Interactions from natural images/videos, enhancing intelligent systems to assist people in task completion.
Projectdetails
Introduction
People constantly interact with objects to perform tasks. To help people accomplish these, computers need to perceive Human-Object Interactions (HOI), and for this, they need to reconstruct HOI from whole-body color images of people interacting with objects or scenes.
Challenges in HOI Reconstruction
This is challenging due to several factors:
- Occlusions between bodies and objects
- Motion blur
- Depth ambiguities
- Low image resolution of hands and graspable object parts
There has been significant prior work on estimating 3D humans without considering objects, and estimating 3D objects without considering humans. Little prior work estimates these jointly, but, for tractability, focuses either on:
- Interacting hands, ignoring the body
- Interacting bodies, ignoring hands
Only recent work addresses dexterous interaction of whole bodies, but instruments bodies with intrusive markers or sensors, and uses non-standard cameras to capture video of interactions.
Limitations of Current Methods
Moreover, reconstruction lacks hand detail that is crucial for grasping, and videos are captured in constrained settings. Consequently, methods trained on these struggle to generalize.
Research Goals
My goal is to infer HOI from natural whole-body images/videos. To this end, I present an ambitious 5-year research agenda with novelties in four directions:
- Developing strong generative 3D shape models for objects and humans for a novel HOI representation.
- Developing methods that estimate 3D HOI from a color image with rich contact and proximal awareness.
- Instilling spatiotemporal reasoning into the heart of these for estimating 4D HOI from color video.
- Extending these methods to also infer their own confidence that will be correlated with the reconstruction quality.
Expected Outcomes
The outcome will be novel and robust methods for HOI reconstruction from natural images/videos. This will fill an important gap, enabling future intelligent systems to amplify people’s skills and help them accomplish tasks, e.g., for assistive robots or virtual 3D assistants or trainers.
Financiële details & Tijdlijn
Financiële details
Subsidiebedrag | € 1.500.000 |
Totale projectbegroting | € 1.500.000 |
Tijdlijn
Startdatum | 1-2-2025 |
Einddatum | 31-1-2030 |
Subsidiejaar | 2025 |
Partners & Locaties
Projectpartners
- UNIVERSITEIT VAN AMSTERDAMpenvoerder
Land(en)
Vergelijkbare projecten binnen European Research Council
Project | Regeling | Bedrag | Jaar | Actie |
---|---|---|---|---|
Learning Digital Humans in MotionThe project aims to enhance immersive telepresence by using natural language to reconstruct and animate photo-realistic digital humans for interactive communication in AR and VR environments. | ERC Starting... | € 1.500.000 | 2025 | Details |
Learning to synthesize interactive 3D modelsThis project aims to automate the generation of interactive 3D models using deep learning to enhance virtual environments and applications in animation, robotics, and digital entertainment. | ERC Consolid... | € 2.000.000 | 2024 | Details |
Federated foundational models for embodied perceptionThe FRONTIER project aims to develop foundational models for embodied perception by integrating neural networks with physical simulations, enhancing learning efficiency and collaboration across intelligent systems. | ERC Advanced... | € 2.499.825 | 2024 | Details |
A Robust, Real-time, and 3D Human Motion Capture System through Multi-Cameras and AIReal-Move aims to develop a marker-less, real-time 3D human motion tracking system using multi-camera views and AI to enhance workplace safety and ergonomics, reducing costs and improving quality of life. | ERC Proof of... | € 150.000 | 2024 | Details |
Harmonising Observations and Underlying Principles for Visual Data AssociationHarmony aims to enhance visual data association by addressing global optimality, scalability, and interconnections in complex tasks like 3D shape matching and physics-based scene understanding. | ERC Starting... | € 1.624.911 | 2025 | Details |
Learning Digital Humans in Motion
The project aims to enhance immersive telepresence by using natural language to reconstruct and animate photo-realistic digital humans for interactive communication in AR and VR environments.
Learning to synthesize interactive 3D models
This project aims to automate the generation of interactive 3D models using deep learning to enhance virtual environments and applications in animation, robotics, and digital entertainment.
Federated foundational models for embodied perception
The FRONTIER project aims to develop foundational models for embodied perception by integrating neural networks with physical simulations, enhancing learning efficiency and collaboration across intelligent systems.
A Robust, Real-time, and 3D Human Motion Capture System through Multi-Cameras and AI
Real-Move aims to develop a marker-less, real-time 3D human motion tracking system using multi-camera views and AI to enhance workplace safety and ergonomics, reducing costs and improving quality of life.
Harmonising Observations and Underlying Principles for Visual Data Association
Harmony aims to enhance visual data association by addressing global optimality, scalability, and interconnections in complex tasks like 3D shape matching and physics-based scene understanding.
Vergelijkbare projecten uit andere regelingen
Project | Regeling | Bedrag | Jaar | Actie |
---|---|---|---|---|
Ontwikkeling elektronische beeldketen voor chirurgische ingrepen.PS-Medtech en I-Med technology ontwikkelen een hoofd gedragen 3D digitale microscoop om real-time 3D beelden van preoperatieve scans te integreren tijdens chirurgische ingrepen, ter verbetering van precisie en efficiëntie. | Mkb-innovati... | € 164.780 | 2018 | Details |
Ontwikkeling elektronische beeldketen voor chirurgische ingrepen.
PS-Medtech en I-Med technology ontwikkelen een hoofd gedragen 3D digitale microscoop om real-time 3D beelden van preoperatieve scans te integreren tijdens chirurgische ingrepen, ter verbetering van precisie en efficiëntie.