Skip to main content

3D Reconstruction of Human and Objects in Dynamic Scenes from a Monocular Video

Primary supervisor

Hamid Rezatofighi

3D localisation, reconstruction and mapping of the objects and human body in dynamic environments are important steps towards high-level 3D scene understanding, which has many applications in autonomous driving, robotics interaction and navigation. This project focuses on creating the scene representation in 3D which gives a complete scene understanding i.e pose, shape and size of different scene elements (humans and objects) and their spatio-temporal relationship.

Student cohort

Double Semester

URLs/references

https://vl4ai.erc.monash.edu/research.html

https://arxiv.org/pdf/2012.01591.pdf

https://arxiv.org/pdf/2012.05360.pdf

 

Required knowledge

  1. Good coding skills in a variety of coding languages
  2. Previous experience working with deep learning models for different tasks
  3. ​​​​​Proficient programming skills in Python and one of the main deep learning libraries (e.g., TensorFlow, PyTorch, Keras)