Zubair Irshad



PhD Candidate, Deep Learning and Computer Vision
Georgia Institute of Technology


I am a PhD Candidate at Georgia Institute of Technology, working with Dr. Zsolt Kira at the Robotics, Perception and Learning (RIPL) Lab. I also collaborate closely with Rares Ambrus, Sergey Zakharov, Katherine Liu and Adrien Gaidon from Toyota Research Institute. My current research focuses on 3D perception, scene understanding and embodied AI, and covers topics such as neural implicit reconstruction (e.g. NeRF), efficient 3D object detection, 6D pose estimation and visual embodied navigation.

I have been fortunate to spend time at Toyota Research Institute (ML-R) working on the compositionality of neural radiance-based representations (NeRF) and implicit models for 3D shape, appearance and pose optimization. Before that, I spent wonderful summers at Toyota Research Institute (Robotics) (Summer’21) and SRI International (Summer’20) working on 3D perception, scene understanding, and semantic and spatial reasoning for embodied agents.

Feel free to contact me to talk about anything related to 3D Vision, Robotics, NeRFs, Deep Learning or any of my projects. Below you will find my project portfolio. You can find my resume here.

PhD, Robotics and Mechanical Engineering, 2019-2023

M.S., Robotics and Mechanical Engineering, 2017-2019

Fulbright Scholar, 2017-2019

Deep Learning Research Intern, Jan-Aug 2022

Robotics & Deep Learning Research Intern, May-Aug 2021

Computer Vision Research Intern, May-Aug 2020


[Jul 2023]

Our paper, NeO 360, was accepted to ICCV’23! Grateful to have a trio of papers accepted to ECCV, CVPR and now ICCV

[Jul 2023]

Awesome Implicit NeRF Robotics reached 800 stars on GitHub ⭐

[Jul 2023]

Served as a reviewer and reviewed 19 papers so far this year for NeurIPS’23, CVPR’23, ICCV’23 and ICRA’23

[Jun 2023]

Attended CVPR 2023 virtually (Poster presentation of our paper, CARTO)

[Jun 2023]

Gave invited talks on Neural Fields in Robotics (Parts 1 and 2) at the 3D Deep Learning Reading Group

[Apr 2023]

Started as a mentor at Fatima Fellowship, supported by Hugging Face

[Apr 2023]

Passed my PhD proposal defense titled ‘Inductive biases for object and agent-centric neural 3D scene representations’

[Apr 2023]

Gave a guest lecture at Georgia Tech’s Deep Learning class on ‘Learning Object-Centric Neural 3D Scene Representations’

[Feb 2023]

Our paper, CARTO, on fast articulated object reconstruction, accepted into CVPR’23

[Oct 2022]

Attended ECCV’22 virtually (Poster presentation of our paper, ShAPO)

[Aug 2022]

Awarded GRA Funding (with Dr. Zsolt Kira) from Toyota Research Institute for my PhD

[Jul 2022]

Our paper, ShAPO, on categorical object reconstruction and 6D pose estimation, accepted into ECCV’22

[May 2022]

Our paper, SASRA, on semantic mapping for Vision-and-Language Navigation, accepted to ICPR’22

[May 2022]

Attended ICRA’22 in person. Gave a talk on our paper, CenterSnap

[Jan 2022]

Started my second internship at Toyota Research Institute, with the Machine Learning team in the Bay Area, California

[May 2021]

Attended ICRA’21 virtually. Gave a talk on our paper, Robo-VLN

[Jul 2021]

Started my first internship at Toyota Research Institute, with the Robotics perception team in the Bay Area, California

[Jan 2021]

Our paper, Robo-VLN, accepted to ICRA’21

[May 2020]

Started a summer internship at SRI International, with the CVT team in Princeton, New Jersey

[Nov 2019]

Passed PhD Qualifying Exams at Georgia Tech

[Aug 2019]

The beginning of my PhD program


FSD: Fast Self-Supervised Single RGB-D to Categorical 3D Objects

Mayank Lunayach, Sergey Zakharov, Dian Chen, Rares Ambrus, Zsolt Kira, Muhammad Zubair Irshad



NeO 360: Neural Fields for Sparse View Synthesis of Outdoor Scenes

Muhammad Zubair Irshad, Sergey Zakharov, Katherine Liu, Vitor Guizilini, Thomas Kollar, Adrien Gaidon, Zsolt Kira*, Rares Ambrus*

International Conference on Computer Vision, ICCV 2023


CARTO: Category and Joint Agnostic Reconstruction of ARTiculated Objects

Nick Heppert, Muhammad Zubair Irshad, Sergey Zakharov, Katherine Liu, Rares Ambrus, Jeannette Bohg, Abhinav Valada, Thomas Kollar

Conference on Computer Vision and Pattern Recognition, CVPR 2023

ShAPO: Implicit Representations for Multi-Object Shape, Appearance, and Pose Optimization

Muhammad Zubair Irshad*, Sergey Zakharov*, Rares Ambrus, Thomas Kollar, Zsolt Kira, Adrien Gaidon

European Conference on Computer Vision, ECCV 2022

CenterSnap: Single-Shot Multi-Object 3D Shape Reconstruction and Categorical 6D Pose and Size Estimation

Muhammad Zubair Irshad, Thomas Kollar, Michael Laskey, Kevin Stone, Zsolt Kira

IEEE International Conference on Robotics and Automation, ICRA 2022

Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation

Muhammad Zubair Irshad, Chih-Yao Ma, Zsolt Kira

IEEE International Conference on Robotics and Automation, ICRA 2021

SASRA: Semantically-aware Spatio-Temporal Reasoning Agent for Vision-and-Language Navigation

Muhammad Zubair Irshad, Niluthpol Mithun, Zachary Seymour, Han-Pang Chiu, Supun Samarasekera, Rakesh Kumar

International Conference on Pattern Recognition, ICPR 2022


Deep Reinforcement Learning Agents

Deep Reinforcement Learning based control of complex robotic agents

Habitat Point Goal Navigation

Embodied Visual Navigation in Habitat


Learning inverse dynamics of 7-DOF Robot Arm


Complex robot maze navigation using image classification and ROS


Vehicle Control for Autonomous Driving


Environment perception stack for Self Driving Cars


Visual Odometry for Autonomous Driving


End to end imitation learning of dynamically unstable systems