Zubair Irshad

PhD Candidate at Georgia Tech
Researcher at Institute of Robotics & Intelligent Machines, Georgia Tech



I am a PhD Candidate at Georgia Institute of Technology. I’m working with Dr. Zsolt Kira at the Institute of Robotics & Intelligent Machines. I also closely collaborate with Sergey ZakharovRares Ambrus and Adrien Gaidon from Toyota Research Institute. My current research focuses on 3D perception, Scene Understanding and Embodied AI and covers topics such as neural implicit reconstruction, efficient 3D object detection, 6D pose estimation, visual embodied navigation and semantic & spatial reasoning.

I have been fortunate to spend time at Toyota Research Institute (ML-R) working on compositionality of neural-radiance based representations and implicit models for 3D shape, appearance and pose optimization. I also spent the wonderful summers before at Toyota Research Institute (Robotics) (Summer’21) and SRI International (Summer’20) working on 3D perception, scene understanding and semantic and spatial reasoning for embodied agents.

Feel free to contact me to talk anything related to Robotics, Deep Learning or related to some of my projects. Below you will find my projects portfolio. You can find my resume here





CARTO: Category and Joint Agnostic Reconstruction of ARTiculated Objects

Nick Heppert, Muhammad Zubair Irshad, Sergey Zakharov, Katherine Liu, Rares Ambrus, Jeannette Bohg, Abhinav Valada, Thomas Kollar

Conference on Computer Vision and Pattern Recognition, CVPR 2023

ShAPO: Implicit Representations for Multi-Object Shape, Appearance, and Pose Optimization

Muhammad Zubair Irshad*, Sergey Zakharov*, Rares Ambrus, Thomas Kollar, Zsolt Kira, Adrien Gaidon

European Conference on Computer Vision, ECCV 2022

CenterSnap: Single-Shot Multi-Object 3D Shape Reconstruction and Categorical 6D Pose and Size Estimation

Muhammad Zubair Irshad, Thomas Kollar, Michael Laskey, Kevin Stone, Zsolt Kira
IEEE International Conference on Robotics and Automation, ICRA 2022
Project PagearXiv Code Video Poster Bibtex

Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation

Muhammad Zubair Irshad, Chih-Yao Ma, Zsolt Kira

IEEE International Conference on Robotics and Automation, ICRA 2021
Project Page arXiv Code VideoPoster Bibtex

SASRA: Semantically-aware Spatio-Temporal Reasoning Agent for Vision-and-Language Navigation

Muhammad Zubair Irshad, Niluthpol Mithun, Zachary Seymour, Han-Pang Chiu, Supun Samarasekera, Rakesh Kumar International Conference on Pattern Recognition, ICPR 2022

Deep Reinforcement Learning Agents

Deep Reinforcement Learning based control of complex robotic agents

Habitat Point Goal Navigation

Embodied Visual Navigation in Habitat


Learning inverse dynamics of 7-DOF Robot Arm


Complex robot maze navigation using image classification and ROS


Vehicle Control for Autonomous Driving


Environment perception stack for Self Driving Cars


Visual Odometry for Autonomous Driving


End to end imitation learning of dynamically unstable systems