Researcher at Institute of Robotics & Intelligent Machines, Georgia Tech
ABOUT ME !
I am a PhD Candidate at Georgia Institute of Technology. I’m working with Dr. Zsolt Kira at the Institute of Robotics & Intelligent Machines. My current research focuses on 3D perception, Scene Understanding and Embodied AI and covers topics such as neural implicit reconstruction, efficient 3D object detection, 6D pose estimation, visual embodied navigation and semantic & spatial reasoning.
This spring, I am a research intern at Toyota Research Institute (ML-R) with Adrien Gaidon, Rares Ambrus and Sergey Zakharov working on generalizable object representation for 3D perception. I have been fortunate to spend time at Toyota Research Institute (Robotics) as a Research Intern. I worked on 3D perception, scene understanding and generalized behaviors for robotics manipulation with Thomas Kollar and Michael Laskey. The summer before, I interned at SRI International in the Vision and Learning Lab on multi-modal semantic and spatial reasoning for embodied agents.
Feel free to contact me to talk anything related to Robotics, Deep Learning or related to some of my projects. Below you will find my projects portfolio. You can find my resume here
EDUCATION
- PhD, Robotics/AI and Mechanical Engineering, 2023 (Expected)
Georgia Institute of Technology - M.S, Robotics/AI & Mechanical Engineering & , 2019
Georgia Institute of Technology
EXPERIENCE
-
Research Intern - CV and Deep Learning, Jan-Aug 2022
Toyota Research Institute (TRI) - MLR - Research Intern - Robotics, May-Aug 2021
Toyota Research Institute (TRI) - Robotics - Deep Learning Research Intern, May-Aug 2020
Stanford Research Institute (SRI) International - Graduate Research Assistant, Jan 2019 - Present
Georgia Institute of Technology
SELECTED PUBLICATIONS
ShAPO: Implicit Representations for Multi-Object Shape, Appearance, and Pose Optimization
Muhammad Zubair Irshad*, Sergey Zakharov*, Rares Ambrus, Thomas Kollar, Zsolt Kira, Adrien Gaidon
CenterSnap: Single-Shot Multi-Object 3D Shape Reconstruction and Categorical 6D Pose and Size Estimation
IEEE International Conference on Robotics and Automation, ICRA 2022
Project PagearXiv Code Video Poster



Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation
IEEE International Conference on Robotics and Automation, ICRA 2021
Project Page arXiv Code VideoPoster



SASRA: Semantically-aware Spatio-Temporal Reasoning Agent for Vision-and-Language Navigation
International Conference on Pattern Recognition, ICPR 2022



Deep Reinforcement Learning based control of complex robotic agents
Details


Embodied Visual Navigation in Habitat
Details


Learning inverse dynamics of 7-DOF Robot Arm
Details


Complex robot maze navigation using image classification and ROS
Details


Vehicle Control for Autonomous Driving
Details


Environment perception stack for Self Driving Cars
Details


Visual Odometry for Autonomous Driving
Details

