ABOUT ME!

I am a Machine Learning Research Scientist at Toyota Research Institute working on Large Behavior Models and Deep-learning based 3D perception systems for Robotics. More recently, I have been a core contributor to the LBM 1.0 multi-task policy learning and post-training efforts. I received my PhD in the George W. Woodruff School of Mechanical Engineering at Georgia Institute of Technology. I was advised by Dr. Zsolt Kira from the Robotics, Perception and Learning (RIPL) Lab. My PhD thesis is titled Learning 3D Robotics Perception using Inductive Priors. My thesis is available here and the dissertation defense video here. My current research covers the following topics:

  • 3D Perception: 3D Reconstruction, NeRF, Gaussian Splatting, Representation Learning, 6D pose estimation
  • Generative AI: Diffusion Models, Generative Action Models, Data Augmentation and Data Generation for Robotics
  • Multimodal AI: Vision-and-Language, Embodied AI, Semantic Understanding, Spatio-temporal learning

I'm also a technical reviewer for Machine Learning and Robotics Conferences including CVPR, ECCV, ICCV, ICLR, Neurips, ICRA, IROS, RSS, RA-L and the lead co-organizer of RoboNeRF Workshop at ICRA'24 and Robo 3D-VLM workshop at CVPR'25! Below you will find my projects portfolio. You can find my updated resume here.

Affiliations

TRI 2021 – Present
Habib University 2025–2026 Ongoing
Georgia Tech 2017 – 2023
Fulbright 2017 – 2019
SRI International Summer 2020

NEWS

[Jan 2026]Teaching undergraduate Computer Vision Spring 2026!
[Jan 2026]Beyond Teleop workshop accepted to ICRA'26!. Call for papers coming soon!
[Nov 2025]FastMap accepted to 3DV'26!
[Oct 2025]Started as an Associate Editor for Robot Learning at ICRA'26!
[Sep 2025]Attending the Conference on Robot Learning in Seoul, Korea to help present Real2Render2Real
[Aug 2025]Blog Post released integrating our first TRI LBM with Boston Dynamics' Atlas!
[Jul 2025]Large Behavior Models is up on arXiv—I'm a core contributor to multi-task policy learning and post-training efforts.
[Jul 2025]Invited talk at GIKI on Embodied Artificial Intelligence for 3D Perception leveraging Inductive Priors.
[Jun 2025]Two ICCV'25 papers on 3D Gaussian Splatting for object-goal navigation and 3D articulated object reconstruction.
[Apr 2025]Released Posed-DROID, improved camera calibration for the DROID dataset.
[Jan 2025]Our paper on object-centric Gaussian Splats for 3D tracking i.e. POGS has been accepted to ICRA'25!
[Dec 2024]CVPR Workshop accepted on 3D Vision Language Models (VLMs) for Robotics. Call for papers coming soon!
[Oct 2024]Our comprehensive survey on Neural Fields in Robotics, is now available on arXiv!
[Sep 2024]RoVi-Aug, diffusion-based data-augmentation for robotics manipulation, accepted to CORL'24!
[Jul 2024]NeRF-MAE, large-scale pretraining using NeRFs, accepted to ECCV'24!

FEATURED PUBLICATIONS

(For a complete list, please see my Google Scholar Profile)

A Careful Examination of Large Behavior Models for Multitask Dexterous Manipulation

Jose Barreiros*, Andrew Beaulieu*, Aditya Bhat*, Rick Cory*, … Muhammad Zubair Irshad*, … Rares Ambrus, Kerri Fetzer-Borelli, Ben Burchfiel, Hadas Kress-Gazit, Siyuan Feng, Stacie Ford, Russ Tedrake.

*Primary contributors (listed first, alphabetical)

@article{lbmtri2025,
  title={A Careful Examination of Large Behavior Models for Multitask Dexterous Manipulation}, 
  author={TRI LBM Team and Jose Barreiros and Andrew Beaulieu and Aditya Bhat and Rick Cory and Eric Cousineau and Hongkai Dai and Ching-Hsin Fang and Kunimatsu Hashimoto and Muhammad Zubair Irshad and Masha Itkina and Naveen Kuppuswamy and Kuan-Hui Lee and Katherine Liu and Dale McConachie and Ian McMahon and Haruki Nishimura and Calder Phillips-Grafflin and Charles Richter and Paarth Shah and Krishnan Srinivasan and Blake Wulfe and Chen Xu and Mengchao Zhang and Alex Alspach and Maya Angeles and Kushal Arora and Vitor Campagnolo Guizilini and Alejandro Castro and Dian Chen and Ting-Sheng Chu and Sam Creasey and Sean Curtis and Richard Denitto and Emma Dixon and Eric Dusel and Matthew Ferreira and Aimee Goncalves and Grant Gould and Damrong Guoy and Swati Gupta and Xuchen Han and Kyle Hatch and Brendan Hathaway and Allison Henry and Hillel Hochsztein and Phoebe Horgan and Shun Iwase and Donovon Jackson and Siddharth Karamcheti and Sedrick Keh and Joseph Masterjohn and Jean Mercat and Patrick Miller and Paul Mitiguy and Tony Nguyen and Jeremy Nimmer and Yuki Noguchi and Reko Ong and Aykut Onol and Owen Pfannenstiehl and Richard Poyner and Leticia Priebe Mendes Rocha and Gordon Richardson and Christopher Rodriguez and Derick Seale and Michael Sherman and Mariah Smith-Jones and David Tago and Pavel Tokmakov and Matthew Tran and Basile Van Hoorick and Igor Vasiljevic and Sergey Zakharov and Mark Zolotas and Rares Ambrus and Kerri Fetzer-Borelli and Benjamin Burchfiel and Hadas Kress-Gazit and Siyuan Feng and Stacie Ford and Russ Tedrake},
  year={2025},
  eprint={2507.05331},
  archivePrefix={arXiv},
  primaryClass={cs.RO},
  url={https://arxiv.org/abs/2507.05331}, 
}

arXiv 2025

PolaRiS: Scalable Real-to-Sim Evaluations for Generalist Robot Policies

Arhan Jain, Mingtong Zhang, Kanav Arora, William Chen, Marcel Torne, Muhammad Zubair Irshad, Sergey Zakharov, Yue Wang, Sergey Levine, Chelsea Finn, Wei-Chiu Ma, Dhruv Shah, Abhishek Gupta, Karl Pertsch

@misc{polaris,
        title   = {PolaRiS: Scalable Real-to-Sim Evaluations for Generalist Robot Policies},
        author  = {Jain, Arhan and Zhang, Mingtong and Arora, Kanav and Chen, William and Torne, Marcel and Irshad, Muhammad Zubair and Zakharov, Sergey and Wang, Yue and Levine, Sergey and Finn, Chelsea and Ma, Wei-Chiu and Shah, Dhruv and Gupta, Abhishek and Pertsch, Karl},
        year    = {2025}
}

RSS 2024

Posed-DROID: Scaling-Up Automatic Camera Calibration for DROID dataset

Muhammad Zubair Irshad, Vitor Guizilini, Alexander Khazatsky, Karl Pertsch

Part of DROID paper

@misc{irshad2024scalingupcalibration,
title={Scaling-Up Automatic Camera Calibration for DROID Dataset: A study using Foundation models and Existing Deep-Learning tools},
author={Muhammad Zubair Irshad and Vitor Guizilini and Alexander Khazatsky and Karl Pertsch},
year={2024},
howpublished={\url{medium.com/p/4ddfc45361d3}},
note={Medium blog post}
}

CVPR 2025

Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion

Vitor Guizilini, Muhammad Zubair Irshad, Dian Chen, Greg Shakhnarovich, Rares Ambrus

@misc{guizilini2025zeroshotnovelviewdepth,
        title={Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion}, 
        author={Vitor Guizilini and Muhammad Zubair Irshad and Dian Chen and Greg Shakhnarovich and Rares Ambrus},
        year={2025},
        eprint={2501.18804},
        archivePrefix={arXiv},
        primaryClass={cs.CV}
}
Embodied Splat

ICCV 2025 | EAI Workshop CVPR 2025

EmbodiedSplat: Personalized Real-to-Sim-to-Real Navigation with Gaussian Splats from a Mobile Device

Gunjan Chhablani, Xiaomeng Ye, Rynaa Grover, Muhammad Zubair Irshad, Zsolt Kira

@InProceedings{Chhablani_2025_ICCV,
    author    = {Chhablani, Gunjan and Ye, Xiaomeng and Irshad, Muhammad Zubair and Kira, Zsolt},
    title     = {EmbodiedSplat: Personalized Real-to-Sim-to-Real Navigation with Gaussian Splats from a Mobile Device},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2025},
    pages     = {25431-25441}
}

ICCV 2025

SplArt: Articulation Estimation and Part-Level Reconstruction with 3D Gaussian Splatting

Shengjie Lin, Jiading Fang, Muhammad Zubair Irshad, Vitor Campagnolo Guizilini, Rares Andrei Ambrus, Greg Shakhnarovich, Matthew R. Walter

@InProceedings{Lin_2025_ICCV,
    author    = {Lin, Shengjie and Fang, Jiading and Irshad, Muhammad Zubair and Guizilini, Vitor Campagnolo and Ambrus, Rares Andrei and Shakhnarovich, Greg and Walter, Matthew R.},
    title     = {SplArt: Articulation Estimation and Part-Level Reconstruction with 3D Gaussian Splatting},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2025},
    pages     = {8841-8851}
}

CoRL 2025

Oral Presentation

Real2Render2Real: Scaling Robotic Manipulation Data Without Dynamics Simulation or Robot Hardware

Justin Yu, Letian Fu, Huang Huang, Karim El-Refai, Rares Andrei Ambrus, Richard Cheng, Muhammad Zubair Irshad, Ken Goldberg

@InProceedings{pmlr-v305-yu25a,
  title = 	 {Real2Render2Real: Scaling Robot Data Without Dynamics Simulation or Robot Hardware},
  author =       {Yu, Justin and Fu, Letian and Huang, Huang and El-Refai, Karim and Ambrus, Rares Andrei and Cheng, Richard and Irshad, Muhammad Zubair and Goldberg, Ken},
  booktitle = 	 {Proceedings of The 9th Conference on Robot Learning},
  pages = 	 {547--577},
  year = 	 {2025},
  volume = 	 {305},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {27--30 Sep},
  publisher =    {PMLR},
  url = 	 {https://proceedings.mlr.press/v305/yu25a.html}
}

CVPR 2025

ZeroGrasp: Zero-Shot Shape Reconstruction Enabled Robotic Grasping

Shun Iwase, Muhammad Zubair Irshad, Katherine Liu, Vitor Guizilini, Robert Lee, Takuya Ikeda, Amma Ayako, Koichi Nishiwaki, Kris Kitani, Rares Ambrus, Sergey Zakharov

@InProceedings{Iwase_CVPR_2025,
  author = {Iwase, Shun and, Irshad, Muhammad Zubair and Liu, Katherine and Guizilini, Vitor and Lee, Robert and Ikeda, Takuya and Amma, Ayako and Nishiwaki, Koichi and Kitani, Kris and Ambrus, Rares and Zakharov, Sergey},
  title = {ZeroGrasp: Zero-Shot Shape Reconstruction Enabled Robotic Grasping},
  booktitle = {CVPR},
  year = {2025}
}

3DV 2026

Oral Presentation

FastMap: Revisiting Dense and Scalable Structure from Motion

Jiahao Li, Haochen Wang, Muhammad Zubair Irshad, Igor Vasiljevic, Matthew R. Walter, Vitor Campagnolo Guizilini, Greg Shakhnarovich

@inproceedings{fastmap2025,
    author        = {Jiahao Li and Haochen Wang and Muhammad Zubair Irshad and Igor Vasiljevic and Matthew R. Walter and Vitor Campagnolo Guizilini and Greg Shakhnarovich},
    title         = {FastMap: Revisiting Structure from Motion through First-Order Optimization},
    journal       = {International Conference on 3D Vision (3DV)},
    year          = {2026}
}
Neural Fields Survey

In Submission 2025

Neural Fields in Robotics: A Survey

Muhammad Zubair Irshad, Mauro Comi, Yen-Chen Lin, Nick Heppert, Abhinav Valada, Zsolt Kira, Rares Ambrus, Johnathan Trembley

@article{irshad2024neuralfieldsroboticssurvey,
  title={Neural Fields in Robotics: A Survey},
  author={Muhammad Zubair Irshad and Mauro Comi and Yen-Chen Lin and Nick Heppert and Abhinav Valada and Rares Ambrus and Zsolt Kira and Jonathan Tremblay},
  journal={arXiv preprint arXiv:2410.20220},
  year={2024}
}
NeRF-MAE

ECCV 2024 | NRI Workshop CVPR 2024

NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields

Muhammad Zubair Irshad, Sergey Zakharov, Vitor Guizilini, Adrien Gaidon, Zsolt Kira, Rares Ambrus

@inproceedings{irshad2024nerfmae,
    title={NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields},
    author={Muhammad Zubair Irshad and Sergey Zakharov and Vitor Guizilini and Adrien Gaidon and Zsolt Kira and Rares Ambrus},
    journal={European Conference on Computer Vision (ECCV)},
    year={2024}
}

ICRA 2025

POGS: Persistent Object Gaussian Splat for Tracking Human and Robot Manipulation of Irregularly Shaped Objects

Justin Yu, Kush Hari, Karim El-Refai, Arnav Dalil, Justin Kerr, Chung-Min Kim, Richard Cheng, Muhammad Zubair Irshad, Ken Goldberg

@article{yu2025pogs,
  author    = {Yu, Justin and Hari, Kush and El-Refai, Karim and Dalil, Arnav and Kerr, Justin and Kim, Chung-Min and Cheng, Richard and Irshad, Muhammad Zubair and Goldberg, Ken},
  title     = {Persistent Object Gaussian Splat (POGS) for Tracking Human and Robot Manipulation of Irregularly Shaped Objects},
  journal   = {Proceedings of the IEEE International Conference on Robotics and Automation (ICRA)},
  year      = {2025},
}

CoRL 2024

Oral Presentation (Top 4.3%)

RoVi-Aug: Robot and Viewpoint Augmentation for Cross-Embodiment Robot Learning

Lawrence Chen*, Chenfeng Xu*, Karthik Dharmarajan, Muhammad Zubair Irshad, Richard Cheng, Kurt Keutzer, Masayoshi Tomizuka, Quan Vuong, Ken Goldberg

@inproceedings{chen2024roviaug,
      title={RoVi-Aug: Robot and Viewpoint Augmentation for Cross-Embodiment Robot Learning},
      author={Lawrence Yunliang Chen and Chenfeng Xu and Karthik Dharmarajan and Zubair Irshad and Richard Cheng and Kurt Keutzer and Masayoshi Tomizuka and Quan Vuong and Ken Goldberg},
      booktitle = {Conference on Robot Learning (CoRL)},
      address  = {Munich, Germany},
      year = {2024},
}

IROS 2024

Language-Embedded Gaussian Splats (LEGS): Incrementally Building Room-Scale Representations with a Mobile Robot

Justin Yu*, Kush Hari*, Kishore Srinivas*, Adam Rashid, Chung Min Kim, Justin Kerr, Richard Cheng, Muhammad Zubair Irshad, Ashwin Balakrishna, Thomas Kollar, Ken Goldberg

@inproceedings{yu2024legs,
    title={Language-Embedded Gaussian Splats (LEGS): Incrementally Building Room-Scale Representations with a Mobile Robot},
    author={Justin Yu and Kush Hari and Kishore Srinivas and Karim El-Refai and Adam Rashid and Chung Min Kim and Justin Kerr1 and Richard Cheng and Muhammad Zubair Irshad and Ashwin Balakrishna and Thomas Kollar and Ken Goldberg},
    booktitle={Proceedings of the IEEE International Conference on Intelligent Robots and Systems (IROS)},
    year={2024}
}

RSS 2024

DROID: A Large-Scale In-the-Wild Robot Manipulation Dataset

Alexander Khazatsky*, Karl Pertsch*, Suraj Nair, Ashwin Balakrishna, … Muhammad Zubair Irshad et al.

@article{khazatsky2024droid,
    title   = {DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset},
    author  = {Alexander Khazatsky and Karl Pertsch and Suraj Nair and Ashwin Balakrishna and Sudeep Dasari and Siddharth Karamcheti and Soroush Nasiriany and Mohan Kumar Srirama and Lawrence Yunliang Chen and Kirsty Ellis and Peter David Fagan and Joey Hejna and Masha Itkina and Marion Lepert and Yecheng Jason Ma and Patrick Tree Miller and Jimmy Wu and Suneel Belkhale and Shivin Dass and Huy Ha and Arhan Jain and Abraham Lee and Youngwoon Lee and Marius Memmel and Sungjae Park and Ilija Radosavovic and Kaiyuan Wang and Albert Zhan and Kevin Black and Cheng Chi and Kyle Beltran Hatch and Shan Lin and Jingpei Lu and Jean Mercat and Abdul Rehman and Pannag R Sanketi and Archit Sharma and Cody Simpson and Quan Vuong and Homer Rich Walke and Blake Wulfe and Ted Xiao and Jonathan Heewon Yang and Arefeh Yavary and Tony Z. Zhao and Christopher Agia and Rohan Baijal and Mateo Guaman Castro and Daphne Chen and Qiuyu Chen and Trinity Chung and Jaimyn Drake and Ethan Paul Foster and Jensen Gao and Vitor Guizilini and David Antonio Herrera and Minho Heo and Kyle Hsu and Jiaheng Hu and Muhammad Zubair Irshad and Donovon Jackson and Charlotte Le and Yunshuang Li and Kevin Lin and Roy Lin and Zehan Ma and Abhiram Maddukuri and Suvir Mirchandani and Daniel Morton and Tony Nguyen and Abigail O'Neill and Rosario Scalise and Derick Seale and Victor Son and Stephen Tian and Emi Tran and Andrew E. Wang and Yilin Wu and Annie Xie and Jingyun Yang and Patrick Yin and Yunchu Zhang and Osbert Bastani and Glen Berseth and Jeannette Bohg and Ken Goldberg and Abhinav Gupta and Abhishek Gupta and Dinesh Jayaraman and Joseph J Lim and Jitendra Malik and Roberto Martín-Martín and Subramanian Ramamoorthy and Dorsa Sadigh and Shuran Song and Jiajun Wu and Michael C. Yip and Yuke Zhu and Thomas Kollar and Sergey Levine and Chelsea Finn},
    year    = {2024},
}
DiffusionNOCS

IROS 2024 | R6D Workshop ECCV 2024

DiffusionNOCS: Managing Symmetry and Uncertainty in Sim2Real Multi-Modal Category-level Pose Estimation

Takuya Ikeda, Sergey Zakharov, Tianyi Ko, Muhammad Zubair Irshad, Robert Lee, Katherine Liu, Rares Ambrus, Koichi Nishiwaki

@inproceedings{ikeda2024diffusionnocs,
    title={DiffusionNOCS: Managing Symmetry and Uncertainty in Sim2Real Multi-Modal Category-level Pose Estimation},
    author={Takuya Ikeda and Sergey Zakharov and Tianyi Ko and Muhammad Zubair Irshad and Robert Lee and Katherine Liu and Rares Ambrus and Koichi Nishiwaki},
    booktitle={Proceedings of the IEEE International Conference on Intelligent Robots and Systems (IROS)},
    year={2024}
}

ICRA 2024

Best Paper Award

Open X‑Embodiment: Robotic Learning Datasets and RT‑X Models

Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, … Muhammad Zubair Irshad et al.

@misc{open_x_embodiment_rt_x_2023,
  title={Open {X-E}mbodiment: Robotic Learning Datasets and {RT-X} Models},
  author = {Open X-Embodiment Collaboration and Abby O'Neill and Abdul Rehman and Abhinav Gupta and ... and Muhammad Zubair Irshad and ... et al.},
  howpublished  = {\url{https://arxiv.org/abs/2310.08864}},
  year = {2023},
}
FSD

ICRA 2024

FSD: Fast Self-Supervised Single RGB-D to Categorical 3D Objects

Mayank Lunayach, Sergey Zakharov, Dian Chen, Rares Ambrus, Zsolt Kira, Muhammad Zubair Irshad

@inproceedings{lunayach2023fsd,
  title={FSD: Fast Self-Supervised Single RGB-D to Categorical 3D Objects},
  author={Mayank Lunayach and Sergey Zakharov and Dian Chen and Rares Ambrus and Zsolt Kira and Muhammad Zubair Irshad},
  booktitle={International Conference on Robotics and Automation},
  organization={IEEE},
  year={2024}
}

AICC Workshop CVPR 2024

ICE-G: Image Conditional Editing of 3D Gaussian Splats

Vishnu Jaganathan, Hannah Huang, Muhammad Zubair Irshad, Varun Jampani, Amit Raj, Zsolt Kira

@misc{jaganathan2024iceg,
      title={ICE-G: Image Conditional Editing of 3D Gaussian Splats}, 
      author={Vishnu Jaganathan and Hannah Hanyun Huang and Muhammad Zubair Irshad and Varun Jampani and Amit Raj and Zsolt Kira},
      year={2024},
      eprint={2406.08488},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

ICCV 2023

NeO 360: Neural Fields for Sparse View Synthesis of Outdoor Scenes

Muhammad Zubair Irshad, Sergey Zakharov, Katherine Liu, Vitor Guizilini, Thomas Kollar, Adrien Gaidon, Zsolt Kira, Rares Ambrus

@inproceedings{irshad2023neo360,
  title={NeO 360: Neural Fields for Sparse View Synthesis of Outdoor Scenes},
  author={Muhammad Zubair Irshad and Sergey Zakharov and Katherine Liu and Vitor Guizilini and Thomas Kollar and Adrien Gaidon and Zsolt Kira and Rares Ambrus},
  journal={International Conference on Computer Vision (ICCV)},
  year={2023},
  url={https://arxiv.org/abs/2308.12967},
}
CARTO

CVPR 2023

CARTO: Category and Join Agnositc Reconstruction of Articulated Objects

Nick Heppert, Muhammad Zubair Irshad, Sergey Zakharov, Katherine Liu, Rares Ambrus, Jeannette Bohg, Abhinav Valada, Thomas Kollar

@inproceedings{heppert2023carto,
  title={Carto: Category and joint agnostic reconstruction of articulated objects},
  author={Heppert, Nick and Irshad, Muhammad Zubair and Zakharov, Sergey and Liu, Katherine and Ambrus, Rares Andrei and Bohg, Jeannette and Valada, Abhinav and Kollar, Thomas},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={21201--21210},
  year={2023}
}

ECCV 2022

ShAPO: Implicit Representations for Multi-Object Shape, Appearance, and Pose Optimization

Muhammad Zubair Irshad*, Sergey Zakharov*, Rares Ambrus, Thomas Kollar, Zsolt Kira, Adrien Gaidon

@inproceedings{irshad2022shapo,
  title = {ShAPO: Implicit Representations for Multi-Object Shape Appearance and Pose Optimization},
  author = {Muhammad Zubair Irshad and Sergey Zakharov and Rares Ambrus and Thomas Kollar and Zsolt Kira and Adrien Gaidon},
  journal = {European Conference on Computer Vision (ECCV)},
  year = {2022}
}

ICRA 2022

CenterSnap: Single-Shot Multi-Object 3D Shape Reconstruction and Categorical 6D Pose and Size Estimation

Muhammad Zubair Irshad, Thomas Kollar, Michael Laskey, Kevin Stone, Zsolt Kira

@inproceedings{irshad2022centersnap,
	title = {CenterSnap: Single-Shot Multi-Object 3D Shape Reconstruction and Categorical 6D Pose and Size Estimation},
	author = {Muhammad Zubair Irshad and Thomas Kollar and Michael Laskey and Kevin Stone and Zsolt Kira},
	journal = {IEEE International Conference on Robotics and Automation (ICRA)},
	year = {2022}
}

ICRA 2021

Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation

Muhammad Zubair Irshad, Chih-Yao Ma, Zsolt Kira

@inproceedings{irshad2021hierarchical,
  title={Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation},
  author={Muhammad Zubair Irshad and Chih-Yao Ma and Zsolt Kira},
  booktitle={Proceedings of the IEEE International Conference on Robotics and Automation (ICRA)},
  year={2021}
}
SASRA

ICPR 2022

SASRA: Semantically-aware Spatio-Temporal Reasoning Agent for Vision-and-Language Navigation

Muhammad Zubair Irshad, Niluthpol Mithun, Zachary Seymour, Han-Pang Chiu, Supun Samarasekera, Rakesh Kumar

@INPROCEEDINGS{irshad2022sasra,
        author={Irshad, Muhammad Zubair and Chowdhury Mithun, Niluthpol and Seymour, Zachary and Chiu, Han-Pang and Samarasekera, Supun and Kumar, Rakesh},
        booktitle={2022 26th International Conference on Pattern Recognition (ICPR)}, 
        title={Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous Environments}, 
        year={2022},
        pages={4065-4071},
        doi={10.1109/ICPR56361.2022.9956561}
}

THESIS

PhD Thesis

PhD Thesis 2023

Learning 3D Robotics Perception using Inductive Priors

Muhammad Zubair Irshad

Georgia Institute of Technology

@misc{irshad2024learning3droboticsperception,
      title={Learning 3D Robotics Perception using Inductive Priors}, 
      author={Muhammad Zubair Irshad},
      year={2024},
      eprint={2405.20364},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2405.20364}, 
}

MACHINE LEARNING WORKSHOPS

Beyond Teleop Workshop

ICRA 2026

Beyond Teloperation: Learning from Diverse Human and Simulation Data

Kushal Kedia, Muhammad Zubair Irshad, Jingyun Yang, Rutav Shah, Satvik Sharma, Himanshu Gaurav Singh, Tyler Lum, Sergey Zakharov

3D VLMs Workshop

CVPR 2025

3D Vision Language Models (VLMs) for Robotics Manipulation: Opportunities and Challenges

Jiafei Duan, Muhammad Zubair Irshad, Ishika Singh, Vitor Guizlini, Rares Ambrus, Zsolt Kira

RoboNerF Workshop

ICRA 2024

RoboNerF: 1st Workshop On Neural Fields In Robotics

Muhammad Zubair Irshad, Nick Heppert, Jonathan Tremblay, Shreyas Kousik, Zsolt Kira, Abhinav Valada

TEACHING

Computer Vision Course

Instructor

EE 452: Computer Vision

Spring 2026 (Ongoing) — Habib University

Data Structures Course

Instructor

CS 102: Data Structures and Algorithms

Spring 2026 (Ongoing) — Habib University

Instructor

CS 224: Object Oriented Programming and Design Methodologies

Fall 2025 — Habib University

Deep Learning Course

Graduate Teaching Assistant

CS 7643: Deep Learning

Fall 2022 — Georgia Institute of Technology

Robotics Course

Teaching Practicum

ME 7757: Robotics

Spring 2021 — Georgia Institute of Technology

TALKS

Oct 2025

Pose Estimation in Robotics: From Object Understanding to Camera Calibration

ICCV 2025 Workshop on Category-Level Object Pose Estimation — Virtual, Hawaii

July 2025

Embodied Artificial Intelligence for 3D Perception leveraging Inductive Priors

GIKI — Virtual

Dec 2024

Learning 3D Robotics Perception using Inductive Priors

Woven by Toyota — Tokyo, JP

Aug 2024

Towards Embodied 3D Foundation Models

Habib University — Karachi, PK

Apr 2024

Towards 3D Foundation Models

Facebook AI Research — Bay Area, CA

Mar 2024

Neural Fields in Vision and Beyond

Stanford Computer Vision Class — Bay Area, CA

Jan 2024

Neural Fields in Robotics and beyond

Stanford Robot Perception Class — Bay Area, CA

Jul 2023

Learning Object-Centric Neural 3D Scene Representations

Robotics and AI Institute — Boston, MA

Jun 2023

Neural Fields in Robotics

3D Deep Learning Reading Group — Virtual

Jun 2023

Learning Object-Centric Neural 3D Scene Representations

Qualcomm — San Diego, CA

Apr 2023

Learning Object-centric Neural 3D Representations

Georgia Tech Deep Learning Class — Atlanta, GA

ACADEMIC SERVICE

MACHINE LEARNING SOFTWARE & REPO