๐จโ๐ป About Me
Hi, I'm Mudit Jain, an engineer at heart with expertise in 3D/2D Machine Learning, SLAM, Computer Vision, GPU programming, and Embedded Systems.
Currently, my work is centered on using Multimodal AI, particularly fusing LiDAR and camera data for 2D/3D static/dynamic object detection/tracking. My interests include applied ML and CV research, real-time SLAM systems, and advancing computer vision capabilities for autonomous vehicles.
๐ฐ News & Updates
2026: Judge and mentor for the Qualcomm Innovation Fund 2026
2026: Paper submitted to ECCV 2026
BeyondSight: Object Permanence for End-to-End Autonomous Driving
2026: Paper accepted at OmniCV (CVPR Workshop 2026)
FishRoPE: Projective Rotary Position Embeddings for Omnidirectional Visual Perception
2025: Received 2 Qualcomm Distinguished Innovation Awards
2025: 6 patents submitted
2025: Mentor for Google Summer of Code (GSoC) with OpenCV
2025: Judge and mentor for the Qualcomm Innovation Fund 2025
2025: Reviewer for IEEE ITSC 2025 (Intelligent Transportation Systems Conference)
2025: Reviewer for IEEE IV 2025 (Intelligent Vehicles Symposium)
2024: Joined Qualcomm as Senior Deep Learning Engineer in Multimodal AI.
2024: Mentored a project at Google Summer of Code on 3D Reconstruction.
2024: Serving as a reviewer for WACV 2025.
2021: Joined Qualcomm as Senior Machine Learning Engineer, XR Research.
2021: Graduated with Master's degree from University of California San Diego.
2019: Started Master's in Electrical and Computer Engineering at UCSD.
2019: Joined DroneLab at UCSD as Graduate Student Researcher.
2017: Promoted to Embedded System Software Engineer II at NVIDIA.
2016: Joined NVIDIA as Embedded System Software Engineer I.
2016: Graduated with Bachelor's degree from BITS Pilani.
2016: Selected for Google Summer of Code as Developer for RTEMS.
2015: Joined NVIDIA as an Intern.
2014: Joined Srujana Innovation Center as an Intern.
2012: Started Bachelor's in Electronics and Communication at BITS Pilani.
๐ Education
-
Specialization in Machine Learning and Data Science
Courses: Linear Algebra, Probability and Statistics, Statistical Learning, Visual Learning, Computer Vision I & III, GPU Programming, Deep Learning and Applications
-
Minor in Data Science with focus on Embedded Systems and Computer Vision
๐ ๏ธ Skills
๐ป Programming Languages
C++ [8+ years]
Python [8+ years]
๐ง Technical Knowledge Domains
Multimodal Large Scale Deep Learning [Ray, Kubernetes, PyTorch]
2D/3D Object Detection & Tracking [LiDAR + Camera]
BEV Modeling
Prediction & Planning
End-to-End Autonomous Driving
Vision Language Action Models
Transformers [PEFT, HuggingFace]
Classical Computer Vision [C++, OpenCV]
3D Computer Vision
Machine Learning [PyTorch, JAX]
SLAM [ORB-SLAM, VINS Mono]
Non-linear Optimization [Eigen, g2o, ceres, GTSAM]
Bundle Adjustment
Camera Calibration
Pose Graph Optimization
IMU Preintegration
Bayesian Inference
Embedded Systems
SIMD Programming [CUDA]
Model & Data Parallelism
Model Optimization [TensorRT]
3D Reconstruction [NeRFs, Gaussian Splatting]
๐ Projects
Interactive step-through visualizer for 42 algorithm problems covering graphs, grids, union-find, tries, and more. Features 6 renderer types, weighted graph support, and auxiliary panels for queue/stack visualization.
Trains Snake agents directly from rendered board images, comparing plain DQN, DQN with configurable MCTS rollouts, and GRPO. Includes TensorBoard logging and video generation hooks.
Local near-duplicate question detection on the Quora Question Pairs dataset using sentence-transformers and HNSW-based approximate nearest neighbor search, with a retrieval-first pattern for semantic similarity and hard example mining.
Neural agents for Tic-Tac-Toe using plain DQN and DQN with MCTS-guided action selection, trained against a mixed minimax/random opponent. Includes a live browser demo with exported model weights.
Implementation of object detection using DINOv2 self-supervised vision transformers, enabling state-of-the-art zero-shot detection capabilities.
Optimization methods for large-scale bundle adjustment in 3D reconstruction challenges, focusing on efficiency and scalability.
Monocular depth estimation techniques for 3D scene understanding from 2D images using deep learning approaches.
Custom CUDA Implementation for Multi-Agent Reinforcement Learning
Accelerated Q-table updates and reward policies for multi-agent Q-learning using CUDA, achieving 100% training accuracy and 99.8% test accuracy in under 4 minutes on a 46ร46 grid with 512 agents.
University of California San Diego โข Jan 2021 - Mar 2021
Speeding up Mario RL with Custom Torch C++ Extensions
Developed custom CUDA kernels for linear, pooling, ReLU, and convolutional layers to accelerate the training and inference of a CNN for a Double Q-learning based RL agent playing Mario.
University of California San Diego โข Jan 2021 - Mar 2021
AlertWILDFire Plume Detection
Deployed an ensemble neural network model across 610 cameras in California for early wildfire detection, using modified MaskRCNN with focal loss and an EfficientNet-based segmentation model with SCSE attention.
Drone Lab - UCSD โข Jul 2020 - Jan 2021
Domain Adaptation for Semantic Segmentation
Trained OCNet on Cityscapes dataset and used CycleGAN-based domain adaptation to generate real-world-like data from gaming data, improving model performance through expanded training data.
University of California San Diego โข Sep 2019 - Dec 2019
Image Denoising using Deep CNNs
Implemented and compared DnCNN, UDnCNN, and DUDnCNN architectures for image denoising, achieving up to 99.85% accuracy with U-Net with dilated convolutions.
University of California San Diego โข Sep 2019 - Dec 2019
3D Reconstruction of the Anterior Segment of the Eye
Developed image processing pipeline and GUI interface for 3D eye model reconstruction by projecting patterns on the anterior segment and applying PCA and object tracking techniques.
Srujana Innovation Center and MIT Media Labs
๐ Honors & Awards
2 Qualcomm Distinguished Innovation Awards
Qualcomm
Recognized for outstanding contributions to innovation in multimodal AI and autonomous driving technologies.
100% Tuition Scholarship
DroneLab, University of California San Diego
Full tuition coverage for exceptional research contributions and academic merit.
Telangana Overseas Scholarship
Government of Telangana โข University of California San Diego
Prestigious government scholarship awarded to exceptional students for pursuing graduate studies abroad.
MCN Scholarship
Birla Institute of Technology and Science, Pilani
Merit-based scholarship recognizing academic excellence and leadership potential.