Soham Shinde

Experience | Publications | Projects | Contact | Phone

I am a Machine Learning Engineer at Clutterbot, working on applied ML for robotics. I graduated with a B.E. in Electronics and Communications Engineering with a minor in Data Science from BITS Pilani, K.K Birla Goa Campus in July 2025. I am interested in interdisciplinary research, particularly in the domains of Computer Vision and Applied Machine Learning.

Previously, I was a Research Associate at Nanyang Technological University, working on Gaze Estimation and other computer vision techniques with Dr. Yuvaraj and Dr. Amalin. My work involved developing advanced gaze estimation models using deep learning and analyzing correlations between gaze patterns and student engagement to assess attention in educational settings. Additionally, I have experience in 3D Computer Vision, specifically in Point Cloud Processing, where I worked on implementing supervised algorithms for Human Cloth Segmentation - CloSe. In conclusion, my research revolves around Computer Vision, 3D Modeling, Applied Machine Learning, HCI, and Robotics

During my undergraduate years, I contributed to various student-led projects at BITS Pilani, Goa, including Project Kratos, where I was involved in the Autonomous Subsystem of a Mars Rover prototype. I also served as the Events and Initiatives Head at the Center for Technical Education, BITS Goa.

Outside of work, I'm usually found brewing Blue Tokai in a French press, bouldering, or planning for a trek to the Himalayas! =)

Email | CV | GitHub | LinkedIn

	Machine Learning Engineer \| Clutterbot May '25 - Present Built an end-to-end pipeline for training and deploying segmentation and detection models for indoor navigation, packaging deployment deliverables for the Autonomous Navigation team. Migrated a complex perception system to the Qualcomm DragonWing QCS6490P chipset and NVIDIA IoT kits, improving edge inference speed by ~30% in barebones Linux environments using GStreamer. Also designed and deployed a custom 6DOF pose estimation proof-of-concept pipeline to ensure reliable target locking and rebasing during the hardware migration.
	Research Associate \| Nanyang Technological University July '24 - May '25 Developed and benchmarked a Vision Transformer (ViViT) for classroom activity recognition on a 927-clip EduNet subset, achieving 88% test accuracy by fine-tuning a Kinetics-400 pre-trained model with a 224x224 video preprocessing pipeline. Validated generalizability on an independent 100-video dataset (72% accuracy) and generated gradient-based saliency maps to confirm focus on key regions (e.g. raised hand, board writing). Analyzed gaze and activity patterns to derive quantitative student engagement measures.
	Intern \| TCS Research June '24 - August '24 Implemented and optimized a PyQt5-based GUI integrating SAM for image segmentation, enabling efficient AI-assisted annotation. Developed and automated a mechanism to aggregate individually segmented regions into larger sub-scenes using advanced image processing techniques, advancing the semantic understanding of complex images and reducing manual effort.
	Research Intern \| CSIR-CEERI, Pilani May '23 - August '23 Worked under the supervision of Dr. Dhiraj Sangwan on restoring and segmenting deteriorated Rajasthani murals using deep learning models like U-Net++, DeepLabV3+, PSPNet, and FPN. Developed a synthetic damaged-image generation pipeline using GANs (StyleGAN2-ADA) and crafted diverse binary masks to simulate complex real-world mural damage. Implemented a state-of-the-art inpainting pipeline achieving an SSIM score of 0.9812 for reinstating missing sections.

Subscene Segmentation for Artwork Scene Understanding
Paper

Soham Shinde and Vikram Jamwal

Submitted at ECCV AI4VA Workshop.

Damage Segmentation and Restoration of Ancient Wall Paintings for Preserving Cultural Heritage
Paper

HS Baath, S. Shinde, J. Keniya, PR. Mishra, A. Saini and D. Sangwan

Accepted at the 8th International Conference on Computer Vision & Image Processing (CVIP-2023)

CloSe++ Feb '24 - May '24 Supervised by Dr. Garvita Tiwari	Extended the CloSe-Net framework for fine-grained 3D clothing segmentation from coloured point clouds. Refined edge detection mechanisms to produce sharper boundaries between clothing types, addressing the blurriness in the original model, and automated clothing-type detection to remove manual input and improve usability.
Project Visio Oct '22 - Aug '23 Supervised by Prof. Sougata Sen	Engineered smart-glasses to aid the visually impaired, integrating computer vision techniques for object detection and scene understanding. Developed Tiny-ML models for efficient inference on ESP microcontrollers and designed a companion Android app to interface with the smart-glass system, ensuring seamless real-time assistance over WiFi.
Project Kratos Sept '22 - Jan '23	Contributed to the Autonomous Subsystem of BITS Goa's student-managed Mars Rover. Optimized autonomous navigation in ROS, composed and modified path-planning algorithms (A*, Dijkstra, SLAM) in the Gazebo simulator interfaced with an Nvidia Jetson Xavier, and improved navigation through OpenCV-based object detection.

This template is a modification to Jon Barron's website. Find the source code to my website here.