I'm Sivaramakrishnan Subramanian, and I'm a graduate student studying Computer Vision and Machine Learning at the Robotics Institute, Carnegie Mellon University. Last summer, I interned with the Perception team at Waymo, working on large Vision Language Models (VLMs) to solve the long tail edge scenarios. I'm broadly interested in problems at the heart of perception, image synthesis, multi-modal learning, and all things machine intelligence.
At CMU, I am working with the Air Lab in the Robotics Institute, on multi-view stereo Depth prediction from 6-pair fisheye camera lenses for autonomous navigation. Earlier, I worked with the Xu Lab in the School of Computer Science as a research assistant, exploring visual learning pipelines for self-supervised extraction of 3D object-aware representations using controllable GANs and domain adaptation. I also TA'ed the popular Machine learning course from the ML Department over the Spring (392 students) and Fall (487 students) semesters in 2023.
Before grad school, I worked on CV problems in the R&D Division at AppOrchid Inc., a Fast500 AI company in the utilities & energy industry. My research problems here were skewed towards Document Representation Learning (DRL) and Semantic PDF understanding for financial doc cohorts: extracting document metadata from legalese docs using DL & vision techniques.
I have an intersectional interest in PPML and Differential privacy-FL techniques and part of the vibrant community at OpenMined. My previous work and publications run the gamut from electrical motor design and applied statistical analysis to industrial machine vision, and I still relish contemporary research here.