Sai Prahladh Padmanabhan

Sai Prahladh Padmanabhan

Engineer

Samsung Semiconductor INC

Biography

I am an engineer at Samsung Semiconductor INC with a strong background in designing scalable solutions leveraging Large Language Models (LLMs) and deep learning frameworks such as PyTorch and TensorFlow. My expertise spans developing end-to-end systems, from user-friendly interfaces to inference server backend, with a focus on Generative AI applications. I am an Electrical and Computer Engineering graduate with my Master’s degree from Carnegie Mellon University. My interests are in exploring the spheres of Machine Learning, Deep Learning and Computer vision.

Download my resume.

Interests
  • Artificial Intelligence
  • Machine Learning
  • Deep Learning
  • Computer Vision
Education
  • M.S in Electrical and Computer Engineering, 2021

    Carnegie Mellon University, Pittsburgh, PA

  • B.E in Electronics Engineering, 2019

    Vivekanand Education Society's Institute of Technology, Mumbai, India

Skills

nn
Deep Learning

Proficient

Python

Proficient

pytorch
PyTorch

Proficient

tensorflow-tf
Tensorflow

Intermediate

matlab
MATLAB

Intermediate

cpp
C++

Beginner

Experience

 
 
 
 
 
Samsung Semiconductor INC
Engineer
Sep 2021 – Present San Jose, CA

Engineer at the Memory Solutions Lab working on value proposition of storage solutions for SOTA Deep Learning Algorithms:

  • Developed a Verilog code generation assistant using LLMs, incorporating conversational AI for a chat-based interface that improved code suggestions and completion. Achieved 88% pass@5 success, leveraging fine-tuning on domain-
    specific datasets and a custom benchmark to optimize performance and efficiency.
  • Developed a scalable full-stack application for the Verilog Code gen assistant, comprising of a user-friendly interface, an inference server for model execution, and robust user authentication integration, ensuring seamless and secure functionality.
  • Developed a movie recommendation system using DLRM with Memory Semantic SSD for personalized recommendations. Demonstrated it at the “Open Compute Project” conference, highlighting AI-based decision-making and system performance.
  • Established a Jenkins pipeline to automate performance testing and built a Grafana dashboard for data visualization, accelerating the testing process by 15% and enhancing monitoring efficiency.
 
 
 
 
 
Carnegie Mellon University
Graduate Teaching Assistant
Dec 2020 – May 2021 Pittsburgh, PA

Fulfilled the role of a TA for a reputed Deep Learning course (11-785) at CMU, taught by Prof. Bhiksha Raj. Responsibilities include:

  • Creating Assignments and quizzes.
  • Holding Office Hours.
  • Project group mentoring
 
 
 
 
 
Cylab, Carnegie Mellon University
Graduate Research Assistant
Sep 2020 – Dec 2020 Pittsburgh, PA

Worked on ensuring safety of shared control in autonomous driving under the guidance of Prof. Corina Pasareanu.

  • Performed K-means clustering on driver reaction time classification output of a neural network to verify robustness of classification and ensure safety of shared
    control in autonomous driving.
  • Improved the clustering methodology through mean centroid initialization and elbow method to observe a maximum of 20% increase in cluster radii across 5 clustered regions.
 
 
 
 
 
Cere Labs Pvt. Ltd.
Deep Learning Intern
Jun 2019 – Aug 2019 Mumbai, India

Fulfilled the following responsibilities:

  • Trained Feature Pyramid Networks for text localization in documents.
  • Enhanced text localization accuracy by 2% after adopting Progressive Scale Expansion Network architecture.
  • Resolved issue of omission of isolated characters by tuning hyperparameters for a ResNet-50 backbone.

Projects

*
Augmented Reality system using Planar Homographies.
Planar homography is a warp operation that maps pixel coordinates from one camera frame to another with the fundamental assumption that the points are lying on a plane in the real world. This concept allows us to create cool applications such as an augmented reality system or a panorama stitcher.
Augmented Reality system using Planar Homographies.
Face Verification.
Face Verification is a problem whereby we are required to confirm if a pair of images depict the same peron’s facial features. This task is widely used in modern day applications like the popular ‘Face-unlock’ feature in smartphones, document id verification etc. This task can essentially be split into two steps, face classification followed by face verification. Convolutional Neural Networks are the most popular choice while dealing with such tasks, hence ResNet-18 is the chosen architecture here.
Face Verification.
Fake News Detection.
Fake news is rampant in today’s date and verifying the authenticity of a news article is paramount. The aim of this project is to train various machine learning models to classify a given news article as authentic or fake. This task falls under the domain of Natural Language processing. The machine learning models explored in this project are Naive Bayes classifier, Random forest classifier and Logistic Regression.
Fake News Detection.
Real time Eye/Gaze Tracking.
Real time Eye/Gaze tracking is a process which is really useful in the field of medical science and has proven to be the only method to objectively and accurately record and analyse visual behaviour. This use case is designed as a part of a larger medical subsystem catered towards improving the quality of life of Huntington’s disease patients. This eye tracking implementation enables us to quantify the range of voluntary eye movement of HD patients through Occular Pursuit exercises. This implementation leverages the vast capabilities of the OpenCV library.
Real time Eye/Gaze Tracking.
Speech to text transcription network.
Speech recognition is an interdisciplinary subfield of computer science and computational linguistics that develops methodologies and technologies that enable the recognition and translation of spoken language into text by computers. It is also known as automatic speech recognition (ASR), computer speech recognition or speech to text (STT). This project attempts to create an end-to-end speech transcription network consisting of encoder-decoder structure equipped with attention mechanism. Levenshtein distance was the evaluation metric used to gauge the performance of the network. This architecture obtains an average Levenshtein distance of 24.3.
Speech to text transcription network.