Shashank Shekhar - AI Research Engineer

Profile Summary

A proactive AI/ML engineer with 7 years of professional experience and over 2 years of entrepreneurial experience in building scalable AI solutions across Computer Vision, LLMs, Agentic AI, Reinforcement Learning, and Generative models.

Experienced in LLM post-training research, diffusion models, model distillation, audio models, video models, lip-sync models, and LLM evaluation harnesses. Proven track record in end-to-end product development, from prototyping and experimentation through deployment, scaling, and optimization, with deep expertise in cloud-native, platform-agnostic architectures.

Directed the full lifecycle development of a high-value project worth $5M, including architecture design, resource allocation, and cross-functional team coordination. Worked as a Claude Code Opus model super-user, validating and reviewing over 15,000 lines of production code for correctness, efficiency, and software engineering quality.

Strong background in production ML engineering and MLOps, including GPU-accelerated inference pipelines, FastAPI-based REST APIs, secure back-end systems with Nginx, and containerized deployments across GCP, AWS, and Azure. Delivered edge and embedded AI systems with lightweight computer vision models, Kalman filter-based tracking, and real-time tampering detection.

Career Objective

Career-focused AI Research Engineer leveraging deep expertise in foundation models, cloud-native infrastructure, and full-cycle product development to accelerate innovation, scalability, and transformative solutions across diverse industries and enterprise ecosystems worldwide. I would like to work on the hardest problems in AI and am seeking collaboration with like-minded founders and investors passionate about building an AI startup.

Work Experience

Optum India, Bangalore

Jan 2026 - Present

AI ML Engineer

Working on Agentic AI Search using a Reinforcement Learning framework.
Designed an LLM-as-a-judge evaluation pipeline that pre-screens data and surfaces likely failures for targeted human auditing, reducing the review set to about 30% of data and cutting manual review effort by about 70%.

Insigteye.ai, Hyderabad

Nov 2024 - Dec 2025

Lead / Founding Engineer

Spearheaded end-to-end development and delivery of multiple AI/ML projects as Project Lead, covering requirement gathering, system design, deployment, scaling, and maintenance.
Architected and deployed cloud-native, scalable back-end infrastructure for video analytics applications with platform-agnostic principles for on-premises and cloud environments.
Fine-tuned advanced vision-language models on NVIDIA H100 GPUs and deployed production-grade inference using the vLLM framework for optimized throughput and low-latency serving.
Configured and optimized Nginx as a reverse proxy and load balancer, including request routing, caching, and security enhancements for reliability under high traffic.
Designed and developed RESTful APIs using FastAPI with error management, request validation, authentication, and authorization for secure and scalable client-server communication.
Implemented an agentic back-end pipeline to support horizontal scalability and distributed workloads for enterprise AI automation workflows.

Honeywell India, Pune

Jul 2022 - Dec 2022

Senior ML Engineer

Deployed a lightweight object detection algorithm optimized for the Amlogic Neural Hardware Accelerator for real-time performance on constrained hardware.
Designed and integrated a novel camera tampering detection algorithm using the OpenCV C-API, improving system reliability and security.
Contributed to simplifying, refactoring, and debugging a legacy C codebase of more than 10,000 lines, improving readability, maintainability, and execution performance.

Quest Global, Bangalore

Jul 2021 - Dec 2021

ML Engineer

Developed an unsupervised anomaly detection algorithm to identify irregularities and detect fault triggers in ambient light sensor data.
Experimented with sensor fusion algorithms.

Tarsyer, Pune

Sep 2019 - Dec 2020

Computer Vision Engineer

Implemented and deployed computer vision algorithms for real-time detection and tracking using Kalman filters on edge devices, with an optimized detection model size of only 200 KB.
Managed scientific research and engineering execution for machine learning projects, ensuring both innovation and production-level reliability.

Entrepreneurial Experience

Stealth AI Startup

Jan 2023 - Jun 2024

Conducted pre-training and fine-tuning of foundation models spanning LLMs, text-to-image models, TTS, STT, and vision-language models.
Adapted pre-trained LLMs to specific domains and tasks using targeted datasets and optimized them for real-world applications.
Designed and implemented Retrieval-Augmented Generation systems integrating LLMs with company knowledge bases to improve response accuracy.
Optimized LLM architectures, hyperparameters, and training workflows to improve efficiency, accuracy, and inference performance.
Developed LLM-powered applications for summarization, translation, question answering, chatbots, and conversational AI.
Deployed LLMs across Azure, GCP, and on-premises infrastructure.
Monitored model outputs, corrected biases and errors, and retrained models to maintain high-quality performance.
Containerized LLM applications using Docker for portability, reproducibility, and scalability.

Internship

Swaayatt Robots, Bhopal

Aug 2018 - May 2019

RL Intern

Successfully tested Deep Reinforcement Learning algorithms on a self-driving toy car and analyzed Optical Flow and SLAM.
Led experimentation with RL algorithms including DQN variants, Vanilla Policy Gradient, PPO, and TRPO in OpenAI Gym environments across discrete and continuous action spaces.

Technical Skills

Programming: Python, C/C++, JavaScript, Go, Rust
AI/ML Frameworks: PyTorch, TensorFlow
Computer Vision: OpenCV, real-time video analytics, embedded vision systems
LLMs: Trained and fine-tuned LLMs, Qwen 3B Reasoning Model, Deepseek-R1 model
Reinforcement Learning: Deep RL, policy optimization algorithms, Generative AI, Foundation Models
Cloud Infra and DevOps: GCP, AWS, Docker, Kubernetes, Ansible
Databases: MySQL, PostgreSQL, MongoDB, Vector DB
Development Tools: Claude Code, Cursor IDE, Codex, Github, CI/CD pipelines
Web, AI Model and API Development: Consumer back-end development, API design and integration, model deployment

Core Competencies

AI model design and optimization
Computer vision and video analysis
LLM post-training research
Reinforcement learning
Model deployment and optimization
System design and back-end architectures
API development and system integration
Generative AI and foundation models
Real-time and embedded AI solutions
End-to-end AI project management
Algorithm development and research
Edge AI and low-latency inference

Projects

NanoGPT
RLHF book
Latent-Sync Video Model
WhisperX
Diffusion Models, Stable Diffusion, SD3, Flux, HyperDreamBooth
Encodec, F5-TTS, OpenAI Whisper
LLM, Llama 3, LoRA, DsPY, RLHF, DPO
Bank vault monitoring using computer vision
Real-time bearing counting using computer vision
Manufacturing plant monitoring using computer vision
Self-driving toy car

Education

Bachelor of Engineering
R.G.P.V, Bhopal, June 2018

Courses

Stanford CS229 by Andrew Ng
Stanford CS231N by Andrej Karpathy
Stanford CS224N: NLP with DL
Deep RL by David Silver and Deep RL BootCamp Lectures
Self-Driving Engineer Nanodegree, Udacity
Cryptography by Christof Paar and CryptoCurrency E&D, MIT
Deep learning books by Michael Nielsen and Ian Goodfellow
Intro to Reinforcement Learning by Rich Sutton
Numerical Optimization by Nocedal and Wright
Linux Handbook by Evi Nemeth