Skip to content
View SRDdev's full-sized avatar
:octocat:
Empowering lives through AI
:octocat:
Empowering lives through AI

Block or report SRDdev

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
SRDdev/README.md

Shreyas Dixit

Machine Learning Engineer & Researcher

LLM Inference Optimization | Multimodal Research & Safety

Profile Views Google Scholar Twitter LinkedIn


About Me

I am a Machine Learning Engineer and Researcher specializing in the intersection of Applied Research and Implementation. My core focus is on LLM Inference Optimization, Quantization, and AI Safety Mechanisms.

I bridge the gap between "State-of-the-Art" and "Scale," building systems that are not only novel but robust and deployable. Currently, I work full-time as a Machine Learning Engineer at Techolution, where I lead the development of Embodied AI systems that achieve 97% task accuracy and 2mm precision in dynamic environments.

In parallel, I collaborate with the AI Institute of South Carolina (AIISC) as a Contributing Researcher, focusing on AI integrity layers (Watermarking, Hallucination Mitigation) for Generative AI.


Research & Mentorship

My research on "Safety-by-Design" architectures is conducted under the guidance of Dr. Amitava Das (BITS Pilani/AIISC). I am fortunate to be mentored by and collaborate with leading scientists including Vasu Sharma (Meta AI/FAIR), Aman Chadha (Apple/Stanford AI), and Vinija Jain (Meta/Amazon).


Latest News & Updates

  • [Feb 2025] Workshop Organizer: Serving as an Associate Organizer for the Defactify 4.0 Workshop at AAAI 2025.
  • [Jan 2025] New Pre-print: Released PECCAVI, a novel watermarking technique for AI-generated images, co-authored with mentors from Meta and Apple.
  • [2024] Journal Publication: WaveFormer published in Ocean Engineering (Q1 Journal). Proposed a Transformer-based architecture for long-term time-series forecasting.
  • [2024] Production Deployment: Deployed end-to-end Agentic Systems using LangGraph and CrewAI for autonomous industrial tasks at Techolution.

Selected Publications & Patents

Papers

  • Peccavi: Visual Paraphrase Attack Safe and Distortion Free Image Watermarking (2025)
    • Shreyas Dixit, Ashhar Aziz, Shashwat Bajpai, Vasu Sharma (Meta), Aman Chadha (Apple), Vinija Jain (Meta), Amitava Das.
    • Proposed a robust watermarking technique resistant to visual paraphrase attacks.
  • WaveFormer: Lag Removing Univariate Long Time Series Forecasting Transformer (2024)
    • Shreyas Dixit, Pradnya Dixit.
    • Published in Ocean Engineering (Elsevier).
  • Rethinking Data Integrity in Federated Learning: Are we ready? (2022)
    • IEEE International WIE Conference on Electrical and Computer Engineering.

Patents

  • Patent #1: "Real-Time MultiModal Video Narration Platform for Visually Impaired People" (2023).
  • Patent #2: "Assistance Platform for Visually Impaired Person Using Image Captioning" (Indian Patent).

Engineering Moats & Tech Stack

I specialize in optimizing inference pipelines and building forensic layers for AI.

Domain Technologies
Inference Optimization vLLM, TensorRT-LLM, Triton Inference Server, TorchServe, Quantization (AWQ/GPTQ)
Agentic AI LangGraph, CrewAI, Model Context Protocol (MCP), AutoGen
Computer Vision PyTorch, OpenCV, YOLO, NVIDIA Isaac Sim, 6D Pose Estimation
Infrastructure Docker, Kubernetes, GCP, VectorDBs (Milvus/Pinecone), FastAPI

Featured Projects

  • Research: A multimodal pipeline aligning visual encoders with audio generation modules to create "Safety-by-Design" accessibility tools.
  • Stack: PyTorch, Diffusers, CLAP/CLIP Embeddings.
  • Research: Optimized Transformer architectures for English-to-Hindi translation, focusing on efficient tokenization for low-resource languages.
  • Stack: HuggingFace Transformers, PyTorch.
  • Research: A from-scratch PyTorch implementation of the BART architecture for Masked Language Modeling (MLM) on mixed-script datasets.

Connect

I am actively seeking high-impact roles as a Senior ML Engineer, AI Architect, or Inference Engineer. If you are building production-grade LLM pipelines or working on efficient inference, I would love to connect.

🌐 Portfolio | 🐦 Twitter | 💼 LinkedIn

"The best way to predict the future is to invent it." – Alan Kay

Pinned Loading

  1. PaliGemma PaliGemma Public

    Building PaliGemma from scratch, a Vision Language Model by GoogleDeepmind designed to address a broad range of vision-language tasks. It combines the SigLIP-So400m vision encoder and the Gemma-2B …

    Python 6 1

  2. YouTube-Llama YouTube-Llama Public

    A question-answering chatbot for any YouTube video using Local Llama2 & Retrival Augmented Generation

    Python 3 3

  3. Multi-Head-Yolov9 Multi-Head-Yolov9 Public

    This repository contains the implementation of a multi-head YOLOv9 model for clothes detection and instance segmentation. The model is trained on the DeepFashion dataset and evaluated using MSCOCO …

    Jupyter Notebook 1 2

  4. OpenAI-CLIP OpenAI-CLIP Public

    Simple Educational Implementation of OpenAI CLIP in PyTorch

    Jupyter Notebook 5 2

  5. SwinTransformer SwinTransformer Public

    This project aims to replicate the architecture proposed in the Swin Transformer paper for medical image semantic segmentation.

    Jupyter Notebook 1

  6. PaLM-RLHF PaLM-RLHF Public

    Forked from lucidrains/PaLM-rlhf-pytorch

    Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

    Python 30 4