Charchit Sharma, Developer in Varanasi, Uttar Pradesh, India
Charchit is available for hire
Hire Charchit

Charchit Sharma

ML Engineer and Developer

Varanasi, Uttar Pradesh, India

Toptal member since January 29, 2026

Bio

Charchit is a senior machine learning engineer specializing in building, optimizing, and deploying scalable artificial intelligence systems, with deep expertise in generative AI, large language models (LLMs), and computer vision. His focus is on transitioning complex models into high-throughput production environments, reducing inference latency, and building robust, automated data pipelines.

Portfolio

IDfy
PyTorch, Python, Docker, FastAPI, Google Cloud Platform (GCP), Linux...
Avataar.ai
Agentic AI, Diffusion Models, Large Language Models (LLMs), Computer Vision...
IIIT Hyderabad
Computer Vision, Machine Learning, Large Language Models (LLMs)...

Experience

  • PyTorch - 7 years
  • Python - 7 years
  • Natural Language Processing (NLP) - 7 years
  • Computer Vision - 7 years
  • Large Language Models (LLMs) - 5 years
  • FastAPI - 5 years
  • Open-source LLMs - 4 years
  • Agentic AI - 3 years

Preferred Environment

Linux, PyTorch, vLLM, Open-source LLMs, Docker, Python, Hugging Face

The most amazing...

...thing I've built is a highly scalable deepfake detection solution that processed 20,000+ daily requests, using a CLIP-based architecture to achieve 95%+ AUC.

Work Experience

Senior ML Engineer

2024 - PRESENT
IDfy
  • Built and productionized a CLIP-based deepfake detection system using parameter-efficient fine-tuning (PEFT), achieving an AUC greater than 95%.
  • Deployed the solution in a production Video-KYC pipeline capable of processing 20,000+ daily requests for identity fraud prevention.
  • Architected a retrieval-augmented generation (RAG) pipeline handling 1+ million requests per day, and optimized vector search (Qdrant) to reduce inference turnaround time by 84%.
  • Engineered a context-aware document classification engine using Qwen3-4B and optimized the back-end inference using vLLM to ensure high-throughput processing and low latency.
  • Developed an automated regulatory compliance platform utilizing LLaMA-3.1-8B, implementing custom parsers and multi-stage pipelines to reduce manual audit time by 90%.
Technologies: PyTorch, Python, Docker, FastAPI, Google Cloud Platform (GCP), Linux, Open-source LLMs, Transformers, Natural Language Processing (NLP), Machine Learning Algorithms, Agentic RAG Systems, Claude

Applied ML Engineer

2024 - 2024
Avataar.ai
  • Engineered an automated data curation pipeline for diffusion-based 3D reconstruction models, applying semantic segmentation and clustering to automatically filter 66% of noisy data.
  • Developed distributed evaluation pipelines and optimized data loaders, ultimately reducing evaluation times by 30%.
  • Deployed and evaluated multiple diffusion-based 3D models across distributed AWS cloud infrastructure utilizing GPU clusters.
Technologies: Agentic AI, Diffusion Models, Large Language Models (LLMs), Computer Vision, Natural Language Processing (NLP), Machine Learning Algorithms

Computer Vision Engineer (Research Assistant)

2022 - 2023
IIIT Hyderabad
  • Benchmarked and evaluated over 150 pretrained CNN and ViT models to test for model robustness under real-world data corruption.
  • Contributed core engineering code to major open-source repositories, including the HuggingFace Diffusers library and Facebook's Py-IRT library.
  • Published paper at the ICLR 2023 workshop: https://arxiv.org/abs/2409.04041.
  • Scaled educational resources and technical infrastructure for an NPTEL Computer Vision course, supporting over 7,000 enrolled students.
Technologies: Computer Vision, Machine Learning, Large Language Models (LLMs), Natural Language Processing (NLP), Machine Learning Algorithms

Systems Engineer

2019 - 2022
Infosys
  • Was part of the Apple Global Business Intelligence team; implemented a data pipeline for named-entity recognition using a pre-trained transformer model, enabling efficient extraction of relevant entities from unstructured text data.
  • Performed data analysis and validation on upstream data sources to identify quality issues and anomalies, improving the reliability of entity extraction workflows in production.
  • Went through the training program organized by Infosys Limited.
Technologies: Python, BERT

Experience

Inspect-AI | Automated Compliance Auditing Platform (DPDP Act)

Designed a multi-stage LLM and RAG pipeline to analyze end-to-end user journeys, detect dark patterns, and identify regulatory gaps under India's DPDP Act. I implemented custom parsers and purpose classification modules to tag PII and data-collection intent, reducing manual audit effort by approximately 90% and accelerating compliance review cycles.

AI Scheduling Assistant | LLM and Google Calendar Integration

Built an autonomous scheduling assistant that converts natural language meeting requests into conflict-free meeting slots by fetching real-time availability through the Google Calendar API. I deployed DeepSeek-7B using vLLM for information extraction and exposed the system as a Flask API. I also optimized for AMD MI300 GPUs, achieving 2.6 times faster inference and lower memory usage than LLaMA-3.1-8B.

Education

2015 - 2019

Bachelor's Degree in Computer Science

Rajasthan Technical University - Jaipur, Rajasthan, India

Certifications

JULY 2025 - PRESENT

ML Summer School

Cohere

AUGUST 2021 - PRESENT

Deep Learning Specialization

Coursera

AUGUST 2021 - PRESENT

Apply Generative Adversarial Networks (GANs)

Coursera

Skills

Libraries/APIs

PyTorch, Pandas, NumPy, Pydantic, vLLM, Google Calendar API

Tools

Claude, DeepSeek

Languages

Python

Platforms

Linux, Docker, Google Cloud Platform (GCP)

Frameworks

Flask

Other

Open-source LLMs, Hugging Face, Transformers, FastAPI, Parsers, Natural Language Processing (NLP), Computer Vision, Machine Learning Algorithms, Large Language Models (LLMs), Agentic AI, Agentic RAG Systems, Anthropic, Vector Databases, Machine Learning, Generative Adversarial Networks (GANs), Deep Learning, Computer Science, Diffusion Models, BERT

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring