Shivagya Dixit, Developer in Bengaluru, Karnataka, India
Shivagya is available for hire
Hire Shivagya

Shivagya Dixit

Verified Expert  in Engineering

Machine Learning Developer

Bengaluru, Karnataka, India

Toptal member since October 10, 2024

Bio

Shivagya is a seasoned data science practitioner with over six years of industry experience. He specializes in developing and implementing AI/ML solutions and has worked in various sectors, including asset management, banking, air cargo, eCommerce, and IT. His core skills encompass natural language processing (NLP), large language models (LLM), generative AI (GenAI), RAG, machine learning (ML), deep learning, Python, PySpark, and QL. Shivagya is eager for his next professional engagement.

Portfolio

HSBC
Natural Language Processing (NLP), Machine Learning, Azure Machine Learning...
Skellam AI
Natural Language Processing (NLP), Machine Learning, Azure Machine Learning...
Unisys
Python 3, Machine Learning, Azure Machine Learning, Deep Learning, TensorFlow...

Experience

  • Machine Learning - 6 years
  • Python 3 - 6 years
  • SQL - 5 years
  • Deep Learning - 5 years
  • Natural Language Processing (NLP) - 4 years
  • Generative Artificial Intelligence (GenAI) - 2 years
  • Retrieval-augmented Generation (RAG) - 2 years
  • Large Language Models (LLMs) - 2 years

Availability

Part-time

Preferred Environment

Windows 10, Python 3, Visual Studio Code (VS Code)

The most amazing...

...thing I've worked on is a solution that extracts specific factoids from financial statements and related documents using a transformer based language model.

Work Experience

Data Scientist

2022 - PRESENT
HSBC
  • Engineered an intelligent email platform that automates customer query routing and suggests responses from a knowledge base, resulting in 50% faster response times, 7% shorter sales cycles, and significantly enhanced customer service efficiency.
  • Implemented a solution to extract and summarize financial data, increasing AUM by $75 million through improved lead identification and cross-selling. Enhanced information retrieval and content summarization capabilities.
  • Designed and developed solutions utilizing the retrieval‑augmented generation (RAG) framework to effectively extract and summarize information from financial documents leveraging large language models (LLMs).
Technologies: Natural Language Processing (NLP), Machine Learning, Azure Machine Learning, PyTorch, Artificial Intelligence (AI), LangChain, LlamaIndex, Prompt Engineering, Transformer Models, TensorFlow, Deep Learning, Python 3, SQL

Data Scientist

2021 - 2022
Skellam AI
  • Worked on DeepBrew, a personalized recommendation engine, leveraging content‑based collaborative filtering, projected to generate $100 million in revenue and serve 30 million customers for a leading global quick-service restaurant (QSR) coffee chain.
  • Created ingredient‑based similarity search using BERT embeddings to enhance recommendation accuracy.
  • Engineered and deployed Spark ETL pipelines for big data processing to facilitate trending product recommendations.
  • Migrated extensive in‑memory feature caches to Redis, enhancing application scalability and availability. Worked on containerization and deployment of AI/ML models.
Technologies: Natural Language Processing (NLP), Machine Learning, Azure Machine Learning, PyTorch, Transformer Models, TensorFlow, Deep Learning, Recommendation Systems, Python 3, PySpark, SQL, Docker, Kubernetes, Azure Kubernetes Service (AKS), Azure, Databricks, FastAPI, Flask

Data Scientist

2018 - 2021
Unisys
  • Developed AI models for root cause analysis of IT incidents and their resolutions. I was involved in end‑to‑end ML lifecycle development, including data pre‑processing, feature engineering, model development, and visualization.
  • Built AI-based conversational virtual assistant using the Rasa framework for an artificial intelligence-based IT operation (AIOps) solution to provide recommendations on intelligent capacity management of cloud/on‑premise infrastructure resources.
  • Enabled the identification of data drift patterns through rigorous statistical hypothesis testing methodology by implementing a Kolmogorov‑Smirnov test to compare two diverse datasets.
  • Created Grafana dashboards to visualize and interpret data, model predictions, and business outcomes.
Technologies: Python 3, Machine Learning, Azure Machine Learning, Deep Learning, TensorFlow, Databricks, SQL, Grafana, Natural Language Processing (NLP), Flask

Experience

Intelligent Email Platform

Engineered an intelligent email platform that automates customer query routing and suggests responses from a knowledge base, resulting in 50% faster response times, 7% shorter sales cycles, and significantly enhanced customer service efficiency.

Document Information Extraction and Summarization Platform

Engineered solutions using advanced language models (LLMs) like bidirectional encoder representations from transformers (BERT), FLAN‑T5, and Llama to extract and summarize key financial data, freeing up close to 30 full-time equivalents (FTEs) for strategic initiatives and leading to an AUM increase of around $75 million through improved lead identification and cross‑selling asset management products.

I fine‑tuned a Q&A model to extract specific factoids from financial statements and related documents, improving information retrieval accuracy. I also fine‑tuned a sequence‑to‑sequence model for summarization using Parameter-efficient Fine-tuning (PEFT) and Low-rank adaptation (LoRA) techniques to generate concise summaries of financial content.

Education

2014 - 2018

Bachelor's Degree in Computer Science

SRM University - Chennai, Tamil Nadu, India

Certifications

JANUARY 2024 - PRESENT

Building Your Own Database Agent

DeepLearning.AI

JANUARY 2024 - PRESENT

Building Agentic RAG with Llamaindex

DeepLearning.AI

JANUARY 2024 - PRESENT

Functions, Tools, and Agents with LangChain

DeepLearning.AI

JANUARY 2024 - PRESENT

LangChain Chat with Your Data

DeepLearning.AI

JANUARY 2024 - PRESENT

LangChain for LLM Application Development

DeepLearning.AI

JANUARY 2024 - PRESENT

ChatGPT Prompt Engineering for Developers

DeepLearning.AI

JANUARY 2024 - PRESENT

PyTorch for Deep Learning

Udemy

JANUARY 2024 - PRESENT

Generative AI with Large Language Models (LLMs)

DeepLearning.AI

JANUARY 2020 - PRESENT

Machine Learning Specialization

Stanford University | via Coursera

JANUARY 2020 - PRESENT

TensorFlow Developer Professional Certificate

DeepLearning.AI

JANUARY 2020 - PRESENT

Deep Learning Specialization

DeepLearning.AI

Skills

Libraries/APIs

PyTorch, TensorFlow, PySpark

Tools

Azure Kubernetes Service (AKS), Azure Machine Learning, Grafana, ChatGPT

Languages

Python 3, SQL

Frameworks

Flask, LlamaIndex

Platforms

Docker, Kubernetes, Azure, Databricks, Visual Studio Code (VS Code)

Storage

Databases

Other

Programming, Machine Learning, Artificial Intelligence (AI), Deep Learning, Natural Language Processing (NLP), Software Engineering, Prompt Engineering, Transformer Models, FastAPI, Large Language Models (LLMs), Retrieval-augmented Generation (RAG), Generative Artificial Intelligence (GenAI), Windows 10, Algorithms, Mathematics, Data Structures, Computer Networking, Computer Architecture, LangChain, Recommendation Systems, Reinforcement Learning, OpenAI, Open-source LLMs

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring