
Shivagya Dixit
Verified Expert in Engineering
Machine Learning Developer
Bengaluru, Karnataka, India
Toptal member since October 10, 2024
Shivagya is a seasoned data science practitioner with over six years of industry experience. He specializes in developing and implementing AI/ML solutions and has worked in various sectors, including asset management, banking, air cargo, eCommerce, and IT. His core skills encompass natural language processing (NLP), large language models (LLMs), generative AI (GenAI), retrieval-augmented generation (RAG), machine learning (ML), deep learning, Python, PySpark, and SQL. Shivagya is eager for his next professional engagement.
Experience
- Machine Learning - 6 years
- Python 3 - 6 years
- SQL - 5 years
- Deep Learning - 5 years
- Natural Language Processing (NLP) - 4 years
- Generative Artificial Intelligence (GenAI) - 2 years
- Retrieval-augmented Generation (RAG) - 2 years
- Large Language Models (LLMs) - 2 years
Preferred Environment
Windows 10, Python 3, Visual Studio Code (VS Code)
The most amazing...
...thing I've worked on is a solution that extracts specific factoids from financial statements and related documents using a transformer-based language model.
Work Experience
Data Scientist
HSBC
- Engineered an intelligent email platform that automates customer query routing and suggests responses from a knowledge base, resulting in 50% faster response times, 7% shorter sales cycles, and significantly enhanced customer service efficiency.
- Implemented a solution to extract and summarize financial data, increasing assets under management (AUM) by $75 million through improved lead identification and cross-selling. Enhanced information retrieval and content summarization capabilities.
- Designed and developed solutions utilizing the retrieval‑augmented generation (RAG) framework to effectively extract and summarize information from financial documents leveraging large language models (LLMs).
Data Scientist
Skellam AI
- Worked on DeepBrew, a personalized recommendation engine for a leading global quick-service restaurant (QSR) coffee chain. The engine leverages content-based collaborative filtering and is projected to generate $100 million in revenue and serve 30 million customers.
- Created an ingredient-based similarity search using BERT embeddings to enhance recommendation accuracy.
- Engineered and deployed Spark ETL pipelines for big data processing to facilitate trending product recommendations.
- Migrated extensive in‑memory feature caches to Redis, enhancing application scalability and availability. Worked on containerization and deployment of AI/ML models.
Data Scientist
Unisys
- Developed AI models for root cause analysis of IT incidents and their resolutions. Contributed to the end-to-end ML lifecycle, including data pre-processing, feature engineering, model development, and visualization.
- Built an AI-based conversational virtual assistant using the Rasa framework for an artificial intelligence for IT operations (AIOps) solution, providing recommendations on intelligent capacity management of cloud and on-premises infrastructure resources.
- Enabled the identification of data drift patterns through statistical hypothesis testing, implementing a two-sample Kolmogorov-Smirnov test to compare the distributions of two datasets.
- Created Grafana dashboards to visualize and interpret data, model predictions, and business outcomes.
Experience
Intelligent Email Platform
Document Information Extraction and Summarization Platform
I fine-tuned a Q&A model to extract specific factoids from financial statements and related documents, improving information retrieval accuracy. I also fine-tuned a sequence-to-sequence model for summarization using parameter-efficient fine-tuning (PEFT) and low-rank adaptation (LoRA) techniques to generate concise summaries of financial content.
Education
Bachelor's Degree in Computer Science
SRM University - Chennai, Tamil Nadu, India
Certifications
Building Your Own Database Agent
DeepLearning.AI
Building Agentic RAG with LlamaIndex
DeepLearning.AI
Functions, Tools, and Agents with LangChain
DeepLearning.AI
LangChain Chat with Your Data
DeepLearning.AI
LangChain for LLM Application Development
DeepLearning.AI
ChatGPT Prompt Engineering for Developers
DeepLearning.AI
PyTorch for Deep Learning
Udemy
Generative AI with Large Language Models (LLMs)
DeepLearning.AI
Machine Learning Specialization
Stanford University | via Coursera
TensorFlow Developer Professional Certificate
DeepLearning.AI
Deep Learning Specialization
DeepLearning.AI
Skills
Libraries/APIs
PyTorch, TensorFlow, PySpark
Tools
Azure Kubernetes Service (AKS), Azure Machine Learning, Grafana, ChatGPT
Languages
Python 3, SQL
Frameworks
Flask, LlamaIndex
Platforms
Docker, Kubernetes, Azure, Databricks, Visual Studio Code (VS Code)
Storage
Databases
Other
Programming, Machine Learning, Artificial Intelligence (AI), Deep Learning, Natural Language Processing (NLP), Software Engineering, Prompt Engineering, Transformer Models, FastAPI, Large Language Models (LLMs), Retrieval-augmented Generation (RAG), Generative Artificial Intelligence (GenAI), Windows 10, Algorithms, Mathematics, Data Structures, Computer Networking, Computer Architecture, LangChain, Recommendation Systems, Reinforcement Learning, OpenAI, Open-source LLMs