Siddharth Chabra
Verified Expert in Engineering
Data Engineer and Developer
Gurugram, Haryana, India
Toptal member since July 13, 2022
Siddharth is a seasoned professional with 15 years of experience. He worked in multiple areas such as image processing, artificial neural networks, and data warehousing. Siddharth specializes in cloud data warehousing, working mainly with BigQuery and Snowflake.
Portfolio
Experience
Availability
Preferred Environment
GitHub, Python, SQL, REST APIs, Google BigQuery, Snowflake, Data Build Tool (dbt), Data Engineering, Data Architecture, Data Analytics
The most amazing...
...project I've completed is the single-handed creation of a data warehouse for a D2C eCommerce client in just 60 days.
Work Experience
Data Architect
Elysium Health, Inc.
- Deployed new data models to support new product launches. Kept the legacy data models operative and made the data models backward compatible to support the legacy data models.
- Developed a GA4 data model to support the client's marketing needs, helping the marketing team understand and transition from the existing UA3 models to the new GA4 data models.
- Optimized data pipelines on Fivetran to reduce costs for the client.
System Architect | Data Engineer
EVEREST GROWTH PARTNERS LLC
- Developed a system to parse large-scale nested JSON files into BigQuery.
- Developed a proprietary JSON parser to parse terabyte-sized JSON files in minutes.
- Developed an API to make the system automated and attachable to any UI.
Director of Data Engineering
US-based Venture Capital Firm
- Built the data warehousing business from scratch. Hired a team of data engineers, business analysts, data analysts, and business intelligence analysts.
- Led the development of 30+ data warehouses for three years with a small team of 20 people.
- Oversaw the development of 1,000+ visualizations on multiple database BI tool combinations, including Snowflake with Looker, Snowflake with Sigma Computing, Snowflake with Tableau, and Big Query with Looker.
Software Engineer
Freelance
- Developed a GTO optimized playing poker trainer for a professional poker player.
- Expanded the trainer from just handling heads-up Texas hold'em to 6-max Texas hold'em as well as PLO4.
- Created and delivered the MVP with relevant documentation to the client in 10 weeks.
Senior Consultant
Infosys
- Acted as a program manager of a team of 50 process mapping experts to map 11,000 BAU processes. The project had a twofold goal, regulatory requirements and process optimization. Delivered the project on time and 30% under budget.
- Helped a large US-based hedge fund prevent front running by designing a system for a large-scale data obfuscation project, which transformed 100 TB of production data into 100 TB of obfuscated data that analysts and developers could work with.
- Got four promotions in three years and was on the fast track to becoming a partner in Infosys Consulting.
Senior Software Engineer
Newgen Software Technologies Limited
- Created an artificial neural network-based image analysis software to automatically identify fraudulent signatures on cheques and documents. The system had a successful identification rate of over 80% and a false positive error rate of less than 2%.
- Ported the organization image processing library for C (32-bit) to C++ (64-bit), C# (64-bit), and Python.
- Filed for eight patents in image processing, artificial neural networks, and document security areas.
Experience
Data Pipelines for a Snowflake Data Warehouse
Shopify Support for DHL
Poker Hand Evaluator
Product Marketing Dashboard
Logistics Auditing Function
Education
Master's Degree in General Business Administration (MBA)
Indian Institute of Management Calcutta - Calcutta, India
Bachelor's Degree in Computer Science
Delhi College of Engineering - Delhi, India
Skills
Libraries/APIs
Amazon Marketplace Web Service (MWS), Amazon API, Google Analytics API, NumPy, REST APIs, Pandas
Tools
BigQuery, Microsoft Excel, Rundeck, MATLAB, GitHub, Apache Airflow, Google Analytics, Looker, AWS Glue
Languages
SQL, Snowflake, Python, C, C++, C#
Paradigms
Automation, ETL, Database Design, Business Intelligence (BI)
Platforms
Google Cloud Platform (GCP), Windows, Visual Studio Code (VS Code), Amazon Web Services (AWS), Cloud Run
Storage
Data Pipelines, JSON, Database Modeling, Database Migration, Databases, API Databases, Relational Databases, PostgreSQL, Redshift, Google Cloud
Industry Expertise
Marketing, Healthcare, Insurance, Banking & Finance
Other
Data Engineering, Google BigQuery, Data Build Tool (dbt), Data Modeling, Data Architecture, Program Management, Data Analytics, Data Structures, Data Warehousing, Database Schema Design, Reporting, Integration, Data Analysis, APIs, ELT, ETL Tools, Data Aggregation, Data Visualization, Data Reporting, Dashboards, Direct to Consumer (D2C), Data Warehouse Design, Scraping, Web Crawlers, Amazon RDS, Google Analytics 4, ETL Pipelines, eCommerce, Google Pub/Sub, CSV, Business Analysis, Web Scraping, Marketing Analytics, Cohort Analysis, GitHub Actions, Image Processing, Analytics, Game Theory, PokerTracker 4, Fivetran, Star Schema, Google Cloud Functions, Cloud Tasks, Consolidation, Cloud, Big Data, Large Language Models (LLMs), Relational Database Services (RDS), Neural Networks, Finance, Structured Finance, Process Design, Artificial Neural Networks (ANN), Poker, Minimum Viable Product (MVP)
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring