Keval Katrodiya
Verified Expert in Engineering
Python Developer
Surat, Gujarat, India
Toptal member since October 18, 2024
Keval is a Python developer and web scraping expert focused on Python and cloud computing. Since 2015, he has contributed to product development from concept to maintenance across startups and corporate projects. Keval also excels in big data processing with PySpark and advanced analytics in AWS.
Portfolio
Experience
- Data Scraping - 7 years
- Data Extraction - 7 years
- Web Scraping - 7 years
- Python - 7 years
- Scrapy - 7 years
- Data Mining - 7 years
- MySQL - 7 years
- Docker - 5 years
Availability
Preferred Environment
AWS CLI, Python, Web Scraping, Data Extraction, MySQL, Scrapy, Web Development, Docker, Data Scraping, Data Mining
The most amazing...
...thing I've built is a real-time, distributed data processing system with Python, Celery, and RabbitMQ for scalable task management and efficient resource use.
Work Experience
Senior Web Scraping Developer
Firmagraphix LLC
- Automated data collection using Python and libraries such as Requests, Scrapy, Beautiful Soup, lxml, Selenium, and pandas.
- Scraped data from diverse websites, including real estate, finance, travel and leisure, eCommerce like Amazon and Alibaba, and sales leads from Instagram, Facebook, and TikTok.
- Converted scraped data into various structured formats, including TXT, PDF, image, JSON, FTP, API, CSV, XML, XLS, and SQL.
- Enabled automatic data delivery to cloud-based storage solutions like AWS, Dropbox, Amazon S3, and Oracle.
- Designed master-slave architecture using AWS Lambda for high-volume data scraping, with proxy rotation for Cloudflare bypass.
- Developed scrapers with advanced security features for data collection from various websites, including banking sites with real transaction fetching.
- Utilized Google Cloud Pub/Sub for message queuing, facilitating ARMLS data syndication to the platform database.
- Implemented data processing pipelines for feeding scraped data into dashboards and visualization tools.
- Created and deployed real-time dashboards using Tableau, providing valuable insights to the business team.
Data Specialist
Luminoso
- Developed advanced web scraping and web crawling scripts to handle large datasets.
- Utilized a combination of Python, Scrapy, pandas, Perl, MySQL, MS SQL, Alteryx, RedPoint, and Pentaho for script development and data management.
- Implemented advanced techniques for crawling, finding, fetching, parsing, and cleaning data, enhancing accuracy and efficiency.
- Integrated data management solutions like Alteryx, RedPoint, and Pentaho to streamline data processing and analysis workflows.
- Devised strategies to handle dynamic content and circumvent anti-scraping measures, ensuring reliable data extraction.
- Created scalable web scraping solutions using Python, Scrapy, and Pentaho, handling over 200 million records from hundreds of web pages, enhancing data accuracy and speed.
- Implemented parallel processing and distributed computing techniques to manage large datasets, utilizing advanced tools like Alteryx, RedPoint, and AWS.
Data Engineering
Code-X
- Designed and developed a data pipeline to extract, transform, and load sales data from multiple sources into a centralized database.
- Built data models and analyzed data to generate insights and visualize sales trends using Python, SQL, and visualization libraries.
- Implemented automated data updates and scheduled ETL processes using AWS Lambda and cron jobs.
- Utilized Flask to create a web-based analytics dashboard for real-time monitoring and interactive visualizations.
Web Developer
Ecotech IT Solutions Pvt Ltd
- Improved user engagement and satisfaction by 30% by collaborating with clients to redesign and launch several websites.
- Developed responsive website layouts and user interfaces using HTML, CSS, and JavaScript, ensuring seamless functionality across all devices.
- Boosted team productivity by 20% by integrating back-end services and databases using Python, Django, and MySQL, contributing to internal tool development.
- Conducted regular testing, debugging, and optimization, implementing SEO best practices that increased website visibility and organic traffic.
Experience
Automated Real Estate Data Scraper
eCommerce Web Scraper and Data Pipeline
Education
Bachelor's Degree in Computer Engineering
Parul University - Surat, Gujarat, India
Skills
Libraries/APIs
Playwright, Node.js, Puppeteer, jQuery, Pandas
Tools
Visual Studio, AWS CLI, AWS SDK, Git, AWS Glue, RabbitMQ, Celery, Jira, PyCharm, Tableau, Apache Airflow, Cron
Frameworks
Scrapy, Selenium, Django, Flask
Paradigms
Automation
Languages
Python, Snowflake, HTML, CSS, Python 3, SQL
Platforms
Docker, Amazon EC2, AWS Lambda, Jupyter Notebook
Storage
MySQL, MongoDB, Amazon S3 (AWS S3)
Other
Web Scraping, Data Extraction, Data Scraping, Data Mining, Web Development, Data Analytics, Apps, lxml, Proxies, Cloudflare
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring