
Yahia Mahmoud
Verified Expert in Engineering
Data Analyst and Developer
Cairo, Cairo Governorate, Egypt
Toptal member since January 9, 2024
Yahia is a data analyst and web scraper with nearly three years of experience. He has focused on data cleaning, building scraping scripts, and analyzing data for companies in Germany, Canada, and Egypt. Yahia specializes in data analysis, Python, Selenium, Beautiful Soup, Microsoft Excel, SQL, and MongoDB.
Portfolio
Experience
- Python - 3 years
- Data Cleaning - 3 years
- Data Analysis - 3 years
- Algorithms - 3 years
- Regex - 3 years
- Data Cleansing - 3 years
- Beautiful Soup - 2 years
- Selenium - 2 years
Preferred Environment
Windows, Jupyter Notebook, Visual Studio Code (VS Code), Google Colaboratory (Colab), Python, Selenium, Beautiful Soup, Microsoft Excel, SQL, MongoDB
The most amazing...
...project I've been a part of transformed complex data cleaning tasks into streamlined processes, boosting productivity with scripts that cut the time required by 98%.
Work Experience
Data Analyst
Brightly - Main
- Reduced task completion time by 15% compared to the team average through advanced Python and SQL automation, ensuring 100% data validation accuracy.
- Developed and fine-tuned SQL scripts that decreased database errors by 40%, improving performance and operational reliability.
- Migrated and integrated data for over 100 clients from legacy systems to new databases using AWS Glue, improving data processing efficiency by 25%.
- Resolved six critical client issues by troubleshooting and optimizing SQL queries, decreasing average incident resolution time by 30%.
- Led cross-functional meetings with clients, cutting recurring data migration errors by 20% through proactive issue identification and solution implementation.
Data Analyst
Analytic Company GmbH
- Conducted comprehensive research to gather missing information and enhance data completeness, contributing to a broader understanding of the data.
- Executed thorough data analysis, including filling in missing values, performing model-specific analyses, and conducting quality checks to ensure data accuracy and completeness.
- Delivered over seven datasets monthly by inputting and processing relevant vehicle data into the existing data entry system and online editor.
Data Operation Specialist
E-motion Digital Creative Agency
- Led the founding of the "Gromart" project and served as the data team's central figure, overseeing the development and execution of web scraping initiatives across 15+ websites.
- Implemented efficient data collection strategies using Python libraries such as Selenium and Beautiful Soup, enabling the extraction of diverse data types and storing the results in Microsoft Excel, MongoDB, and both relational and non-relational databases.
- Engineered scripts that reduced time-consuming tasks to streamlined processes completed in just five minutes, significantly improving productivity and resource utilization.
- Orchestrated the integration of multiple data storage solutions, including MongoDB, for unstructured and structured data, ensuring a versatile and scalable approach to data management within the "Gromart" project.
Data Analyst and Web Scraper
Aview International
- Scraped over 40,000 rows of data from diverse websites, including YouTube and Udemy, using Python libraries such as Selenium and Beautiful Soup.
- Led a team, ensuring smooth project execution by setting clear objectives, providing regular reports, and assisting colleagues in troubleshooting issues.
- Leveraged Python libraries such as Pandas and NumPy to clean the data, ensuring data quality and reliability for subsequent analysis and decision-making.
- Organized the extracted data into structured formats, including CSV files and Microsoft Excel sheets, to facilitate efficient analysis and reporting.
Data Analyst
Freelance
- Collaborated with a diverse range of clients, including over five companies and various individual customers, delivering specialized data services.
- Leveraged my expertise as a data analyst, data entry specialist, data collector, web scraper, and data visualization specialist to meet client-specific requirements.
- Successfully managed projects through online freelance agencies, platforms, and personal contacts, ensuring timely delivery of high-quality work.
- Demonstrated versatility and adaptability, consistently meeting client expectations and exceeding project goals.
Experience
Dream2000 Products Scraper
https://github.com/YahiaML/Dream2000-Scraper
Leveraging Python, Beautiful Soup, and Selenium, I ensured the scraper's adaptability to the site's structure, enabling it to systematically collect product information, including categories, subcategories, links, images, prices, and descriptions. Implementing a checkpoint system mitigated potential data loss and provided reliability during prolonged scraping sessions.
This project showcases my expertise in web scraping, data processing with Pandas, and problem-solving in handling diverse website structures. The Dream2000 Products Scraper is a testament to my commitment to delivering robust, versatile, and user-friendly solutions for extracting valuable insights from complex online platforms.
Coldwell Banker Data Processing
https://github.com/YahiaML/Coldwell-Banker-Data-Processing
My involvement in the project encompassed the data processing script's design, development, and refinement. I played a pivotal role in crafting the logic for data cleaning, implementing mapping strategies, and orchestrating the creation of Microsoft Excel sheets. Additionally, I spearheaded the project's overarching goal of comparing new projects with existing ones and refining project names based on a predefined mapping schema. My commitment to data accuracy and consistency has been instrumental in the project's success, contributing to the seamless integration of real estate data into Coldwell Banker's system.
Movie Data Analysis
https://github.com/YahiaML/TMDb-movies-data-investigation
Manga Downloader
https://drive.google.com/file/d/1RyB6YIzS-mRsMWItHXSHNDR50yyoyf7_/view?usp=sharing
Key features include a vast selection of manga and manhwa titles, chapter-specific downloads for offline reading, and a user-friendly offline reading experience. Automation supports batch downloads of multiple chapters or entire series, saving time for avid readers. This project, driven by a passion for manga and manhwa, leverages Python and diverse libraries to provide a delightful offline reading experience for enthusiasts worldwide.
Education
Bachelor's Degree in Data Science
Arab Open University - Cairo, Egypt
Certifications
Advanced Data Analysis
Udacity
Python for Data Science
Sololearn
Python: Working with Predictive Analytics
Data Analysis Professional
Udacity
Data Analysis Challenger Certificate
Udacity
Skills
Libraries/APIs
Beautiful Soup, Pandas, Requests, Matplotlib, NumPy, OpenCV, PIL
Tools
Seaborn, Jupyter, Microsoft Excel, Microsoft PowerPoint, Tableau, Microsoft Power BI, AWS Glue
Languages
Regex, Python, Python 3, Excel VBA, SQL, R, HTML
Frameworks
Selenium
Platforms
Jupyter Notebook, Visual Studio Code (VS Code), Windows
Storage
JSON, MongoDB, Databases
Paradigms
Automation
Other
Excel 365, Data, Data Analysis, Data Cleaning, Data Cleansing, Algorithms, CSV, XLSX File Processing, Web Scraping, Website Data Scraping, Data Scraping, PDF Scraping, Financial Data, Manual QA, Scraping, Microsoft 365, Google Colaboratory (Colab), Data Entry, PDF to Excel, Calculus, Algebra, Linear Algebra, Discrete Mathematics, Statistics, Pivot Tables, Data Visualization, Data Science, Data Processing