Yahia Mahmoud
Verified Expert in Engineering
Data Analyst and Developer
Yahia is an experienced data analyst and web scraper with nearly three years of experience. He has focused on cleaning tasks, building scraping scripts, and analyzing data for companies in Germany, Canada, and Egypt. Yahia specializes in data analysis, Python, Selenium, Beautiful Soup, Microsoft Excel, SQL, and MongoDB.
Portfolio
Experience
Availability
Preferred Environment
Windows, Jupyter Notebook, Visual Studio Code (VS Code), Google Colaboratory (Colab), Python, Selenium, Beautiful Soup, Microsoft Excel, SQL, MongoDB
The most amazing...
...project I've been a part of transformed complex data cleaning tasks into streamlined processes, boosting productivity through scripts that saved time by 98%.
Work Experience
Data Operation Specialist
E-motion Digital Creative Agency
- Led the "Gromart" project's foundation and served as the data team's central figure, overseeing the development and execution of web scraping initiatives across 15+ websites.
- Implemented efficient data collection strategies using Python libraries such as Selenium and Beautiful Soup, enabling the extraction of diverse data types. I stored information in Microsoft Excel, MongoDB, and relational and non-relational databases.
- Engineered time-saving scripts that transformed time-consuming tasks into streamlined processes completed in just five minutes, showcasing a significant productivity and resource utilization boost.
- Orchestrated the integration of multiple data storage solutions, including MongoDB, for unstructured and structured data, ensuring a versatile and scalable approach to data management within the "Gromart" project.
Data Analyst
Analytic Company GmbH
- Conducted comprehensive research to gather missing information and enhance data completeness, contributing to a broader understanding of the data.
- Executed thorough data analysis, including filling in missing values, performing model-specific analyses, and conducting quality checks to ensure data accuracy and completeness.
- Delivered over seven datasets monthly by inputting and processing relevant vehicle data into the existing data entry system and online editor.
Data Analyst and Web Scraper
Aview International
- Scraped over 40,000 rows of data from diverse websites, including YouTube and Udemy, using Python libraries such as Selenium and Beautiful Soup.
- Led a team, ensuring smooth project execution by setting clear objectives, providing regular reports, and assisting colleagues in troubleshooting issues.
- Leveraged Python libraries such as Pandas and NumPy to clean the data, ensuring data quality and reliability for subsequent analysis and decision-making.
- Organized the extracted data into structured formats, including CSV files and Microsoft Excel sheets, to facilitate efficient analysis and reporting.
Data Analyst
Freelance
- Collaborated with a diverse range of clients, including over five companies and various individual customers, delivering specialized data services.
- Leveraged my expertise as a data analyst, data entry specialist, data collector, web scraper, and data visualization specialist to meet client-specific requirements.
- Successfully managed projects through online freelance agencies, platforms, and personal contacts, ensuring timely delivery of high-quality work.
- Demonstrated versatility and adaptability, consistently meeting client expectations and exceeding project goals.
Experience
Dream2000 Products Scraper
https://github.com/YahiaML/Dream2000-ScraperLeveraging Python, Beautiful Soup, and Selenium, I ensured the scraper's adaptability to the site's structure, enabling it to systematically collect product information, including categories, subcategories, links, images, prices, and descriptions. Implementing a checkpoint system mitigated potential data loss and provided reliability during prolonged scraping sessions.
This project showcases my expertise in web scraping, data processing with Pandas, and problem-solving in handling diverse website structures. The Dream2000 Products Scraper is a testament to my commitment to delivering robust, versatile, and user-friendly solutions for extracting valuable insights from complex online platforms.
Coldwell Banker Data Processing
https://github.com/YahiaML/Coldwell-Banker-Data-ProcessingMy involvement in the project encompassed the data processing script's design, development, and refinement. I played a pivotal role in crafting the logic for data cleaning, implementing mapping strategies, and orchestrating the creation of Microsoft Excel sheets. Additionally, I spearheaded the project's overarching goal of comparing new projects with existing ones and refining project names based on a predefined mapping schema. My commitment to data accuracy and consistency has been instrumental in the project's success, contributing to the seamless integration of real estate data into Coldwell Banker's system.
Movie Data Analysis
https://github.com/YahiaML/TMDb-movies-data-investigationManga Downloader
https://drive.google.com/file/d/1RyB6YIzS-mRsMWItHXSHNDR50yyoyf7_/view?usp=sharingKey features include a vast selection of manga and manhwa titles, chapter-specific downloads for offline reading, and a user-friendly offline reading experience. Automation supports batch downloads of multiple chapters or entire series, saving time for avid readers. This project, driven by a passion for manga and manhwa, leverages Python and diverse libraries to provide a delightful offline reading experience for enthusiasts worldwide.
Education
Bachelor's Degree in Data Science
Arab Open University - Cairo, Egypt
Certifications
Advanced Data Analysis
Udacity
Python for Data Science
Sololearn
Python: Working with Predictive Analytics
Data Analysis Professional
Udacity
Data Analysis Challenger Certificate
Udacity
Skills
Libraries/APIs
Pandas, Requests, Matplotlib, Beautiful Soup, NumPy, OpenCV, PIL
Tools
Seaborn, Jupyter, Microsoft PowerPoint, Tableau, Microsoft Power BI, Microsoft Excel
Frameworks
Selenium
Languages
Regex, Python, Python 3, Excel VBA, SQL, R, HTML
Paradigms
Data Science
Platforms
Jupyter Notebook, Visual Studio Code (VS Code), Windows
Storage
JSON, MongoDB, Databases
Other
Excel 365, Data, Data Analysis, Data Cleaning, Data Cleansing, Algorithms, CSV, XLSX File Processing, Web Scraping, Website Data Scraping, Data Scraping, PDF Scraping, Financial Data, Manual QA, Google Colaboratory (Colab), Data Entry, PDF to Excel, Calculus, Algebra, Linear Algebra, Discrete Mathematics, Statistics, Pivot Tables, Data Visualization, Data Processing
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring