Omar Helwani
Verified Expert in Engineering
Machine Learning Developer
Montcada i Reixac, Spain
Toptal member since January 14, 2022
Omar is an experienced data engineer who has worked with several databases, cloud providers, and BI tools. He has some experience performing data science and machine learning tasks with Python and has performed data architect tasks in many projects. Omar is highly interested in business processes and how they can be improved using data in multiple ways, such as automation, data quality checks, fraud detection, and asset optimization.
Portfolio
Experience
- SQL - 6 years
- Data Pipelines - 5 years
- Python - 5 years
- ETL - 5 years
- Data Engineering - 4 years
- Business Intelligence (BI) - 4 years
- Docker - 4 years
- Git - 4 years
Availability
Preferred Environment
Python, MacOS, Google Cloud Platform (GCP), SQL
The most amazing...
...project that I've participated in and led as an architect is the migration of the whole legacy data architecture using a business-oriented architecture.
Work Experience
Senior Data Engineer
Vic.ai
- Reduced the time needed to generate datasets for training machine learning models from days to a couple of hours.
- Improved code quality and readability, eliminating redundant code and simplifying the login process.
- Included new data from current and new sources into the datasets so that machine learning models have more data available to make predictions.
Senior Data Engineer
Packlink
- Built a new data warehouse on BigQuery to bring new reporting capabilities.
- Implemented data quality checks to report anomalies in the data.
- Orchestrated ETL with Google Cloud Composer (Apache Airflow).
- Implemented DBT as the SQL orchestrator and metadata repository.
- Built automated data entries with GCS and cloud functions written in Python linked to BigQuery through DBT queries.
- Implemented CI/CD with Google Cloud Build using a DBT Docker image and DBT tests.
Data Scientist
Hemav Technology
- Implemented a UI with Tkinter (Python) to load manually gathered data.
- Worked on a more robust statistical analysis using Python libraries like Pandas and Matplotlib.
- Built machine learning models using scikit-learn to predict the optimal harvest date.
- Deployed machine learning models on AWS using Lambdas.
- Ingested weather API data and used it in machine learning models as input.
Data Engineer
Netquest
- Built new data marts to analyze customer behavior on Redshift.
- Implemented slowly changing dimension processes using SQL and Spark (written in Scala) on AWS EMR.
- Created new dashboards using Qlik Sense to analyze the latest data marts.
ODI and ETL Developer
Avanttic
- Implemented complex data quality checks using regular expressions embedded in SQL queries running on an Oracle database.
- Improved daily load performance using parallel scheduling in ODI with dynamic SQL code and database tuning.
- Enhanced the current data warehouse design using star modeling.
Business Intelligence Analyst
eDreams ODIGEO
- Developed new dashboards and maintained existing ones using QlikView and MicroStrategy.
- Created an outlier detection system using dynamic queries run on an Oracle database.
- Maintained current ETL processes with ODI and QlikView.
Business Intelligence Assistant
everis Spain, S.L.U
- Adapted existing data marts to support the adoption of MicroStrategy as a BI tool.
- Modified ETL using dynamic SQL to meet clients' requirements.
- Improved query performance using query hints and table configurations.
Experience
Real State Bargains
To get the data, it scrapes real estate portals such as www.idealista.com or www.habitaclia.com.
Currently, I keep it as a private repository, and it only scrapes Spanish real estate websites.
Education
Bachelor's Degree in Business and Technology
Universitat Autònoma de Barcelona - Sabadell, Barcelona, Spain
Certifications
AI for Trading | Nanodegree
Udacity
Artificial Intelligence | Nanodegree
Udacity
Machine Learning Engineer | Nanodegree
Udacity
Deep Learning | Nanodegree
Udacity
Data Analyst | Nanodegree
Udacity
Business Analyst | Nanodegree
Udacity
Skills
Libraries/APIs
Scikit-learn, Pandas, NumPy, Matplotlib
Tools
BigQuery, Git, Google Cloud Composer, Tableau, Apache Airflow, Amazon Athena
Languages
Python, SQL, Bash, Java, Scala
Paradigms
ETL, ETL Implementation & Design, Business Intelligence (BI), Dataflow Programming
Platforms
Oracle Data Integrator (ODI), Google Cloud Platform (GCP), Docker, Oracle Database, QlikView, Amazon EC2, Amazon Web Services (AWS), MacOS, AWS Lambda
Storage
PostgreSQL, Data Pipelines, MySQL, Redshift, Amazon S3 (AWS S3), Relational Databases, Datastage
Frameworks
Django, Apache Spark
Industry Expertise
Marketing, Accounting
Other
Data Engineering, Data Warehousing, Data Science, Machine Learning, Data Build Tool (dbt), Google Data Studio, Amazon RDS, APIs, Message Queues, Economics, Mathematics, Customer Relationship Management (CRM), Enterprise Resource Planning (ERP), Logistics, Law, Finance, MicroStrategy