Omar Helwani, Developer in Montcada i Reixac, Spain
Omar is available for hire
Hire Omar

Omar Helwani

Verified Expert  in Engineering

Bio

Omar is an experienced data engineer who has worked with several databases, cloud providers, and BI tools. He has some experience performing data science and machine learning tasks with Python and has performed data architect tasks in many projects. Omar is highly interested in business processes and how they can be improved using data in multiple ways, such as automation, data quality checks, fraud detection, and assets optimization.

Portfolio

Vic.ai
Python, Bash, SQL, Data Engineering, Amazon EC2, Amazon Web Services (AWS)
Packlink
SQL, Python, Java, Google Cloud Platform (GCP), BigQuery, Data Build Tool (dbt)...
Hemav Technology
Python, SQL, AWS Lambda, Scikit-learn, Pandas, NumPy, PostgreSQL, Django...

Experience

Availability

Part-time

Preferred Environment

Python, MacOS, Google Cloud Platform (GCP), SQL

The most amazing...

...project that I've participated in and led as an architect is the migration of the whole legacy data architecture using a business-oriented architecture.

Work Experience

Senior Data Engineer

2021 - PRESENT
Vic.ai
  • Increased the dataset generation to train machine learning models from days to a couple of hours.
  • Improved code quality and readability, eliminating redundant code and simplifying the login process.
  • Included new data from current and new sources into the datasets so that machine learning models have more data available to make predictions.
Technologies: Python, Bash, SQL, Data Engineering, Amazon EC2, Amazon Web Services (AWS)

Senior Data Engineer

2020 - 2021
Packlink
  • Built a new data warehouse on BigQuery to bring new reporting capabilities.
  • Implemented data quality checks to report anomalies in the data.
  • Orchestrated ETL with Google Cloud Composer (Apache Airflow).
  • Implemented DBT as SQL orchestrator and metadata repository.
  • Built automated data entries with GCS and cloud functions written in Python linked to BigQuery through DBT queries.
  • Implemented CI/CD with Google Cloud Build using DBT Docker image and DBT tests.
Technologies: SQL, Python, Java, Google Cloud Platform (GCP), BigQuery, Data Build Tool (dbt), Apache Airflow, Dataflow Programming, Git, Tableau, Google Data Studio, Docker, ETL, Data Engineering, Data Pipelines, Apache Spark, Relational Databases, Data Warehousing, ETL Implementation & Design, MySQL, APIs, Message Queues, Google Cloud Composer

Data Scientist

2018 - 2020
Hemav Technology
  • Implemented UI to load data gathered manually using Tkinter (Python).
  • Worked on a more robust statistical analysis using Python libraries like Pandas and Matplotlib.
  • Built machine learning models using scikit-learn to predict optimal harvest date.
  • Deployed machine learning models on AWS using Lambdas.
  • Ingested weather API data and used it in machine learning models as input.
Technologies: Python, SQL, AWS Lambda, Scikit-learn, Pandas, NumPy, PostgreSQL, Django, Matplotlib, Git, MySQL, Amazon RDS, Amazon Web Services (AWS), APIs

Data Engineer

2018 - 2018
Netquest
  • Built new data marts to analyze customer behavior on Redshift.
  • Implemented slow-changing dimension processes using SQL and Spark in AWS EMR using Scala code.
  • Created new dashboards using Qlik Sense to analyze the latest data marts.
Technologies: SQL, Redshift, Python, Scala, Git, Amazon Athena, Amazon S3 (AWS S3), ETL, Data Engineering, Data Pipelines, Apache Spark, Relational Databases, Data Warehousing, ETL Implementation & Design, Amazon Web Services (AWS)

ODI and ETL Developer

2017 - 2018
Avanttic
  • Implemented complex data quality checks using regular expressions embedded in SQL queries running on an Oracle database.
  • Improved daily load performance using parallel scheduling in ODI with dynamic SQL code and database tuning.
  • Enhanced the current data warehouse design using star modeling.
Technologies: Oracle Data Integrator (ODI), SQL, Oracle Database, ETL, Data Warehousing, ETL Implementation & Design

Business Intelligence Analyst

2015 - 2017
eDreams ODIGEO
  • Developed new dashboards and maintained existing ones using QlikView and MicroStrategy.
  • Created an outlier detection system using dynamic queries run on an Oracle database.
  • Maintained current ETL processes with ODI and QlikView.
Technologies: SQL, QlikView, MicroStrategy, Oracle Data Integrator (ODI), Oracle Database, Data Warehousing, ETL Implementation & Design, Datastage

Business Intelligence Assistant

2014 - 2015
everis Spain, S.L.U
  • Adapted existing data marts to the adoption of MicroStrategy as a BI tool.
  • Modified ETL using dynamic SQL to meet clients' requirements.
  • Improved queries performance using some query hints and table configurations.
Technologies: SQL, MicroStrategy, ETL Implementation & Design, Data Warehousing

Real State Bargains

This project was built using Python, and its main objective is to find bargains or apartments with the best quality and price ratio.

To get the data, it scraps real state portals like www.idealista.com or www.habitaclia.com.

Nowadays, I have it as a private repository, and it just scraps in Spanish real state websites.
2010 - 2014

Bachelor's Degree in Business and Technology

Universitat Autònoma de Barcelona - Sabadell, Barcelona, Spain

MARCH 2020 - PRESENT

AI for Trading | Nanodegree

Udacity

DECEMBER 2018 - PRESENT

Artificial Intelligence | Nanodegree

Udacity

AUGUST 2018 - PRESENT

Machine Learning Engineer | Nanodegree

Udacity

AUGUST 2018 - PRESENT

Deep Learning | Nanodegree

Udacity

NOVEMBER 2017 - PRESENT

Data Analyst | Nanodegree

Udacity

FEBRUARY 2017 - PRESENT

Business Analyst | Nanodegree

Udacity

Libraries/APIs

Scikit-learn, Pandas, NumPy, Matplotlib

Tools

BigQuery, Git, Google Cloud Composer, Tableau, Apache Airflow, Amazon Athena

Languages

Python, SQL, Bash, Java, Scala

Paradigms

ETL, ETL Implementation & Design, Business Intelligence (BI), Dataflow Programming

Platforms

Oracle Data Integrator (ODI), Google Cloud Platform (GCP), Docker, Oracle Database, QlikView, Amazon EC2, Amazon Web Services (AWS), MacOS, AWS Lambda

Storage

PostgreSQL, Data Pipelines, MySQL, Redshift, Amazon S3 (AWS S3), Relational Databases, Datastage

Frameworks

Django, Apache Spark

Industry Expertise

Marketing, Accounting

Other

Data Engineering, Data Warehousing, Data Science, Machine Learning, Data Build Tool (dbt), Google Data Studio, Amazon RDS, APIs, Message Queues, Economics, Mathematics, Customer Relationship Management (CRM), Enterprise Resource Planning (ERP), Logistics, Law, Finance, MicroStrategy

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring