Ghassan Hallaq, Developer in Montreal, QC, Canada

Ghassan Hallaq

Verified Expert in Engineering

Bio

Gus is passionate and curious about the latest technology (artificial intelligence, DRL, etc.) and how to apply it to his apps' architecture or even to his integration apps. Gus also has experience building scalable distributed apps based on the most recent tech stack.

Portfolio

Pillar Base, Inc.
BigCommerce API, TypeScript, Spark SQL, Amazon Cognito, Amazon DynamoDB...
Dodo Data LLC dba Audantic
JavaScript, Python, Full-stack, Amazon Web Services (AWS), Databricks, MongoDB...
Oliver Wyman
Data Engineering, PySpark, Amazon Web Services (AWS), Amazon EMR Studio, ETL...

Experience

Availability

Full-time

Preferred Environment

Linux, Amazon Web Services (AWS), Azure, Elixir, Python 3, Go, Rust, Deep Reinforcement Learning

The most amazing...

...integration solution I've built is bringing a data warehouse, Salesforce, marketing platform, and AI together into one solution.

Work Experience

Senior Data and Solution Architect

2019 - PRESENT
Pillar Base, Inc.
  • Developed an application to denormalize data from the relational database and stream it to an AWS DynamoDB single table.
  • Worked on a proof of concept (POC) to compare the performance of Spark SQL, PrestoDB, and Apache Drill on Parquet files.
  • Architected and developed integration solutions based on AWS Lambda, API Gateways, Cognito, WMS, Facebook Open Graph API, Google Ads API, BigCommerce, AWS DynamoDB, JavaScript, and TypeScript.
  • Built a highly concurrent and fault-tolerant back-end engine that handles over one million requests using Rust. Worked on the POC to compare Rust concurrency with Go concurrency.
  • Built and deployed ML models using PyTorch and scikit-learn.
  • Built many ETL pipelines to move data into and out of Snowflake and created a data app using Streamlit and Snowpark. Created an ETL pipeline between Snowflake and BigQuery.
  • Enhanced the embedding method for some LLM models in Hugging Face.
  • Built a notification system integrated with the Solana blockchain.
  • Built chatbot solutions based on ChatGPT and Meta Llama 2 and 3.
  • Architected and built iOS/Android apps using Objective-C, Swift, and Kotlin.
Technologies: BigCommerce API, TypeScript, Spark SQL, Amazon Cognito, Amazon DynamoDB, AWS Lambda, Serverless, Rust, Web3.js, Solution Architecture, Go, Qiskit Runtime, Full-stack, Software Architecture, Architecture, Next.js, MongoDB, Artificial Intelligence (AI), APIs, Shopify, Team Leadership, OpenAI GPT-3 API, OpenAI GPT-4 API, Remote Team Leadership, Snowflake, Data Architecture, Phoenix, Google BigQuery, Elixir, Erlang (OTP), Java 8, NoSQL, PostgreSQL, Big Data, ETL Development, Data Engineering, MySQL/MariaDB, Functional Programming, Git, GraphQL, ChatGPT, OpenAI, Embedded Rust, Elixir Oban, Elixir Broadway, SQLAlchemy, FastAPI, PyTorch, ClickHouse, WebRTC, RTCP, RTP, Kotlin, Gradle, Distributed Software, Swift, iOS, Android, Terraform, Geospatial Data, Geospatial Analytics, Asyncio, Python Asyncio

Full-stack Engineer

2024 - 2024
Dodo Data LLC dba Audantic
  • Built the MVP app for real estate analytics using the Remix/React framework.
  • Developed integration with ThoughtSpot to use the analytic components.
  • Built integrations with an auth provider and Stripe, hosted on Heroku.
  • Built an iOS mobile app to render properties on maps.
Technologies: JavaScript, Python, Full-stack, Amazon Web Services (AWS), Databricks, MongoDB, Swift, iOS

Data Engineer

2024 - 2024
Oliver Wyman
  • Assisted the team in writing better Apache Spark jobs.
  • Collaborated on setting up an Amazon EMR cluster to run jobs with huge data sizes.
  • Optimized cluster utilization when running Apache Spark jobs.
Technologies: Data Engineering, PySpark, Amazon Web Services (AWS), Amazon EMR Studio, ETL, Data Lakes, Amazon S3 (AWS S3), PyTorch, Gradle, Terraform

Senior Elixir Developer

2023 - 2024
Copia Wealth Studios
  • Implemented a Pub/Sub system as a back-end developer.
  • Optimized and enhanced the integration with some data sources.
  • Used the new features from Phoenix LiveView to enhance the web responsiveness.
  • Helped the iOS/Android development team integrate with Phoenix PubSub.
Technologies: Elixir, APIs, GraphQL, Absinthe, Phoenix, REST APIs, Elixir Oban, Elixir Broadway, Fintech, Distributed Software, Swift, iOS, Android, Kotlin, Terraform

Back-end Developer

2023 - 2024
MYY LLC
  • Built a RAG framework to enrich the context supplied with any prompt.
  • Selected the best LLM/diffusion model that fits the requirements.
  • Helped architect and develop a benchmark system to assess correctness and response speed.
  • Worked on enhancing the Hugging Face LLM model response time.
Technologies: Go, Retrieval-augmented Generation (RAG), Machine Learning, Chatbots, SQLAlchemy, PyTorch, Scikit-learn, Large Language Models (LLMs), Diffusion-based AI Models, Terraform

Lead Nexus Engineer

2022 - 2023
UBS - Main
  • Analyzed and enhanced the back-end performance using multiple methods to optimize the database indexing.
  • Helped the team start migrating the back end from Oracle to a PostgreSQL database.
  • Built an integration between the Java back end and the security keys repository.
Technologies: Java, Spring 5, Spring Boot, Puppet, Ansible, Rust, Go, GitLab CI/CD, C++, C, REST APIs, Java 8, NoSQL, PostgreSQL, Data Engineering, MySQL/MariaDB, Functional Programming, Git, gRPC, Fintech, PyTorch, Gradle, IBM InfoSphere Master Data Management (MDM), IBM InfoSphere Data Governance, IBM InfoSphere Streams, IBM InfoSphere (DataStage)

Expert Data Scientist/Engineer

2021 - 2023
Endeavor - Main
  • Architected a CDP data warehouse on Snowflake using the Data Vault v2 methodology.
  • Built ETLs to manage the data flow from/to the data warehouse using Prefect 2.0.
  • Implemented data governance and privacy controls to protect PII data and integrated them with ChatGPT from OpenAI.
  • Created dashboards using the Streamlit library in Snowflake.
  • Built and deployed ML data models using the Snowpark library and ChatGPT from OpenAI and Hugging Face.
  • Worked on multiple AI diffusion models to generate training data for our LLM model.
Technologies: SQL, Java, Snowflake, Hadoop, Pandas, Python 3, Data Build Tool (dbt), GitHub API, ETL, Phoenix, Agile Software Development, Data Warehouse Design, Data Analytics, REST APIs, NoSQL, Big Data Architecture, PostgreSQL, Big Data, ETL Development, Data Engineering, MySQL/MariaDB, Functional Programming, Git, GraphQL, OpenAI, ChatGPT, Retrieval-augmented Generation (RAG), Salesforce Apex, Elixir Oban, Elixir Broadway, SQLAlchemy, FastAPI, gRPC, PyTorch, Scikit-learn, Diffusion-based AI Models, ClickHouse, Distributed Software, Terraform, Geospatial Data, Geospatial Analytics, Asyncio, Python Asyncio, IBM InfoSphere Master Data Management (MDM), IBM InfoSphere Data Governance, IBM InfoSphere Streams, IBM InfoSphere (DataStage)

Go Developer

2022 - 2022
INSTRUXI LIMITED
  • Developed an integration layer between Go and the Ethereum contract.
  • Integrated with Redis service for caching and speed optimization.
  • Developed a Helm chart to enhance the deployment of Casbin-Mesh.
Technologies: Go, Kubernetes, Python, Redis, JSON Web Tokens (JWT), Helm, Web3.js, Blockchain, Casbin-Mesh, Full-stack, REST APIs, PostgreSQL, MySQL/MariaDB, Functional Programming, Serverless Architecture, Git, GraphQL, Distributed Software, Terraform

Rust Smart Contract Developer

2022 - 2022
PHI VENTURES Limited
  • Added features to the Rust smart contract and some tests for them.
  • Built the iOS/Android mobile apps for crypto trading, including the wallet and the integration with the notification system.
  • Rearchitected the front-end React application to use the back end to access the contract features.
  • Built the back end and a deployment pipeline using GitHub Actions.
Technologies: Rust, JavaScript, TypeScript, Solana, Node.js, Express.js, Web3.js, REST APIs, NoSQL, PostgreSQL, MySQL/MariaDB, Functional Programming, Serverless Architecture, Git, Zero-knowledge Proofs, Fintech, iOS, Swift, Android, Kotlin, Terraform

Elasticsearch Expert

2021 - 2021
Valence
  • Integrated Django with Elasticsearch. Built highly customizable indexes.
  • Monitored the solution and helped the team deploy it.
  • Analyzed the search results and adjusted the indexing parameters to update the priorities.
Technologies: Elasticsearch, Python 3, Django, Full-stack, Flask, Data Warehouse Design, REST APIs, NoSQL, PostgreSQL, Data Engineering, MySQL/MariaDB, Functional Programming, Apache Solr, Serverless Architecture, Git, GraphQL, Salesforce Apex, SQLAlchemy, FastAPI

Senior eCommerce Integration Engineer

2020 - 2021
Deliverr
  • Worked on a serverless project to develop a REST API that integrates with different store management APIs.
  • Developed REST services to integrate with BigCommerce, Google Shopping, and Facebook Shop.
  • Used OAuth 2.0 and OIDC protocols to protect the API and manage the authentication and authorization with another eCommerce API.
Technologies: Serverless, AWS Lambda, Facebook API, BigCommerce API, TypeScript, Node.js, Full-stack, Data Warehouse Design, REST APIs, PostgreSQL, MySQL/MariaDB, Functional Programming, Serverless Architecture, Git, GraphQL, Geospatial Data, Geospatial Analytics

Senior Software Developer

2019 - 2020
ReactiveCore
  • Architected and developed a solution to implement a worker thread pattern integrated with Apache Kafka.
  • Rearchitected the microservices components to implement the service mesh (Istio) in Kubernetes over the Amazon EKS cluster.
  • Built a real-time streaming solution on AWS using Kinesis and Glue. Developed a data model.
  • Implemented Apache Spark streaming with Apache Kafka to support the data model.
  • Integrated Apache Druid with Apache Kafka to implement time-series analytics on the Amazon EKS cluster.
  • Worked on architecting the iOS/Android mobile apps that give clients access to their health insurance policies.
  • Worked on a POC and built a solution to ingest from Kafka based on Rust concurrency.
  • Used Elixir, Phoenix, and Erlang to build a document parser integrated with a triplestore database to create a knowledge graph.
Technologies: Amazon Kinesis, React, Node.js, Kubernetes, Spark, Apache Kafka, AWS Glue, Amazon EKS, Go, Akka, Scala, Rust, Full-stack, Service-oriented Architecture (SOA), Elixir, Erlang (OTP), Phoenix, REST APIs, Java 8, NoSQL, PySpark, PostgreSQL, Big Data, ETL Development, Data Engineering, MySQL/MariaDB, Service Meshes, ETL Implementation & Design, Functional Programming, Graph Databases, Git, Salesforce Apex, Insurance Technology (Insurtech), Elixir Oban, Elixir Broadway, Gradle, Swift, iOS, Android, Kotlin, Geospatial Data, Geospatial Analytics

Senior Data Engineer

2019 - 2019
BCG GAMMA
  • Developed and optimized the Apache Airflow graph solution to support the data model.
  • Packaged the solution using Docker and Docker Compose.
  • Built a workflow in CircleCI to automate the building, packaging, and deploying processes.
  • Developed a big data ETL solution on Azure based on Data Factory and integrated with Data Lake Analytics.
Technologies: Docker Compose, Docker, Apache Airflow, Python, Data Warehouse Design, Data Analytics, REST APIs, PostgreSQL, Big Data, Oracle, ETL Development, Data Engineering, MySQL/MariaDB, ETL Implementation & Design, Functional Programming, Git, SQLAlchemy, FastAPI, Asyncio, Python Asyncio

Senior Solution Architect

2019 - 2019
CN Railway
  • Built real-time and batch processing ETL solutions based on Data Factory.
  • Architected and developed an event-driven application with Node.js, React, and GraphQL on serverless technology.
  • Built and architected an integration solution using Apache Kafka on the AWS EKS cluster with Azure Power BI.
  • Designed a solution based on React and Node.js and deployed the solution using Docker and Kubernetes.
  • Built a POC for an iOS/Android mobile application for train tracking.
  • Developed and architected a solution to stream the data into the data warehouse on Azure.
  • Engineered interactive dashboards using Power BI with a data warehouse.
Technologies: Kubernetes, Node.js, React, Apache Spark, Apache Kafka, Amazon Elastic MapReduce (EMR), Python, Scala, Java, Solution Architecture, Data Warehousing, Full-stack, Data Vaults, Data Analytics, REST APIs, Java 8, PostgreSQL, Big Data, Oracle, ETL Development, MySQL/MariaDB, ETL Implementation & Design, Functional Programming, Git, Embedded Rust, Gradle, Kotlin, Swift, iOS, Android, Geospatial Data, Geospatial Analytics

Big Data Architect

2016 - 2019
Morgan Stanley
  • Architected and developed a big data ETL solution based on AWS EMR and AWS Kinesis.
  • Built a knowledge graph using a triple store database and Elasticsearch.
  • Created a data streaming solution using S3, Lambda functions, AWS Kinesis data streaming, and AWS EMR service.
  • Helped develop an NLP solution to enhance and enrich features in the market data reports as a data analyst.
  • Built an interactive data UI based on React and React D3 integrated with Azure Power BI.
  • Developed a search engine for market data integrated with Elasticsearch and Apache Solr.
  • Worked on a POC to compare the latency between C++ and Go.
  • Built a highly concurrent and fault-tolerant back-end engine that handles over one million requests using Rust.
  • Worked on Nexus lifecycle and repos to securely manage the open source libraries.
  • Built a document parser—using Elixir, Phoenix, and Erlang—integrated with a triplestore database to create a knowledge graph.
Technologies: Amazon Elastic MapReduce (EMR), Machine Learning, Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), React, Amazon Kinesis, Node.js, Python, Go, Java, Scala, Rust, Networks, Elixir, Erlang (OTP), Phoenix, Data Analytics, REST APIs, PySpark, Big Data Architecture, PostgreSQL, Big Data, Oracle, ETL Development, Data Engineering, MySQL/MariaDB, ETL Implementation & Design, Functional Programming, Apache Solr, Graph Databases, Git, gRPC, Zero-knowledge Proofs, Fintech, PyTorch, Gradle, Asyncio, Python Asyncio

Big Data Developer

2016 - 2016
TD Bank
  • Developed solutions for three big data ETL projects using Apache Hive and Oozie.
  • Worked as part of the team on extending Hadoop security functionality.
  • Architected and developed a multi-threaded Java application to integrate Linux local accounts with Active Directory; also handled business requirements analysis and sketched the technical solution.
  • Received the “Act for Impact Award” for my work on a security project.
Technologies: Spark, Hadoop, Go, Scala, Java, REST APIs, Java 8, PySpark, Big Data Architecture, PostgreSQL, Big Data, Oracle, ETL Development, Data Engineering, MySQL/MariaDB, ETL Implementation & Design, Functional Programming, Graph Databases, Git, Fintech

Act for Impact Award

I had the privilege of being recognized for my outstanding work on a security project, where I dedicated myself to enhancing the level of protection on each machine. Additionally, I diligently closed numerous unused ports on a daily basis.

Market Adviser Chatbot

A Python-based project to build a chatbot that answers clients' questions about the securities market. The project ingests market reports into a knowledge graph according to a set of ontologies and uses the SPARQL query language to query that graph.

Marketing Integration Solution

This is a project based on Node.js, TypeScript, and Serverless on AWS to build an integration solution for online marketplaces like BigCommerce, Google Shopping, Facebook Commerce, and Amazon Marketplace.

Education

1994 - 1996

Diploma in Computer Systems Analyst

Accad Institute - Aleppo, Syria

1987 - 1990

Progress Toward a Bachelor's Degree in Mathematics and Physics

Aleppo University - Aleppo, Syria

Certifications

OCTOBER 2011 - DECEMBER 2014

Manager of Quality/Organizational Excellence

American Society for Quality

JUNE 2011 - JUNE 2014

Quality Engineer

American Society for Quality

MARCH 2011 - JUNE 2014

HACCP Auditor

American Society for Quality

JUNE 2010 - JUNE 2013

Six Sigma Black Belt

American Society for Quality

DECEMBER 2009 - DECEMBER 2012

Quality Auditor

American Society for Quality

SEPTEMBER 2008 - SEPTEMBER 2017

Project Management Professional

Project Management Institute

OCTOBER 2007 - PRESENT

Sun Certified Business Component Developer

Sun Certification

JUNE 2005 - PRESENT

System Administrator/Messaging

Microsoft

APRIL 2004 - PRESENT

Sun Certified Web Component Developer

Sun Certification

FEBRUARY 2004 - PRESENT

Level I Certification

Linux Professional Institute

JANUARY 2004 - JANUARY 2014

Microsoft Certified Trainer

Microsoft

JUNE 2003 - PRESENT

System Administrator

Microsoft

MARCH 2003 - PRESENT

Sun Certified Developer

Sun Certification

DECEMBER 2002 - PRESENT

Oracle 9i Database Administrator Certified Professional

Oracle

NOVEMBER 2002 - PRESENT

Oracle 9i Database Administrator Certified Associate

Oracle

MARCH 2002 - MARCH 2004

Cisco Certified Design Associate - CCDA

Cisco Career Certification

DECEMBER 2001 - DECEMBER 2003

Cisco Certified Network Associate - CCNA

Cisco Career Certification

AUGUST 2001 - PRESENT

Sun Certified Programmer

Sun Certification

APRIL 2001 - PRESENT

Professional System Engineer

Microsoft

NOVEMBER 2000 - PRESENT

Network+

CompTIA Certification Program

NOVEMBER 2000 - PRESENT

A+

CompTIA Certification Program

Libraries/APIs

Pandas, BigCommerce API, Facebook API, React, REST APIs, PySpark, Node.js, PyTorch, Elixir Broadway, SQLAlchemy, FFmpeg, WebRTC, Asyncio, Python Asyncio, GraphX, Web3.js, Natural Language Toolkit (NLTK), SpaCy, Segment.io, Snowpark, GitHub API, Scikit-learn

Tools

Apache Solr, AWS Glue, Docker Compose, Spark SQL, Git, Apache Airflow, Kafka Streams, GitHub, Absinthe, Amazon Elastic Container Service (ECS), Gradle, Celery, MQTT, Terraform, Amazon Elastic MapReduce (EMR), Amazon EKS, Amazon Cognito, MetaMask, Puppet, Ansible, IBM InfoSphere Master Data Management (MDM), IBM InfoSphere (DataStage), Helm, Prefect, GitLab CI/CD, ChatGPT

Languages

Snowflake, Java, Python, Scala, Python 3, Java 8, JavaScript, TypeScript, Rust, SQL, Go, Swift, Elixir, Erlang (OTP), GraphQL, Kotlin, Zig, C++20, Gremlin, Assembly, Embedded C, SPARQL, C++, C, Erlang

Frameworks

Serverless Framework, Spark, Apache Spark, Express.js, Next.js, Spring Boot, Flask, Django, React Native, gRPC, GStreamer, SwiftUI, Akka, JSON Web Tokens (JWT), Spring 5, Hadoop, Phoenix

Paradigms

Serverless Architecture, ETL, ETL Implementation & Design, Functional Programming, Service-oriented Architecture (SOA), Unit Testing, Agile Software Development, Six Sigma

Platforms

AWS Lambda, Oracle, Linux, Apache Kafka, Docker, Kubernetes, Salesforce, iOS, Android, Amazon Web Services (AWS), Azure, Blockchain, Shopify, IBM InfoSphere Streams, Stardog, Databricks

Storage

Amazon DynamoDB, Elasticsearch, PostgreSQL, MarkLogic, NoSQL, Graph Databases, MySQL/MariaDB, MySQL, MongoDB, ClickHouse, Redis, Data Lakes, Amazon S3 (AWS S3)

Other

Data Analysis, Common Data Service, Data Warehousing, Serverless, ETL Development, Big Data Architecture, Big Data, Natural Language Processing (NLP), Data Analytics, Data Vaults, Data Warehouse Design, Data Engineering, Data Architecture, Software Architecture, Service Meshes, Architecture, Solution Architecture, Algorithms, Fintech, Quantum Computing, Linear Algebra, Generative Pre-trained Transformers (GPT), Networks, Full-stack, LangChain, OpenAI GPT-3 API, OpenAI GPT-4 API, Artificial Intelligence (AI), APIs, Team Leadership, Remote Team Leadership, Chatbots, Data Migration, Segment, Data Management, CI/CD Pipelines, Retrieval-augmented Generation (RAG), Salesforce Apex, Insurance Technology (Insurtech), Embedded Rust, Elixir Oban, FastAPI, Zero-knowledge Proofs, Data Structures, Debugging, Biotechnology, Back-end, Video Streaming, Real-time Data, Time Series, Time Series Analysis, Trading, Low Latency, RTCP, RTP, Data, Optimization, SDKs, Task Scheduling, Distributed Software, Internet of Things (IoT), Software Engineering, Mapping, Amazon Kinesis, Machine Learning, Solana, Cryptocurrency, Non-fungible Tokens (NFT), Smart Contracts, Data Build Tool (dbt), Tokenomics, Tokenization, Token Design, Qiskit Runtime, Q#, Embedded Systems, Firmware, Microcontrollers, ARM Embedded, ARM SoC, Geospatial Data, Geospatial Analytics, IBM InfoSphere Data Governance, Natural Language Understanding (NLU), Ontologies, Casbin-Mesh, Business Systems Analysis, Google BigQuery, Training, Networking, Quality Auditing, Statistics, Data Quality, Quality Control (QC), Data Quality Management, OpenAI, Amazon EMR Studio, Large Language Models (LLMs), Diffusion-based AI Models, Deep Reinforcement Learning
