Ahmet Hamza Emra, Developer in Edmonton, AB, Canada
Ahmet is available for hire
Hire Ahmet

Ahmet Hamza Emra

Verified Expert  in Engineering

Machine Learning Engineer and Software Developer

Location
Edmonton, AB, Canada
Toptal Member Since
July 10, 2020

Ahmet is a skilled machine learning engineer with expertise in deep learning and reinforcement learning. He has worked on projects in academia and industry in a variety of team and client settings. One of his most exciting projects was in collaboration with NASA. Ahmet is currently working on research in advanced reinforcement learning.

Portfolio

Renegade Publishing Inc.
Data Science, Python, PostgreSQL, Machine Learning, JavaScript, ETL...
US Dream Academy
Chatbots, LangChain, Natural Language Processing (NLP), OpenAI...
Digital Vibes Inc.
Python, Pinecone, Machine Learning, OpenAI, Open-source LLMs, OpenAI GPT-3 API...

Experience

Availability

Full-time

Preferred Environment

Reinforcement Learning, Machine Learning, Artificial Intelligence (AI)

The most amazing...

...product I've developed was an analytical tool to help astronauts maintain a healthy environment in the International Space Station.

Work Experience

Data Scientist

2024 - PRESENT
Renegade Publishing Inc.
  • Built an ETL pipeline to detect the continuity of royalty payments.
  • Integrated Apache Airflow to the current system and hosted it on AWS.
  • Worked to predict royalty payments and leakage to improve reporting and analysis pages.
Technologies: Data Science, Python, PostgreSQL, Machine Learning, JavaScript, ETL, Apache Airflow

LangChain Developer

2024 - PRESENT
US Dream Academy
  • Developed a chatbot with an RAG system to ensure answer quality and confidence.
  • Implemented a document ranking model to help document retrieval.
  • Developed and deployed the chatbot UI to Azure web services.
Technologies: Chatbots, LangChain, Natural Language Processing (NLP), OpenAI, Machine Learning, OpenAI GPT-3 API, OpenAI GPT-4 API, Azure, Azure Websites, Continuous Integration (CI), Continuous Development (CD)

ML Developer

2024 - PRESENT
Digital Vibes Inc.
  • Developed a custom chatbot that utilizes a custom GPT and is supported with a new Flask API to retrieve related data.
  • Improved the RAG system by using open-source LLM to convert tabular data into a better format for GPT-4 to be ingested.
  • Led a team of developers and mentored them on system architecture.
Technologies: Python, Pinecone, Machine Learning, OpenAI, Open-source LLMs, OpenAI GPT-3 API, OpenAI GPT-4 API, Generative Pre-trained Transformers (GPT), Large Language Models (LLMs)

Lead AI Developer

2023 - PRESENT
Proxa
  • Developed and implemented AI models to enhance data management and reporting functionalities, contributing to the advancement of an AI-enhanced platform.
  • Utilized extensive experience with Microsoft Azure and demonstrated expertise in leveraging and creating generative AI applications to support clients' needs using LLMs and LangChain technologies.
  • Architected the data hub and formulated contextualized models based on data, optimizing the system's capabilities and performance.
Technologies: Artificial Intelligence (AI), Software Architecture, Machine Learning, Azure, ChatGPT, Generative Pre-trained Transformers (GPT), OpenAI GPT-3 API, OpenAI GPT-4 API, LangChain, Large Language Models (LLMs), Llama 2, Pinecone, Natural Language Processing (NLP), Chatbots, Live Chat, APIs, API Integration, OpenAI, Generative Pre-trained Transformer 3 (GPT-3)

Machine Learning Expert

2023 - 2023
The Listening App Inc.
  • Implemented a TensorRT version of Bark, an open source text-to-audio model, enhancing the model's speed by 35%.
  • Introduced Proximal Policy Optimization (PPO) to fine-tune the Bark model using RLHF.
  • Came up with a reward signal for reinforcement learning training.
Technologies: Machine Learning, Neural Networks, Python, Deep Learning, PyTorch, Text to Speech (TTS), Artificial Intelligence (AI), Deep Neural Networks, Generative Pre-trained Transformers (GPT), ChatGPT, OpenAI GPT-3 API, Generative Pre-trained Transformer 3 (GPT-3), Bark, Software, Open Neural Network Exchange (ONNX), Google Speech-to-Text API, Speech to Text, Text2Speech, GPU Computing

Data Scientist and Consultant

2019 - 2023
MovieUnion
  • Trained a recommendation system algorithm from scratch​ using TensorFlow​.
  • Designed databases, stored procedures, reports, and data input interfaces using SQL.
  • Deployed the machine learning​ model to SageMaker.
Technologies: Amazon SageMaker, SQL, TensorFlow, Keras, Deep Learning, Neural Networks, Recommendation Systems, Artificial Intelligence (AI), Statistical Process Control Tools, Azure, Technical Consulting, Google AI Platform, Google Cloud Machine Learning, CSV

Data Scientist

2021 - 2022
Pfizer,
  • Developed models to estimate the drug production and inspection timeline. Improved the error rate from nine days to two days on average.
  • Designed and built ETL pipelines with Python, PostgreSQL, and Apache Airflow.
  • Built the new version of a legacy system used to store, retrieve, and update data.
Technologies: Python, Data Science, SQL, Machine Learning, Deep Learning, ETL, Data Engineering, Redshift, Forecasting, Data Pipelines, Amazon Machine Learning, Statistics, Statistical Learning, Models, Pharmaceuticals, Azure, Amazon S3 (AWS S3), Amazon Web Services (AWS), Amazon SageMaker, AWS Lambda

Data Scientist

2021 - 2021
InkSpace Analytics - Freelance
  • Designed process-oriented workflow and ELT processes for loading data into the data warehouse.
  • Trained a predictive model on loan approval and amount using TensorFlow.
  • Worked closely with project managers, business analysts, source owners, and the data quality team to ensure timely and accurate delivery of business requirements.
  • Managed the performance tuning of SQL queries.
Technologies: ETL, Python, KNIME, Tableau, SQL, PostgreSQL, MySQL, Machine Learning, Deep Learning, Python 3

Artificial Intelligence Engineer

2020 - 2020
VUCA INC
  • Developed a deep reinforcement learning agent to solve design problems, using PyTorch​.
  • Built the web interface of a reinforcement learning environment for training and evaluation.
  • Integrated gaze point detection for model training.
Technologies: Docker, Flask, Deep Reinforcement Learning, PyTorch, Data Wrangling, MVP Design

Research Assistant

2019 - 2020
University of Alberta
  • Researched the application of reinforcement learning algorithms in​ a supervised setting.
  • Developed deep Q-cogent networks, a new deep reinforcement learning (DRL) algorithm that improved learning speed by 50%.
  • Performed data reporting and data reconciliation processes to support the principal project investigators.
  • Initiated the publication of an academic paper on DRL.
Technologies: Artificial Intelligence (AI), Reinforcement Learning, TensorFlow Deep Learning Library (TFLearn), Data Mining, Deep Reinforcement Learning, PyTorch, TensorFlow, Data Wrangling

Data Analyst

2018 - 2019
Harmony Public Schools
  • Developed a tool to analyze students' and teachers' interactions​ to improve the success of both parties.
  • Provided complex research and analysis to support effective teaching and presented findings to management or project leaders.
  • Created weekly and per-semester reports to provide advanced insights and suggestions to the ​administration.
Technologies: Artificial Intelligence (AI), Apache Spark, Spark SQL, Data Mining, Amazon Web Services (AWS), Flask, Pandas, Python

Undergraduate Research Assistant

2015 - 2018
North American University
  • Researched a data-driven approach to bacteria development​ with Dr. Zubeyir Altintas.
  • Taught multiple​ classes including Python and deep learning.
  • Expanded breadth and depth of knowledge and skills across disciplines by working with specific fields of computer science and industry professionals.
Technologies: Data Mining, Pandas, NumPy, Keras, Python, Data Wrangling

CO2 Analysis on the International Space Station (ISS)

https://drive.google.com/file/d/1lrRF9z63MbMlJMOyTDxOsCxewMSF4qC5/view
A detection system for CO2 anomalies in the ISS. I co-developed this product in the 2018 Wearable Technologies Workshop under the auspices of NASA and the North American University. I focused on analyzing the causes of CO2 clouds and detecting them from IoT devices. We also created a user interface to gather more data for further research​ on the subject.

MLRpro

https://github.com/MichaelMMeskhi/MLRP
A system that uses supervised learning algorithms to classify resumes of top software engineers in various companies and label their skills accordingly. When a user uploads their resume, it is tested against the predicted hypothesis by the ML algorithms, and the system returns what tier(s) the user belongs to.

Intelegent_Lock

https://github.com/AhmetHamzaEmra/Intelegent_Lock
Open-source lock software with facial recognition and liveness detection. It uses a landmark model for face recognition and a 3D convolution network to predict the liveness of the camera input. It is spoofing-safe and open-source to ensure​ the ​safety of the user.

Generating Logos with Diffusion Models

This project aims to automate the logo creation process using generative AI models and web-scraped data. The proposed approach is based on diffusion models and attention blocks implemented in PyTorch. The system takes user-defined parameters and generates diverse and high-quality logos that meet the constraints. The proposed logo generation system offers advantages over traditional logo design, including speed, cost-effectiveness, and the ability to generate a large number of logo variations. The project demonstrates the potential of AI in automating creative tasks and providing value to businesses and individuals.

Clone a Friend

In this project, we leveraged the capabilities of a large language model (LLM) to create a personalized AI that simulates the conversational style of a specific individual. To achieve this, we meticulously curated a dataset of our chat history with the individual in question. This dataset served as the training ground for the LLM, enabling it to grasp and reproduce the unique linguistic nuances and patterns characteristic of our friend's communication style. The finetuning process was facilitated by Lora, a cutting-edge technique known for its efficiency in adapting LLMs to specific tasks or styles with minimal data and computational resources. The result is an AI companion that mirrors our friend's conversational idiosyncrasies, offering a new dimension in digital communication that is both personalized and engaging. Although It was successful in mimicking, it also revealed too much personal content, so this project was terminated.
2014 - 2018

Bachelor's Degree in Computer Science

North American University - Houston, TX, USA

FEBRUARY 2018 - PRESENT

Deep Learning Specialization

Coursera

NOVEMBER 2017 - PRESENT

Building and Deploying Deep Learning Applications with TensorFlow

Linkedin Learning

NOVEMBER 2017 - PRESENT

OpenCV for Python Developers

Linkedin Learning

AUGUST 2017 - PRESENT

Machine Learning Nanodegree

Udacity

APRIL 2017 - PRESENT

Data Analysis Nanodegree

Udacity

Libraries/APIs

Pandas, NumPy, Scikit-learn, TensorFlow Deep Learning Library (TFLearn), TensorFlow, Keras, PyTorch, PySpark, D3.js, OpenCV, Spark ML, Natural Language Toolkit (NLTK), Google Speech-to-Text API

Tools

Jupyter, ChatGPT, Terminal, Tableau, Slack, Spark SQL, Amazon SageMaker, Google AI Platform, Open Neural Network Exchange (ONNX), Text2Speech, Apache Airflow

Languages

Python, Python 2, Python 3, SQL, HTML, CSS, R, JavaScript

Paradigms

Data Science, ETL, Continuous Integration (CI), Continuous Development (CD)

Platforms

Windows, Debian, Linux, Amazon Web Services (AWS), Azure, AWS Lambda, MacOS, Docker, KNIME

Storage

Azure SQL, Amazon S3 (AWS S3), PostgreSQL, MySQL, Redshift, Data Pipelines, Azure Websites

Frameworks

Apache Spark, Flask, Django, Spark

Other

Deep Learning, Machine Learning, Reinforcement Learning, Data Virtualization, Deep Reinforcement Learning, Neural Networks, Deep Neural Networks, Data Wrangling, Data Cleaning, Data Mining, Artificial Intelligence (AI), OCR, Statistical Analysis, Predictive Analytics, Data Engineering, Web Scraping, Natural Language Processing (NLP), Computer Vision, Supervised Learning, Generative Pre-trained Transformers (GPT), Classifier Development, Data Analysis, Image Recognition, Object Detection, Image Processing, Recommendation Systems, PyPDF2, Text Processing, Liveness Detection, Anti-spoofing, Video Processing, Convolutional Neural Networks (CNN), Facial Recognition, Forecasting, Amazon Machine Learning, Statistics, Statistical Learning, Models, Pharmaceuticals, Tracking, Statistical Process Control Tools, Diffusion Models, Stable Diffusion, Generative Models, Technical Consulting, Google Cloud Machine Learning, CSV, MVP Design, Text to Speech (TTS), OpenAI GPT-3 API, Generative Pre-trained Transformer 3 (GPT-3), Bark, Software, Software Architecture, OpenAI GPT-4 API, LangChain, Large Language Models (LLMs), Llama 2, Pinecone, Chatbots, Live Chat, APIs, API Integration, OpenAI, DQN, Actor-critic Methods (A2C, A3C), Games, Speech to Text, Open-source LLMs, LoRa, Hugging Face, LLama7B, GPU Computing

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring