Mehmet Sahin
Verified Expert in Engineering
Data Engineer and Developer
Mehmet is a developer who works with all known RDBMS and NoSQL databases, the AWS and GCP cloud providers, and popular big data tools like Hadoop, Hive, Spark, PySpark, Kafka, and Elasticsearch. Drawing on a profound knowledge of ETL tools like Oracle GoldenGate and Informatica, he has developed several apps with Python, most of them related to ETL, where Mehmet especially shines. Recently, Mehmet's been fascinated by graph databases, PyTorch, Keras, and deep learning.
Preferred Environment
Linux, Informatica, SQL, Python, RDBMS, NoSQL, ETL, Amazon Web Services (AWS), Google Cloud Platform (GCP), Big Data
The most amazing...
...project I've done was an ETL system that processes 2 TB of data daily into the HBase graph format. Users can now easily see previously hidden relationships.
Work Experience
Senior Data Engineer
Sentium Consulting -- Roche Pharmaceuticals -- Genentech
- Created ETL pipelines using Dataproc clusters on the Google Cloud Platform. Created Spark jobs and manipulated data with PySpark, Python, and Pandas. Built reproducible, maintainable, and modular machine learning pipelines using the Kedro package.
- Collected terabytes of data from various sources like sensors, RDBMS, and flat files and loaded them to GCP BigQuery.
- Prepared the data that the data science team used with machine learning algorithms to gain different insights.
Data Engineer
Seguridad, Inc
- Developed AWS Lambda services to collect, transform, aggregate, clean, and store structured and unstructured pharmacy data using Python.
- Built the MySQL AWS Aurora database and loaded data with Python and the SQLAlchemy ORM.
- Loaded data to S3, built the Athena database, and created Athena SQL queries.
- Built a fully automated system to collect, aggregate, and store data using AWS services.
Project Manager | Developer
Turkish Department of Information Technologies
- Managed a big data platform that included Hadoop, Hive, HBase, Spark, Kafka, Solr, and Elasticsearch on 18 nodes with Cloudera.
- Created Spark jobs and manipulated data with PySpark and Spark Streaming.
- Developed graph databases, executed a migration project from RDBMS to a graph database, and developed apps with Gremlin.
- Processed 2 TB of data daily by converting it to a graph format, which let users easily identify previously unseen relationships.
- Prepared Python and Bash scripts for the transfer of external data. These were monitored and scheduled with Apache Airflow.
- Brought a new perspective to the data by running various graph algorithms, such as shortest path and PageRank.
ETL and Database Developer
Turkish Department of Information Technologies
- Gathered workflows and data from many different sources under a single ETL system.
- Built an ETL system that is more manageable, easier to monitor, and less error-prone.
- Created and maintained a reliable, always-on ETL system that processed terabytes of real-time data daily.
Business Intelligence Platform Developer
Turkish Department of Information Technologies
- Designed and built an ETL system using SSIS and MSSQL CDC.
- Quickly created reports using OLAP cubes prepared with SSAS.
- Enabled users to quickly access the reports they requested and, through ad hoc reporting, to create the reports they wanted. Reports were presented with SSRS and Power BI.
Experience
Property Graph Database System
As the ETL and database developer, I managed all clusters, created Spark jobs, and prepared data pipelines with GoldenGate, Kafka, and Airflow. I was also responsible for designing the graph structure and improving the performance of Gremlin queries.
Exadata Migration
Company ETL Project
Dynamic Machine Learning Pipeline
Education
Bachelor's Degree in Computer Engineering
University of Ankara - Ankara, Turkey
Bachelor's Degree in Law
Anadolu University - Ankara, Turkey
Bachelor's Degree in Security Sciences
Police Academy - Ankara, Turkey
Certifications
AWS Certified Database - Specialty (DBS)
Amazon Web Services
AWS Certified Cloud Practitioner
Amazon Web Services
Improving Deep Neural Networks: Hyperparameter Tuning, Regularization, and Optimization
Coursera
Neural Networks and Deep Learning by Deeplearning.ai
Coursera
Machine Learning by Stanford University
Coursera
Deep Learning A-Z™: Hands-on Artificial Neural Networks
Udemy
Cloudera Developer Training for Apache Spark and Hadoop
Cloudera, ExitCertified
SQL Tuning for Oracle
Oracle University
Oracle Database 12c: Introduction to SQL Ed 1.1
Oracle University
Oracle Database 12c Administration Workshop Ed 2
Infopark
Oracle Database 12c: Backup and Recovery Workshop
Infopark
Oracle Database 12c Install and Upgrade Workshop Ed 1
Infopark
Predictive Modeling, Segmentation and Relational Rules, Time Series, Sequential Events with SPSS
AIMS
Implementing a Data Warehouse with Microsoft SQL Server 2012
BNT Pro
Skills
Languages
Python, SQL, Gremlin, Bash, MDX
Tools
Oracle GoldenGate, GIS, PyCharm, Solr, Oracle Exadata, SSAS, Microsoft Power BI, Apache Airflow, Google Cloud Dataproc, Jira, Confluence, Amazon Simple Queue Service (SQS)
Paradigms
ETL, HIPAA Compliance
Storage
Data Pipelines, HBase, PostgreSQL, Elasticsearch, Microsoft SQL Server, SQL Stored Procedures, Apache Hive, Oracle RDBMS, Teradata, SQL Server Integration Services (SSIS), SQL Server Reporting Services (SSRS), MySQL, Amazon Aurora, RDBMS, NoSQL, Redshift, Amazon DynamoDB
Other
ETL Tools, Data Warehousing, Data Warehouse Design, Informatica, GRAPH, Data Engineering, Big Data, DAX, Cloud, Kedro, Google BigQuery, Amazon Neptune, Amazon RDS
Frameworks
Spark, Hadoop
Libraries/APIs
PySpark, PyTorch, Keras, Pandas
Platforms
Oracle, Apache Kafka, Linux, SharePoint, Google Cloud Platform (GCP), Amazon Web Services (AWS), AWS Lambda