
Burak Uyar
Verified Expert in Engineering
Data Developer
Istanbul, Turkey
Toptal member since October 17, 2024
Burak is a full-stack data developer with extensive experience as a data team manager, senior data engineer, senior data analyst, and data scientist. He is also well-versed in business stakeholder communication, requirements analysis, task prioritization and allocation, and cost and budget optimization for technical tools and services.
Experience
- Python - 12 years
- SQL - 10 years
- Data Pipelines - 9 years
- APIs - 9 years
- Databases - 9 years
- Tableau - 6 years
- Amazon Web Services (AWS) - 4 years
- Google Cloud Platform (GCP) - 3 years
Preferred Environment
Python, Google Cloud Platform (GCP), Amazon Web Services (AWS), SQL, CI/CD Pipelines, Dashboards, ETL, Git, Data Pipelines, ClickHouse
The most amazing...
...projects I've contributed to involved designing, building, and transforming data infrastructures into scalable and easy-to-maintain systems.
Work Experience
Tableau Developer
Bright on Analytics Ltd
- Developed a BI solution for visualizing the sales-related data of the client's competitors, enabling the client to gain insights and achieve a higher ROI.
- Collected the requirements and arranged them according to their priorities and dependencies. At that point, I noticed a potential gap between the current and ideal tech stack and prepared a demo to present to the client.
- Created and presented solution options to the client, collaborated to finalize the optimal approach, and split the project into two phases for Agile delivery.
- Delivered the initial version of the Tableau dashboard on Tableau Cloud in the first phase to confirm that the interactive dashboards would enable the client to improve their ROI.
- Integrated ClickHouse as an OLAP layer between PostgreSQL and Tableau in the second phase so that the project aligned more closely with best practices and all data became queryable with very short processing times.
Senior Data Engineer
Toptal
- Designed and implemented data pipelines from both SQL and NoSQL sources, as well as GCS and GBQ, using Airflow (Cloud Composer) and Luigi.
- Developed a microservice for improved data quality checks. The app aimed to prevent database-breaking changes from the source microservices to the data warehouse.
- Contributed to developing a framework for Google Cloud Composer. The main aim of the framework was to define the constraints of the data ingestion and loading processes for efficient development and maintenance.
- Collaborated with multiple business stakeholders on several critical business projects. These projects involved financial and other sensitive data.
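The data quality microservice mentioned above is not described in detail, so the following is only a minimal sketch of one way such a check could work, assuming a schema can be represented as a mapping from column name to type name. The names `Schema` and `find_breaking_changes` are hypothetical and are not taken from the actual service.

```python
from typing import Dict, List

# Hypothetical schema representation: column name -> type name.
Schema = Dict[str, str]


def find_breaking_changes(current: Schema, proposed: Schema) -> List[str]:
    """Return the changes in `proposed` that would break consumers of `current`.

    Assumption for this sketch: removing a column or changing its type is
    breaking, while adding a new column is not.
    """
    problems: List[str] = []
    for column, col_type in current.items():
        if column not in proposed:
            problems.append(f"column removed: {column}")
        elif proposed[column] != col_type:
            problems.append(
                f"type changed: {column} {col_type} -> {proposed[column]}"
            )
    return problems


if __name__ == "__main__":
    warehouse = {"id": "bigint", "amount": "numeric", "created_at": "timestamp"}
    source = {"id": "bigint", "amount": "text", "note": "text"}
    for problem in find_breaking_changes(warehouse, source):
        print(problem)
```

A check like this could run before a source service deploys a migration, blocking the release when the returned list is not empty.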
Chief Information Officer
Scoutium
- Led R&D processes applying AI approaches to first-party data, mathematically modeling the proprietary data and building data products, including crowdsourcing design and data product delivery.
- Implemented BI processes and designed a self-service data analytics approach and infrastructure.
- Designed and implemented data integration processes containing data from internal sources, including relational and NoSQL sources. Data was additionally ingested from multiple APIs and public websites using scraping.
Data Engineering Manager
GroupM
- Designed, implemented, and analyzed custom attribution models for various clients, including customer journeys in online and offline touchpoints.
- Provided automation solutions for reporting processes using Python, SQL, Tableau, and Datorama.
- Leveraged data providers' APIs for data integration and data model creation.
- Delivered technical consultancy for creative solutions based on multiplatform advertising projects.
Music Information Retrieval (MIR) Researcher
CompMusic
- Defined computational research problems on the Turkish makam music corpus, which is publicly available at CompMusic.
- Created a desktop application for self-tutored rhythm ("usul") training.
- Published multiple papers at international conferences, available on Google Scholar.
Experience
Microservice for Improved Data Quality Checks
End-to-end DWH and BI Solution
The company had several data sources, all of which needed to serve different purposes, including self-service analytics, training and testing ML models, and feeding data back into the application database.
The sources included PostgreSQL, MongoDB, APIs, web-scraping data, spreadsheets, and flat files.
The solution included gathering the brief from the internal teams, designing the structure, planning the implementation phases, preparing the individual tasks, and allocating resources from our data team.
The solution mainly involved Python, SQL, NoSQL, Crontab, AWS S3, AWS Glue, AWS Athena, AWS IAM, Tableau, and Tableau Bridge.
End-to-end Business Intelligence Solution
The client had their data in a PostgreSQL database and wanted to use Tableau for self-service analytics to improve the efficiency of their investments and hence their ROI.
The project was split into two phases. The first part was to create a PoC without an OLAP, using smaller data, and deliver the first version on Tableau Cloud. The second part was to improve the solution using a proper OLAP, data pipeline, and ETL setup.
The first part was delivered quickly using PostgreSQL directly as the source, Tableau extracts, and Tableau Cloud.
For the second part, ClickHouse was selected as the OLAP. The implementation included using the MaterializedPostgreSQL database engine and replicating the relevant source tables from Postgres to ClickHouse. On top of the raw replicated tables, materialized views were built to create analytics layers. That approach allowed easy-to-manage infrastructure and lower maintenance effort for the client at later steps. With that implementation, near real-time data flow was provided into ClickHouse, hence into Tableau Cloud, via a Tableau Bridge instance running on Linux.
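As a rough illustration of the second phase described above, the sketch below builds the two kinds of ClickHouse DDL statements involved: a database using the MaterializedPostgreSQL engine with a table list, and a materialized view for an analytics layer. All host names, credentials, and database, table, and view names are hypothetical placeholders, and the helper functions are assumptions made for this example rather than the project's actual code. The engine has been marked experimental in ClickHouse, so it may need to be enabled explicitly before these statements would run.

```python
from typing import Sequence


def replication_database_ddl(
    ch_database: str,
    pg_host: str,
    pg_port: int,
    pg_database: str,
    pg_user: str,
    pg_password: str,
    tables: Sequence[str],
) -> str:
    """Build DDL for a ClickHouse database that replicates selected Postgres tables."""
    table_list = ",".join(tables)
    return (
        f"CREATE DATABASE {ch_database}\n"
        f"ENGINE = MaterializedPostgreSQL("
        f"'{pg_host}:{pg_port}', '{pg_database}', '{pg_user}', '{pg_password}')\n"
        f"SETTINGS materialized_postgresql_tables_list = '{table_list}'"
    )


def analytics_view_ddl(view: str, target: str, select_sql: str) -> str:
    """Build DDL for a materialized view that writes into an existing target table."""
    return f"CREATE MATERIALIZED VIEW {view} TO {target} AS\n{select_sql}"


if __name__ == "__main__":
    # Hypothetical connection details and table names, for illustration only.
    print(
        replication_database_ddl(
            "pg_replica", "pg.internal", 5432, "sales",
            "replicator", "secret", ["orders", "customers"],
        )
    )
    print(
        analytics_view_ddl(
            "analytics.daily_sales_mv",
            "analytics.daily_sales",
            "SELECT toDate(created_at) AS day, sum(amount) AS revenue\n"
            "FROM pg_replica.orders\n"
            "GROUP BY day",
        )
    )
```

Generating the statements from a list of table names keeps the replicated set in one place, which fits the low-maintenance goal described above. The statements only build strings here and would still need to be executed through a ClickHouse client.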
Web-scraping Data Pipeline on Cloud Services
Education
Master's Degree in Audio Technologies and Sound Computing
Bahcesehir University - Istanbul, Turkey
Bachelor's Degree in Computer Engineering
Bogazici University - Istanbul, Turkey
Skills
Libraries/APIs
Pandas, NumPy, PySpark, OpenAI API, Tableau API, Dask, Luigi
Tools
Apache Airflow, BigQuery, Tableau, Git, Tableau Desktop, Microsoft Excel, Google Sheets, AWS Glue, Postman, Microsoft Power BI, Tableau Embedded Analytics, Amazon QuickSight, GitHub, Google Cloud Composer, Amazon Athena, Cron, Datorama, Docker Compose, Jira, Confluence, Google Analytics, AWS IAM, Looker, Terraform, Trello, IPython Notebook, Google Compute Engine (GCE)
Languages
Python, SQL, XML, YAML, Snowflake, HTML, HTML5, JavaScript, Excel VBA, Python 3, Java
Frameworks
Spark, Apache Spark, Flask, Selenium
Paradigms
ETL, Business Intelligence (BI), Database Design, User Behavioral Analytics (UBA), Unit Testing, Agile Project Management, Agile, DevOps, OLAP
Platforms
Docker, Google Cloud Platform (GCP), Amazon Web Services (AWS), Kubernetes, AWS Lambda, Amazon EC2, Azure, Databricks, Firebase, MacOS, Linux, Apache Kafka, DigitalOcean, Jupyter Notebook
Storage
Data Pipelines, Databases, PostgreSQL, Amazon S3 (AWS S3), JSON, XML Schema, Data Lakes, Relational Databases, ClickHouse, Redshift, MongoDB, NoSQL, Google Cloud Storage, Google Cloud
Other
Data Structures, Data Engineering, APIs, Web Scraping, Data Visualization, Dashboards, Data Warehousing, Google BigQuery, API Integration, SFTP, Web Dashboards, Data Modeling, Dashboard Design, ETL Tools, Scraping, Business Analysis, Data Architecture, Data, Data Strategy, Excel Macros, Debugging, Reporting, Troubleshooting, Large Data Sets, Parquet, Database Optimization, Data Scraping, Data Analysis, Star Schema, Research Analysis, Excel Formulas, Excel 365, Tableau Server, Looker Studio, Amazon Redshift, Artificial Intelligence (AI), Azure Databricks, Google Cloud Functions, Fivetran, CI/CD Pipelines, Algorithms, Time Complexity Analysis, Software Engineering, Programming, Web Technologies, Signal Processing, Digital Signal Processing, Data Analytics, Data Collection, Audio Processing, Data Science, Machine Learning, Data Quality, Data Governance, Data Products, Cloud Storage, Advertising Technology (Adtech), Music Publishing, GitHub Actions, Metadata, Computer Engineering, Music Information Retrieval (MIR), Reports, Team Management, Stakeholder Management, IT Project Management, Data Warehouse Design, Infrastructure as Code (IaC), GSM, Google Artifact Registry, Data Build Tool (dbt)