
Dénes Bartha
Verified Expert in Engineering
Artificial Intelligence (AI) Developer
Singapore, Singapore
Toptal member since December 7, 2018
As a Ph.D. student of Computer Science, Dénes has worked as a researcher in Bioinformatics at The University of Tokyo and the National University of Singapore. He has also worked as a Software Engineer at Canadian Aviation Electronics (CAE). He greatly enjoys using machine learning techniques to solve real-world problems and help businesses.
Experience
- C++ - 10 years
- Python - 8 years
- Artificial Intelligence (AI) - 6 years
- Machine Learning - 6 years
- Scikit-learn - 3 years
- BigQuery - 3 years
- TensorFlow - 3 years
- Hardware Development - 3 years
Preferred Environment
Jupyter Notebook, PyCharm, Git, Ubuntu, Google Cloud Platform (GCP), Sensor Data
The most amazing...
...tool that I have made is a DNA data compression/assembler program called Colorgram. It is a Succinct Colored de Bruijn Graph variant.
Work Experience
Senior Data Scientist
Dkatalis
- Implemented a machine learning pipeline for transaction classification in GCP, orchestrated via Dataflow and Kubeflow. Conducted model training and hyperparameter tuning with Vertex AI AutoML and created custom models using Katib with LightGBM, BERT, and TensorFlow.
- Developed a machine learning pipeline to detect recurring transactions based on customer and transaction information. The model filled the feature "Plan Ahead" in the app's front end based on the detected recurring transactions.
- Integrated an in-house TensorFlow Lite YOLO model for automatically detecting IC cards. Added image quality checks to the front end and employed Google Vision Text Recognition for parsing content, applying NLP techniques for result cleansing.
- Participated in creating an API service that shows various insights to the users in the mobile app via Braze, orchestrated with Kafka, using Redis and PostgreSQL.
- Established data quality frameworks for our BigQuery database, implemented in Python using Great Expectations and orchestrated via Apache Airflow. Implemented a custom SQL unit testing framework.
Senior Data Scientist
Doctor Anywhere
- Helped the finance, operations, marketing, and business development teams, as well as doctors, create and automate reports in Python and MySQL, sending out daily emails automatically from AWS and GCP Linux virtual machines.
- Automated the integration of a third-party healthcare platform used by our clinics via its API in Python and JavaScript. The platform was missing some CMS functionality, e.g., setting low stock alerts and calculating cost prices automatically.
- Implemented a pipeline in Python for pulling data from various sources, including multiple MySQL servers, MongoDB, Microsoft SQL Server, and Firebase into BigQuery.
- Created long, flat tables and views in BigQuery using Standard SQL and integrated these with Tableau so that other teams could easily access and analyze the data independently.
- Created an ensemble Random Forest classifier in Python using scikit-learn to predict patient diagnoses based on symptoms, reducing doctors' logging time and filtering unsuitable cases.
- Estimated patients' claim prices using XGBoost in Python.
- Optimized medication delivery routes by analyzing rider data and geolocation data using Standard SQL and Python.
Researcher
National University of Singapore
- Worked in the bioinformatics laboratory of the Computer Science Department.
- Designed and implemented concrete bioinformatics algorithms.
- Analyzed data and statistics of human and virus DNA.
- Worked on DNA compression and assembly-related problems.
- Created Colorgram, a succinct colored de Bruijn graph.
Researcher
University of Tokyo
- Worked in a bioinformatics laboratory. Created theoretical algorithms for bioinformatics problems.
- Analyzed mass spectrometry data and implemented and tested various DNA reconstruction algorithms.
- Created and presented statistics and published results in the scientific journal Acta Cybernetica.
Software Engineer
Canadian Aviation Electronics (CAE)
- Supported the development of the pilot training system by working on both the UI and the back end.
- Maintained the components by analyzing the customers' data and feedback.
- Designed and developed a specific communication system for military aircraft.
- Collaborated daily with teams across the Hungarian and Canadian sites.
Data Scientist
Nextent Informatics Co.
- Supported the data collection from the customers.
- Analyzed data using machine learning techniques.
- Created statistics.
- Supported the design of a mobile application.
- Participated in developing the mobile application for Android.
Software Developer
Key-Soft plc
- Participated in the development of billing software.
- Designed and maintained databases using PL/SQL.
- Developed components of the billing software product.
- Supported the development of an online bookstore in PHP and SQL.
- This was an internship program completed alongside university studies.
Software Developer
Rise FM
- Created interactive banners for the website of the company.
- The main development was done in Flash (ActionScript), HTML, CSS, and PHP.
- Collected reviews and feedback from the viewers of the website.
- Maintained specific parts of the website based on the reviews.
- This was a summer job held during high school.
Experience
Colorgram
https://github.com/denesbartha/Colorgram
Tree Graph Labeling
https://github.com/denesbartha/tree-graph-labeling
Reconstruction of Rooted Directed Trees
https://github.com/denesbartha/RRDT
Education
Ph.D. in Computer Science
Eötvös Loránd University - Hungary
Master's Degree in Computer Science
Eötvös Loránd University - Hungary
Bachelor's Degree in Computer Science
Eötvös Loránd University - Hungary
Certifications
Associate Android Developer
Foundation Certificate in Software Testing
ISTQB
Software Information Technologist
Eötvös Loránd University
Skills
Libraries/APIs
Scikit-learn, Pandas, NumPy, Sage, Keras, TensorFlow, XGBoost, PyTorch
Tools
BigQuery, Git, PyCharm, CLion, Sublime Text 3, Sublime Text, Flash, Maple, MATLAB, Tableau, AutoML, Cloud Dataflow, Apache Airflow, Terraform, You Only Look Once (YOLO), Microsoft Power BI
Languages
Python, C++, C, R, MicroPython, Flash ActionScript, HTML, CSS, PHP, Assembly, SQL, Java, C#, Rust, JavaScript, Dart
Platforms
Google Cloud Platform (GCP), Linux, Raspberry Pi, Ubuntu, Android, Jupyter Notebook, Amazon Web Services (AWS), Kubeflow, Kubernetes
Frameworks
Boost, Django, Android SDK
Paradigms
Scrum
Storage
PL/SQL, MySQL
Other
Artificial Intelligence (AI), Machine Learning, Machine Learning Operations (MLOps), Chatbots, Computer Vision, Deep Learning, ESP32, Large Language Models (LLMs), Hardware Development, PCB Design, BERT, Natural Language Processing (NLP), Language Models, Data Build Tool (dbt), Google Colaboratory (Colab), Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Sensor Data, IoT Protocols