Ioannis Melas
Verified Expert in Engineering
Data Scientist and Machine Learning Developer
Cambridge, United Kingdom
Toptal member since March 30, 2021
Ioannis is a data scientist with expertise in prototyping, developing, and deploying data science and ML workflows that best leverage business data. This includes exploratory analysis such as dimensionality reduction, clustering, feature extraction, model fit/parameter estimation, and supervised analysis such as classification and regression. Ioannis is an expert in structured and unstructured data (NLP). Some of his notable clients include the U.S. Food & Drug Administration and AstraZeneca.
Portfolio
Experience
- Machine Learning - 12 years
- Data Science - 12 years
- Bioinformatics - 12 years
- Bash - 12 years
- Scikit-learn - 7 years
- Python - 7 years
- Keras - 5 years
- R - 5 years
Availability
Preferred Environment
Linux, Python 3, Flask-RESTful, Streamlit, Spotfire, Python, R, Bash
The most amazing...
...NLP solution I've developed was for the text summarization and classification of biomedical literature routinely used by research scientists.
Work Experience
Data Science Contractor
Shell
- Designed solutions for carbon sequestration. Mined structured and unstructured data on the molecular processes driving the carbon cycle in the soil. Identified interventions to optimize the carbon cycle.
- Validated findings against the published literature (20 million articles) using NLP.
- Developed an interactive dashboard with the results and published it to end users.
Machine Learning CTO
Carbon Connect Enterprise Strategies Inc.
- Developed a platform for monitoring forest growth and carbon credit budgeting.
- Mined and segmented LiDAR and satellite images for tree identification and growth.
- Built a dashboard using Streamlit in Python, deployed in GCP to allow the users to query their data.
Data Science Contractor
AstraZeneca
- Developed a machine learning workflow to leverage and interpret genetic data. This included parsing and preprocessing patient data, normalization, dimensionality reduction, statistical tests, and supervised analysis.
- Created a natural language solution for mining biomedical literature. The data was structured in an Elasticsearch database, cleaned, tokenized using the Natural Language Toolkit (NLTK), vectorized, and then used in a text classification framework.
- Built dashboards and UI using Streamlit in Python and deployed them using NGINX.
Data Science Contractor
Arm
- Built a machine learning framework for maximizing coverage in CPU verification. Development was in Python; deployed on HPC using the Slurm Workload Manager.
- Developed workflows leveraging adversarial learning using GANs and programmed in Python Keras.
- Addressed numerical optimization problems using genetic algorithms with a custom GA implementation.
Principal Data Scientist
UCB Celltech
- Built machine learning workflows to predict patient response to candidate drugs. Developed in R.
- Led a team of three developers to create exploratory analytics solutions/dashboards to visualize high-dimensional data. Results were pre-calculated in R, then imported in TIBCO Spotfire.
- Designed machine learning solutions to predict drug activity in assays. Used LSTMs to model chemical structures as free text and applied methods from text classification.
Postdoctoral Research Fellow
U.S. Food & Drug Administration
- Developed a solution for predicting drug adverse events based on their transcriptomic profiles.
- Created a linear programming formulation to model the structure of directed graphs.
- Applied a solution to predict the adverse effects of new compounds.
Experience
Mining Biomedical Literature Using Elasticsearch and NLP
Framework for CPU Verification
Method for Predicting the Efficacy of New Drugs
Education
Ph.D. in Numerical Optimization, Machine Learning, Bioinformatics
National Technical University of Athens - Athens, Greece
Master's Degree in Mechanical Engineering
National Technical University of Athens - Athens, Greece
Skills
Libraries/APIs
Scikit-learn, Keras, NumPy, TensorFlow, LSTM, Natural Language Toolkit (NLTK), Flask-RESTful, SpaCy, NetworkX, PySpark, Pandas
Tools
Spotfire, Git, NGINX, Jenkins, MATLAB, GitHub, Apache Airflow
Languages
Bash, Bash Script, Python, Python 3, R, SQL, C
Platforms
Jupyter Notebook, Linux, H20, Amazon Web Services (AWS), Google Cloud Platform (GCP), Kubernetes
Industry Expertise
Bioinformatics
Frameworks
Streamlit, Flask
Storage
Elasticsearch
Other
Machine Learning, Data Science, Mathematical Modeling, Linear Optimization, Genomics, Numerical Methods, Numerical Simulations, Numerical Modeling, Natural Language Processing (NLP), Slurm Workload Manager, Data Analysis, Mixed-integer Linear Programming, Convex Optimization, Pharmaceuticals, Generative Pre-trained Transformers (GPT), Generative Adversarial Networks (GANs), Computational Physics, Deep Learning, Genetic Algorithms, Data Analytics, Drug Development, Gunicorn, Data Visualization, Artificial Intelligence (AI), Statistics, Large-scale Computing, CTO
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring