Lead Data Scientist
2022 - PRESENTWinnow- Managed and mentored a team of data scientists. Led strategic projects/insights.
- Supported the development of computer vision models.
- Improved and automated the workflow and processes.
- Improved coding practice and reviewed existing projects' code.
- Managed the Agile way of working and sprint planning within the data science team. Also managed annotation and data quality.
Technologies: Python 3, Python, SQL, Jupyter Notebook, Mode Analytics, Data Science, Statistics, Pandas, Jira, Slack, Agile, Sprint PlanningSenior Data Scientist
2021 - PRESENTSky UK- Developed and implemented customer churn models for the business.
- Collaborated on a real-time machine learning proof of concept involving anomaly detection on hub telemetry data.
- Interviewed and recruited data scientist candidates and fulfilled line management responsibilities.
Technologies: Churn Analysis, Real-time Data, Anomaly Detection, Google Cloud Platform (GCP), BigQuery, Jupyter Notebook, Python 3, Keras, TensorFlow, Scikit-learn, Pandas, Supervised Learning, Supervised Machine Learning, Unsupervised Learning, Autoencoders, Google Cloud Storage, Google BigQuery, Data Science, Machine Learning, Python, SQL, Classification, Information Retrieval, Clustering, Algorithms, Artificial Intelligence (AI), Neural Networks, Deep Learning, Data Analysis, Data Analytics, Data Reporting, Big Data, Data, GitHub, Data Preprocessing, Document Processing, Feature Engineering, Data Processing, Deep Neural Networks, Jira, Slack, Agile, Sprint PlanningSenior Data Scientist
2021 - 2021Integral Solutions, Inc.- Investigated the pros and cons of using different NBA APIs.
- Wrote scripts to retrieve NBA data, process it, and store it on S3.
- Performed feature engineering using team stats and other handcrafted features coming from historical NBA matches.
- Designed, validated, and tested a deep neural network model to predict NBA winners and losers as well as the winning probabilities.
- Collaborated with a software engineer to put the NBA prediction model in production for the first MVP of the project.
- Achieved market-leading accuracy for predicting NBA match outcomes.
- Wrote some documentation, introduced some unit tests, and suggested future developments for the project and actions that could further improve the existing model.
Technologies: Python, Amazon S3 (AWS S3), Amazon Web Services (AWS), Deep Learning, Deep Neural Networks, Neural Networks, Predictive Modeling, Data Processing, APIs, Minimum Viable Product (MVP), Documentation, Feature Engineering, Churn Analysis, Data Science, Machine Learning, Scikit-learn, Keras, Pandas, Classification, Jupyter Notebook, Algorithms, Artificial Intelligence (AI), Supervised Learning, Data Analysis, Data Analytics, Data Reporting, Data, GitHub, Data Preprocessing, Document ProcessingSenior Data Scientist
2018 - 2019Notonthehighstreet Enterprises Ltd- Managed a topic classification NLP project using convolutional neural networks and word embeddings to be used by the partners and operations/customer service team.
- Led a deep neural network recommender system project that led to valuable customer segmentation insights to be used by the product and curation team.
- Collaborated with the digital marketing team to increase the effectiveness of marketing and advertising campaigns as well as SEO.
- Managed a competitor analysis project to be used as insights by the executive team.
- Improved data science workflow and coding practices.
- Redesigned the data science recruitment from scratch.
- Managed, guided, and mentored a mid-level data scientist.
- Built a product bundles graph to visualize insights on products frequently bought together.
- Documented data science projects on a data team wiki.
- Managed a multi-touch digital marketing attribution project using a Markov chain.
Technologies: Big Data, Data, R, Dimensionality Reduction, Unsupervised Learning, Supervised Learning, Neural Networks, eCommerce, Data Analytics, Document Processing, Google Analytics, Google SEO, Algorithms, Recommendation Systems, Continuous Integration (CI), Data Reporting, Data Analysis, Conversion Rate, Machine Learning, Data Science, Amazon Web Services (AWS), GitHub, Statistical Analysis, Mesos, Relational Databases, Tableau, Pattern Recognition, Calculus, Linear Algebra, Word Embedding, Convolutional Neural Networks, Snowflake, Information Retrieval, Clustering, Classification, Regression, Natural Language Processing (NLP), Artificial Intelligence (AI), Pandas, Python, Jira, Slack, LaTeX, CHRONOS, Jenkins, Ansible, Jupyter Notebook, Scikit-learn, Keras, TensorFlow, SQL, Python 3, Writing & Editing, Documentation, Jupyter, Data Modeling, Statistics, Matplotlib, NumPy, Deep Learning, Chatbots, APIs, Git, Docker, Text Analytics, Data Preprocessing, Amazon EC2, MySQL, Classification Algorithms, Amazon S3 (AWS S3), Churn Analysis, Feature Engineering, Data Processing, Deep Neural Networks, Agile, Sprint PlanningData Scientist
2017 - 2018Notonthehighstreet Enterprises Ltd- Worked on an NLP semantic search project using word embeddings in collaboration with tech and other product stakeholders.
- Built predictive models to evaluate our business partners' success to be used as actionable insights by the partners and operations team.
- Engaged and built relationships with senior stakeholders throughout the business.
- Worked on an external trending/social media influencers/posts ranking project in collaboration with the product and curation team that led to the development of a web app to make their job easier.
- Contributed to creating and introducing a data team learning and development culture.
- Placed an NLP project in production to detect a set of specific things in messages business partners sent to customers to be used as actionable insights by the partners and operations team and to be included in a weekly report.
- Documented data science projects on a data team wiki.
Technologies: Big Data, Data, R, Dimensionality Reduction, Unsupervised Learning, Supervised Learning, Neural Networks, eCommerce, Data Analytics, Document Processing, Algorithms, Continuous Integration (CI), Data Reporting, Data Analysis, Conversion Rate, Machine Learning, Data Science, Amazon Web Services (AWS), GitHub, Statistical Analysis, Jira, Slack, Relational Databases, Pattern Recognition, Calculus, Linear Algebra, Word Embedding, Convolutional Neural Networks, Information Retrieval, Clustering, Classification, Regression, Natural Language Processing (NLP), Artificial Intelligence (AI), Pandas, Python, LaTeX, Mesos, CHRONOS, Jenkins, Ansible, Jupyter Notebook, Scikit-learn, Keras, TensorFlow, SQL, Python 3, Writing & Editing, Documentation, Jupyter, Data Modeling, Statistics, Matplotlib, NumPy, Deep Learning, APIs, Git, Docker, Text Analytics, Data Preprocessing, Amazon EC2, MySQL, Classification Algorithms, Amazon S3 (AWS S3), Feature Engineering, Data Processing, Deep Neural NetworksData Scientist
2016 - 2016Mindi Technologies Ltd- Wrote Python scripts to analyze 36 features of DigitalOcean's servers' data such as droplets_cpu_stime, droplets_cpu_utime, droplets_network_rxbytes, and droplets_network_txbytes.
- Worked with the server's droplets of nine different sizes (512 MB, 1GB, 2GB, 4GB, 8GB, 16GB, 32GB, 48GB, and 64GB).
- Tried to infer server and droplet power usages from the datasets provided by DigitalOcean.
Technologies: Big Data, Data, Data Analytics, Data Analysis, Statistics, Matplotlib, NumPy, Python 3, SQL, MySQL, Data ProcessingAI Researcher
2016 - 2016King's College London- Worked on a project that was part of a collaboration between researchers in artificial intelligence, telecommunications, and environmental sciences. The project was carried out in partnership with Transport for London (TFL) and Ericsson.
- Used artificial intelligence planning to contribute to the design of the next generation of intelligent urban traffic controls (i.e., AI-controlled traffic lights, speed limits, and route planning).
- Visited the TFL operational center and learned about the SCOOT system. Learned about traffic systems used in other main cities around the world.
- Studied the papers written by some of the world's most prominent research groups on traffic optimization.
- Used traffic simulation tools such as SUMO (simulation of urban mobility) and PTV Vissim to simulate congestion scenarios in London.
- Wrote Python scripts that were part of the framework used to interface the DINO AI planner and SUMO.
- Attended the 26th International Conference on Automated Planning and Scheduling (ICAPS 2016) in London.
- Guided and mentored a couple of students in the Master of Science degree program.
Technologies: Artificial Intelligence (AI), Python, Planning, APIs, Git