Martin Elias Costa, Developer in Buenos Aires, Argentina
Martin is available for hire
Hire Martin

Martin Elias Costa

Verified Expert  in Engineering

Research Developer

Buenos Aires, Argentina

Toptal member since December 11, 2020

Bio

Martin is an experienced data scientist with vast knowledge of machine learning and artificial intelligence. After completing his Ph.D. in physics, he has worked on a wide range of projects from both the private and public sectors. He has extracted meaningful insights throughout the years and created products leveraging data sources in the most varied formats, including images, geospatial data, structured databases, time series, and text.

Portfolio

Shirt Shack Limited
Python, Artificial Intelligence (AI), PyTorch, Pandas, OpenAI GPT-4 API...
Green Line, Inc.
Artificial Intelligence (AI), Natural Language Processing (NLP)...
The American Turmeric Company, Inc
Artificial Intelligence (AI), ChatGPT...

Experience

  • Python - 8 years
  • Machine Learning - 7 years
  • Research - 7 years
  • Data Science - 5 years
  • Statistics - 5 years
  • Image Analysis - 5 years
  • Deep Learning - 3 years
  • Medical Imaging - 3 years

Availability

Part-time

Preferred Environment

Visual Studio Code (VS Code), Jupyter, Python, Linux

The most amazing...

...thing I've developed is a deep learning image analysis pipeline for brain MRI images currently being used in a real life clinical setting.

Work Experience

AI Engineer

2023 - 2023
Shirt Shack Limited
  • Developed a platform to automatically generate content for Amazon products using AI. It allowed generation for individual products and bulk generation.
  • Integrated visual QA models to automatically add information about the different t-shirt designs.
  • Integrated Google Drive and storage to create a smart template and resource management system. It allowed us to track manual content easily and provided seamless integration with AI-generated content.
Technologies: Python, Artificial Intelligence (AI), PyTorch, Pandas, OpenAI GPT-4 API, Generative Pre-trained Transformers (GPT), Data Processing, Data Processing Automation, Computer Vision, Generative Systems, Streamlit, Prompt Engineering, ChatGPT, FastAPI, Containerization, Machine Learning Operations (MLOps), Data Scientist, Vector Data, Databases

AI Expert

2023 - 2023
Green Line, Inc.
  • Scoped and designed all technical aspects of the project.
  • Wrote the technical section for an NSF funding application.
  • Researched the state of the art on all the key AI fields involved in the project, including large language models, text-to-speech, automatic speech recognition, and knowledge graphs.
Technologies: Artificial Intelligence (AI), Natural Language Processing (NLP), Machine Learning, Python, TensorFlow, PyTorch, Data Preprocessing, Feature Analysis, Cloud, Distributed Computing, Deep Learning, Healthcare, OpenAI GPT-4 API, Chatbots, Software Architecture, Prompt Engineering, ChatGPT, FastAPI, Containerization, Machine Learning Operations (MLOps), Data Scientist, Vector Data, Databases

AI/GPT Expert

2023 - 2023
The American Turmeric Company, Inc
  • Developed an LLM-enabled chatbot using American Turmeric's custom data.
  • Created a prototype UI for the chatbot and an API for easy integration with their existing site.
  • Generated CI/CD code for deployment on Google Cloud Platform.
  • Created update triggers to automatically track changes on a storage folder containing the knowledge source base documents.
Technologies: Artificial Intelligence (AI), ChatGPT, Generative Pre-trained Transformers (GPT), JavaScript, Chatbots, Python, Django, Natural Language Processing (NLP), Large Language Models (LLMs), APIs, OpenAI API, OpenAI GPT-4 API, Software Architecture, Prompt Engineering, FastAPI, Containerization, Data Scientist, Vector Data, Databases

NLP Engineer | Marketing Automation Startup

2022 - 2023
Channel Growth Inc
  • Performed prompt engineering to improve automatic ad generation using GPT-3.
  • Created a dynamic prompt template and database structure to generate ads tailored to each client's industry.
  • Coordinated data collection and ran GPT-3 fine-tuning experiments.
Technologies: Generative Pre-trained Transformers (GPT), Natural Language Processing (NLP), Machine Learning, Artificial Intelligence (AI), Robotic Process Automation (RPA), Ads, Marketing, Leads, Marketing Automation, OpenAI, Language Models, Text Analytics, OpenAI GPT-3 API, LangChain, Supervised Machine Learning, Machine Learning Automation, Chatbots, ChatGPT, Large Language Models (LLMs), APIs, OpenAI API, OpenAI GPT-4 API, Software Architecture, Prompt Engineering, FastAPI, Containerization, Data Scientist, Vector Data, Databases

Statistician

2021 - 2022
Brightfield Group, LLC
  • Designed and trained a new classification model for automatic product category extraction.
  • Created custom reports and devised a customer segmentation strategy for better decision-making.
  • Implemented an automated pipeline and dashboard for NLP-related KPI monitoring using Apache Airflow and Microsoft Power BI.
  • Improved performance of existing product deduplication pipeline by 100 times.
Technologies: Statistics, SQL, Mathematics, Python, Statistical Modeling, Data Analysis, Data Engineering, Data Wrangling, Jupiter, Scikit-learn, Pattern Matching, Data Matching, Language Models, Text Analytics, Supervised Machine Learning, Machine Learning Automation, Natural Language Processing (NLP), APIs, Data Visualization, Software Architecture, Startups, MySQL, Containerization, Data Versioning, Machine Learning Operations (MLOps), Data Scientist, Database Design, Databases

Data Scientist

2021 - 2022
Notewardy
  • Developed several NLP models to automatically process students' notes. The tasks included: keyword-definition pairs extraction, automatic flashcard generation, and short answer grading.
  • Suggested and designed new product features based on the latest NLP literature.
  • Wrote technical requirements for externally hired software factories.
Technologies: Generative Pre-trained Transformers (GPT), Natural Language Processing (NLP), SpaCy, Docker, Python, Linux, Data Science, Machine Learning, Jupyter, Artificial Intelligence (AI), Git, PyTorch, NumPy, Data Analytics, Code Review, Source Code Review, Data Analysis, Data Engineering, Data Wrangling, Jupiter, Language Models, Text Analytics, Supervised Machine Learning, Machine Learning Automation, APIs, Software Architecture, Containerization, Machine Learning Operations (MLOps), Data Scientist, Database Design, Databases

Data Scientist

2018 - 2021
Entelai
  • Built an end-to-end ML pipeline for medical image analysis from the ground up. The system is currently serving clinics in Argentina, Brazil, and Chile.
  • Trained deep neural networks for brain MRI segmentation and built automated reports to help radiologists make better decisions.
  • Wrote production code to deploy the trained models in a real-life setting. Tested and reviewed code before deployment to cloud servers.
  • Advised on key product decisions based on the latest literature for Medical Image Analysis.
Technologies: Statistical Modeling, Medical Imaging, Deep Learning, Linux, Image Analysis, Statistics, Data Science, Machine Learning, Jupyter, Computer Vision, Artificial Intelligence (AI), Amazon Web Services (AWS), Git, Convolutional Neural Networks (CNNs), Object Detection, PyTorch, TensorFlow, NumPy, Data Analytics, Code Review, Source Code Review, Data Modeling, Image Processing, Data Analysis, Data Engineering, Data Wrangling, Jupiter, Amazon S3 (AWS S3), Scikit-learn, Supervised Machine Learning, Machine Learning Automation, Amazon EC2, Data Visualization, Medical Diagnostics, Software Architecture, Startups, Containerization, Data Versioning, Machine Learning Operations (MLOps), Data Scientist, Databases

Data Scientist

2016 - 2018
National Government of Argentina
  • Developed a flexible text-based internal search engine for the National Chief of Staff congressional sector.
  • Created dozens of analytical reports on governmental datasets from different ministries.
  • Developed computer vision algorithms to aid in the local monitoring of the mosquito Aedes aegypti, an important disease vector.
Technologies: Generative Pre-trained Transformers (GPT), Natural Language Processing (NLP), Geospatial Data, Time Series Analysis, Pandas, Python, Linux, Statistics, Data Science, Statistical Modeling, Jupyter, Computer Vision, Git, GIS, NumPy, Data Analytics, Code Review, Source Code Review, Data Modeling, Image Processing, R, Data Analysis, Data Wrangling, Jupiter, Amazon Web Services (AWS), Amazon S3 (AWS S3), Scikit-learn, PostgreSQL, Text Analytics, Supervised Machine Learning, Amazon EC2, Data Visualization, Software Architecture, MySQL, Data Versioning, Machine Learning Operations (MLOps), Data Scientist, Databases

Developer

2015 - 2015
Cran.io
  • Designed RF frequency controls to integrate into an existing system.
  • Built the RF transmitters and incorporated them into the preexisting electronics.
  • Deployed and tested the devices in a real-life setting.
Technologies: Remote Sensing, Electronics, Linux, Data Analysis, Data Analytics, Software Architecture

Consultant

2015 - 2015
Direct TV
  • Evaluated the feasibility of a pilot test involving physiological measurements that may be indicative of emotional reactions of viewers while watching audiovisual material.
  • Developed a prototype and gathered preliminary data.
  • Wrote a detailed report with findings and recommendations.
Technologies: Neuroscience, Physics, Linux, Statistics, Data Science, Statistical Modeling, NumPy, Data Analysis, Data Analytics, Data Wrangling, Data Visualization

Consultant

2014 - 2015
Cazoll & Asoc.
  • Designed a research protocol to evaluate skin moisture in users of cotton hygiene products.
  • Oversaw a field study to obtain measurements from a large cohort of customers.
  • Analyzed the data and wrote a detailed internal report summarizing findings.
Technologies: Quality Assurance (QA), Physics, Linux, Statistics, Data Science, Statistical Modeling, NumPy, Data Analysis, Data Analytics, Data Modeling, Data Wrangling, Data Visualization

National Survey Analysis for UNICEF

I analyzed data from a large national survey on IT tool usage by high school students in Argentina and wrote a very detailed report published by UNICEF regarding the use of video games and their cognitive impact on teenagers. These reports provide very valuable information for public policymakers.

Research on Rat Communication

Part of my PhD project involved the design and construction of an automated 24/7 rat training habitat. This entailed assembling and programming a series of sensors and actuators, including water pumps, infrared cameras, ultrasonic microphones, lasers, light-sensing diodes, and RFID chips to identify each animal. I also implemented motion tracking algorithms to help monitor the rats' behavior.

Educational Software Development

An online gaming platform for children aimed at strengthening key cognitive skills of primary school students. I helped develop the online platform and analyze the data coming from the experimental interventions.
2008 - 2014

Ph.D. in Physics

University of Buenos Aires - Buenos Aires, Argentina

2003 - 2008

Master's Degree in Physics

University of Buenos Aires - Buenos Aires, Argentina

Libraries/APIs

Pandas, PyTorch, NumPy, Scikit-learn, OpenAI API, jQuery, SpaCy, TensorFlow

Tools

ChatGPT, Git, Jupyter, MATLAB, Apache Airflow, GIS

Languages

Python, C, HTML, SQL, R, JavaScript

Platforms

Linux, Amazon Web Services (AWS), Docker, Visual Studio Code (VS Code), Arduino, Google Cloud Platform (GCP), Amazon EC2

Storage

Amazon S3 (AWS S3), PostgreSQL, MySQL, Databases

Frameworks

Django, Streamlit

Paradigms

Distributed Computing, Database Design

Industry Expertise

Marketing, Healthcare

Other

Image Analysis, Data Science, Natural Language Processing (NLP), Machine Learning, Artificial Intelligence (AI), Data Analytics, University Teaching, Data Analysis, Data Wrangling, Jupiter, Text Analytics, Supervised Machine Learning, Data Scientist, Physics, Research, Statistics, Deep Learning, Medical Imaging, Computer Vision, Convolutional Neural Networks (CNNs), Code Review, Source Code Review, Data Modeling, Image Processing, Data Engineering, OpenAI, Pattern Matching, Data Matching, Language Models, Generative Pre-trained Transformers (GPT), OpenAI GPT-3 API, Machine Learning Automation, Chatbots, Large Language Models (LLMs), APIs, Data Visualization, Medical Diagnostics, OpenAI GPT-4 API, Software Architecture, Prompt Engineering, FastAPI, Containerization, Data Versioning, Machine Learning Operations (MLOps), Neuroscience, Graph Theory, Statistical Modeling, Time Series Analysis, Geospatial Data, Electronics, Remote Sensing, Quality Assurance (QA), Object Detection, Technical Hiring, Interviewing, Mathematics, Robotic Process Automation (RPA), Ads, Leads, Marketing Automation, LangChain, Data Preprocessing, Feature Analysis, Cloud, Startups, Data Processing, Data Processing Automation, Generative Systems, Vector Data

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring