Pedro Lima, Developer in Porto, Portugal

Pedro Lima

Software Developer

Location
Porto, Portugal
Toptal Member Since
September 27, 2017

Pedro is a software developer and architect who specializes in data science and machine learning. He has extensive experience in designing and developing data applications on large information systems for companies such as Nestlé, Tetra Pak, and Sony Mobile.


Portfolio

Talkmap (formerly Discourse.ai)
Continuous Deployment, Continuous Integration (CI), Rasa.ai, GitLab CI/CD...
BiggerBrains (via Toptal)
Machine Learning, Artificial Intelligence (AI)...
Cargo Chief (via Toptal)
Amazon Web Services (AWS), OCR, Data Pipelines, Image Processing, MySQL...

Experience

  • Data Science - 10 years
  • Python - 10 years
  • Machine Learning - 8 years
  • Generative Pre-trained Transformers (GPT) - 7 years
  • Natural Language Processing (NLP) - 7 years
  • GPT - 7 years
  • Data Pipelines - 6 years
  • PyTorch - 5 years

Availability

Part-time

Preferred Environment

Jupyter, Python, MacOS, Linux, Git, Sublime Text

The most amazing...

...project I've done was the development of an AI agent that learned to play text-based games and won the Microsoft TextWorld competition.

Work Experience

2018 - 2023

Freelance NLP and Machine Learning Developer

Talkmap (formerly Discourse.ai)
  • Developed machine learning models for natural language understanding, concept discovery, incremental discovery, and conversational analytics.
  • Implemented models for dialog flow identification and classification.
  • Built unsupervised models for intent discovery and natural language generation.
  • Designed and built the AI platform for incremental discovery of new customer intents in call center conversation data.
Technologies: Continuous Deployment, Continuous Integration (CI), Rasa.ai, GitLab CI/CD, Apache Kafka, Apache Airflow, Data Pipelines, Docker, Big Data, PostgreSQL, Data Science, Machine Learning, PyTorch, Artificial Intelligence (AI), Natural Language Processing (NLP), GPT, Generative Pre-trained Transformers (GPT), Natural Language Understanding (NLU), Computational Linguistics, Pytest, SpaCy, Data Engineering, Kubernetes, Amazon Web Services (AWS), Scrum, Scikit-learn, Google Cloud Platform (GCP), Jupyter, Deep Learning, Chatbots, Statistical Data Analysis, NumPy, Python, Pandas, IBM Watson, React, TypeScript, Generative Pre-trained Transformer 3 (GPT-3), OpenAI, HTMX, ChatGPT
2018 - 2019

Freelance Chatbot Developer

BiggerBrains (via Toptal)
  • Developed a natural-language-understanding model for an educational chatbot.
  • Integrated the chatbot with Slack, Facebook Messenger, and Microsoft Teams platforms.
  • Implemented the chatbot dialog engine using a Rasa machine learning system.
Technologies: Machine Learning, Artificial Intelligence (AI), Generative Pre-trained Transformers (GPT), GPT, Natural Language Processing (NLP), Natural Language Understanding (NLU), Computational Linguistics, SpaCy, Scikit-learn, Deep Learning, Chatbots, NumPy, Python, Pandas, Django, Rasa.ai, TensorFlow
2017 - 2018

Freelance NLP/Machine Learning Developer

Cargo Chief (via Toptal)
  • Developed a system for data extraction from natural language sources.
  • Implemented both rule-based extractions and machine learning for natural language processing.
  • Developed tools for data curation and data preprocessing.
Technologies: Amazon Web Services (AWS), OCR, Data Pipelines, Image Processing, MySQL, Data Science, Machine Learning, PyTorch, Artificial Intelligence (AI), Generative Pre-trained Transformers (GPT), GPT, Natural Language Processing (NLP), Natural Language Understanding (NLU), Computational Linguistics, SpaCy, Data Engineering, LightGBM, Scrum, Flask, Scikit-learn, GitHub, Jupyter, SQL, Deep Learning, Statistical Data Analysis, NumPy, Python, Pandas
2017 - 2017

Freelance Data Scientist

HumNutrition (via Toptal)
  • Implemented a product recommendation system that was also integrated with the webshop.
  • Built a predictive model of churn among subscription customers.
  • Performed a clustering analysis of the customer space to extract insights for marketing.
  • Developed REST web services to integrate the models into the main business application.
Technologies: ETL, Data Pipelines, Docker, MySQL, Data Science, Machine Learning, PyTorch, Forecasting, SpaCy, Data Engineering, Recommendation Systems, XGBoost, LightGBM, Scikit-learn, GitHub, Jupyter, SQL, Deep Learning, Web Scraping, Statistical Data Analysis, NumPy, Python, Pandas, TensorFlow
2014 - 2017

Freelance Data Analyst

Nespresso
  • Worked on the blueprint design and implementation of a new retail information system, with responsibility for customer service and supply chain flows.
  • Developed web services for eCommerce integration with a supply chain (stock, availability, and sales documents).
  • Implemented an outlier detection system for alerts on customer service data entry.
  • Developed an extensive set of automated unit tests for end-to-end supply chain flows.
  • Analyzed data logs to identify bottlenecks and improve performance.
Technologies: ETL, Data Pipelines, Data Science, Machine Learning, Forecasting, Data Engineering, XGBoost, LightGBM, Scikit-learn, Jupyter, SQL, Statistical Data Analysis, NumPy, Python, Pandas, R, SAP
2013 - 2013

Freelance Data Analyst and Developer

Syncronic
  • Worked as a core developer of a new SaaS platform for sales forecasting.
  • Created the general architecture design, forecasting model, back-end development, REST API, and integration with the ERP system.
  • Developed a forecasting model by combining ARIMA and ETS with a machine-learning gradient-boosting model.
Technologies: ETL, Data Science, Machine Learning, Forecasting, Scikit-learn, Statistical Data Analysis, NumPy, Python, Pandas, Time Series
2011 - 2013

Data Analyst (Contract)

Novozymes
  • Worked on the business blueprint and implementation of a new SAP supply chain solution.
  • Designed and developed new production planning heuristics.
  • Developed automated end-to-end tests for the business processes.
Technologies: ETL, Data Science, Machine Learning, Scikit-learn, Jupyter, SQL, Statistical Data Analysis, NumPy, Python, Pandas, SAP, Data Pipelines
2011 - 2011

Web Developer (Contract)

Unilever
  • Enhanced the Python back-end of Digpedia.net, an internal multimedia marketing web application.
  • Developed a Django system for multimedia management with fine-grained access control.
  • Implemented a system for temporary access to resources in the Django application.
Technologies: Python, PostgreSQL, Django, HTML, CSS, JavaScript
2008 - 2011

Data Analyst | Developer (Contract)

Tetra Pak
  • Designed and developed a block-planning extension to the production planning software.
  • Collaborated on the design of the new available-to-promise solution and product-allocation solution.
  • Designed and implemented enhancements to the SAP CIF interface to support a special VMI scenario.
  • Supported and worked on solution enhancements for rollouts in multiple plants.
Technologies: Data Science, SQL, Statistical Data Analysis, Python, SAP
2009 - 2010

Data Analyst (Contract)

Sony Mobile
  • Implemented a prototype for the available-to-promise system using rules-based functionality for the project proof of concept.
  • Designed and implemented an SAP global available-to-promise system.
  • Designed and built the technical specifications for enhancements in the APO interface, backorder processing, and special reports.
Technologies: Data Science, Python, SAP
2008 - 2010

Data Analyst (Contract)

Nestlé
  • Analyzed data to improve performance and resolve issues.
  • Designed and implemented an algorithm to support products with shelf-life constraints in sales stock allocation.
  • Enhanced the available-to-promise process in the supply chain management system.
  • Designed and developed a simplified user interface for quotas/allocations maintenance; used dynamic source code generation.
Technologies: Data Science, ABAP, SQL, Python, SAP
2007 - 2007

Data Analyst (Contract)

Philips Consumer Electronics
  • Collaborated on the business blueprint design for supply chain quota allocation as a technical expert.
  • Implemented a prototype with the algorithm identified in the blueprint.
  • Worked on the project SWOT analysis.
Technologies: Data Science, Python, SAP
2007 - 2007

Data Analyst (Contract)

Johnson & Johnson Pharmaceutical
  • Analyzed data to troubleshoot system issues and implement improvements.
  • Designed and developed enhancements to the sales forecasting and demand-planning systems.
  • Implemented enhancements to the production-planning detailed scheduling system.
Technologies: Data Science, SQL, Python, SAP
2001 - 2006

Data Analyst | Developer

Sonae Industria
  • Worked on a new greenfield SAP implementation with a focus on the supply chain system.
  • Analyzed data to improve performance and stability.
  • Developed a real-time asynchronous interface for an SAP integration with shop-floor control.
  • Implemented a web-based performance-monitoring dashboard.
Technologies: Java, Data Warehouse Design, Data Science, ABAP, SQL, Statistical Data Analysis, Python, R, Oracle, SAP

Experience

First Prize Winner of Microsoft Research TextWorld AI Competition

https://www.microsoft.com/en-us/research/blog/first-textworld-problems-the-competition-using-text-based-games-to-advance-capabilities-of-ai-agents/
I won a global competition on reinforcement learning and natural language understanding, in which participants built agents capable of solving text-based games.

Kaggle Master

https://www.kaggle.com/pvlima
I achieved the rank of Master on Kaggle, a machine learning competition platform.

Sales Forecasting Platform

I worked on the design and development of a SaaS platform for sales forecasting. I was also involved in the implementation of a hybrid time series forecasting algorithm using ARIMA, ETS, and gradient boosting.
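
As a rough illustration of how ARIMA, ETS, and gradient boosting can be combined (not the platform's actual code, which is not described here), the sketch below uses deliberately tiny stand-ins: an AR(1) model in place of ARIMA, simple exponential smoothing in place of ETS, and boosted decision stumps that learn to correct the average of the two base forecasts. All function names and the parameters `alpha`, `rounds`, and `lr` are hypothetical, and the fit is in-sample only.

```python
from statistics import fmean

def ses_forecasts(y, alpha=0.3):
    """One-step-ahead simple exponential smoothing (a minimal ETS stand-in)."""
    level, out = y[0], []
    for obs in y:
        out.append(level)                      # forecast made before seeing obs
        level = alpha * obs + (1 - alpha) * level
    return out

def ar1_forecasts(y):
    """One-step-ahead AR(1) forecasts (a minimal ARIMA stand-in), fit by least squares."""
    x, t = y[:-1], y[1:]
    mx, mt = fmean(x), fmean(t)
    var = sum((a - mx) ** 2 for a in x)
    phi = sum((a - mx) * (b - mt) for a, b in zip(x, t)) / var if var else 0.0
    c = mt - phi * mx
    return [y[0]] + [c + phi * a for a in x]   # first entry is only a placeholder

def fit_stump(X, r):
    """Least-squares decision stump: the (feature, threshold) split with the lowest SSE."""
    best = (float("inf"), 0, float("inf"), fmean(r), fmean(r))
    for j in range(len(X[0])):
        for thr in sorted({row[j] for row in X})[:-1]:
            left = [ri for row, ri in zip(X, r) if row[j] <= thr]
            right = [ri for row, ri in zip(X, r) if row[j] > thr]
            ml, mr = fmean(left), fmean(right)
            sse = sum((v - ml) ** 2 for v in left) + sum((v - mr) ** 2 for v in right)
            if sse < best[0]:
                best = (sse, j, thr, ml, mr)
    return best[1:]

def predict_stump(stump, row):
    j, thr, left_value, right_value = stump
    return left_value if row[j] <= thr else right_value

def fit_boosting(X, target, rounds=20, lr=0.5):
    """Gradient boosting for squared loss: each stump fits the current residuals."""
    pred, stumps = [0.0] * len(target), []
    for _ in range(rounds):
        resid = [t - p for t, p in zip(target, pred)]
        j, thr, ml, mr = fit_stump(X, resid)
        stump = (j, thr, lr * ml, lr * mr)     # store leaf values already shrunk by lr
        stumps.append(stump)
        pred = [p + predict_stump(stump, row) for p, row in zip(pred, X)]
    return stumps

def hybrid_fit_predict(y):
    """Return (base, hybrid) in-sample forecasts for y[1:]."""
    X = [[s, a] for s, a in zip(ses_forecasts(y), ar1_forecasts(y))][1:]
    base = [(s + a) / 2 for s, a in X]         # plain average of the two base models
    resid = [obs - b for obs, b in zip(y[1:], base)]
    stumps = fit_boosting(X, resid)            # boosting learns the remaining error
    hybrid = [b + sum(predict_stump(s, row) for s in stumps) for b, row in zip(base, X)]
    return base, hybrid

if __name__ == "__main__":
    sales = [10, 12, 15, 11, 13, 16, 12, 14, 17, 13, 15, 18]
    base, hybrid = hybrid_fit_predict(sales)
    print([round(v, 2) for v in hybrid])
```

Feeding the base forecasts to the booster as features is only one possible design; a production system would more likely use library implementations of ARIMA and ETS and evaluate the combination on held-out periods rather than in-sample.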

Production Planning Heuristic

I designed and developed a heuristic algorithm for the special cyclic planning of fast-moving products. The planning heuristic is based on standard lots planning with additional leveling of production quantities and special scheduling based on product setup values.
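
The three ingredients named above (standard lots, leveling, and setup-based scheduling) can be sketched as follows. This is one possible reading, not the original algorithm: the function `plan_cycle`, its arguments, and the rule "sequence by ascending setup value" are assumptions made for illustration.

```python
from math import ceil

def plan_cycle(demand, lot_size, setup_value, periods):
    """Plan one production cycle.

    demand:      product -> total demand over the cycle
    lot_size:    product -> standard lot size
    setup_value: product -> setup value used for sequencing (assumed: lower runs first)
    periods:     number of periods in the cycle
    Returns a list with one entry per period: [(product, quantity), ...] in run order.
    """
    plan = [dict() for _ in range(periods)]
    for product, qty in demand.items():
        # Standard lots: round demand up to a whole number of lots.
        lots = ceil(qty / lot_size[product])
        # Leveling: spread the lots as evenly as possible over the cycle.
        base, extra = divmod(lots, periods)
        for p in range(periods):
            n = base + (1 if p < extra else 0)
            if n:
                plan[p][product] = n * lot_size[product]
    # Scheduling: within each period, order products by their setup value.
    return [sorted(period.items(), key=lambda kv: setup_value[kv[0]]) for period in plan]

if __name__ == "__main__":
    schedule = plan_cycle(
        demand={"A": 950, "B": 300},
        lot_size={"A": 100, "B": 150},
        setup_value={"A": 2, "B": 1},
        periods=4,
    )
    for number, period in enumerate(schedule, start=1):
        print(number, period)
```

In this example, product A needs 10 lots, so two periods receive 300 units and two receive 200, while product B's two lots go into the first two periods and run before A because of its lower setup value.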

Outlier Detection System

I implemented an outlier detection system based on a random forest algorithm to provide alerts for suspicious/wrong data in the customer service flow.

Game with Generative AI

https://museumof.ai/collection/#bot-poets-14675
I developed a game with content created by generative AI (poems and game images). The game is featured in the Museum of AI ("Love is like a foot on the beach" by Bot Poets Society 2021; AI-generated poem).

Skills

Languages

Python, SQL, ABAP, Java, R, JavaScript, TypeScript, HTML, CSS

Frameworks

LightGBM, Django, Scrapy, Flask

Libraries/APIs

PyTorch, XGBoost, Scikit-learn, Pandas, Matplotlib, Beautiful Soup, TensorFlow, Keras, SciPy, NumPy, SpaCy, jQuery, React

Paradigms

Data Science, ETL, Continuous Integration (CI), Continuous Deployment, Test-driven Development (TDD), REST, Scrum

Other

Data Analytics, Data Analysis, Statistical Data Analysis, Natural Language Processing (NLP), Data Engineering, Machine Learning, SAP, Natural Language Understanding (NLU), GPT, Generative Pre-trained Transformers (GPT), Image Processing, Serverless, Chatbots, Artificial Intelligence (AI), Recommendation Systems, lxml, Web Scraping, Deep Learning, Scientific Computing, Computational Linguistics, Generative Pre-trained Transformer 3 (GPT-3), OpenAI, ChatGPT, LangChain, Forecasting, Time Series, Big Data, Maps, OCR, Data Warehouse Design, Google BigQuery, Computer Vision, HTMX, Text Generation, Games

Tools

Rasa.ai, Apache Airflow, Jupyter, GitHub, Git, Sublime Text, GitLab CI/CD, Tableau, PyCharm, IBM Watson, Pytest

Platforms

Docker, Google App Engine, MacOS, Linux, Google Cloud Platform (GCP), Kubernetes, Oracle, Apache Kafka, AWS Lambda, Amazon Web Services (AWS)

Storage

Data Pipelines, PostgreSQL, MySQL, Elasticsearch, PostGIS, MongoDB

Education

1999 - 2007

PhD Degree in Process Systems Engineering

Coimbra University - Coimbra, Portugal

1992 - 1997

Master's Degree in Chemical Engineering

Coimbra University - Coimbra, Portugal