Pedro Lima, Machine Learning Developer in Porto, Portugal
Pedro Lima

Machine Learning Developer in Porto, Portugal

Member since September 4, 2017
Pedro is a software developer and architect who specializes in data science and machine learning. He has extensive experience in designing and developing data applications on large information systems for companies such as Nestlé, Tetra Pak, and Sony Mobile.
Pedro is now available for hire

Portfolio

Experience

Location

Porto, Portugal

Availability

Part-time

Preferred Environment

Python, Sublime Text, Jupyter, Git, Linux/Mac OSX

The most amazing...

...project I've done was the development of an AI agent that learned to play text-based games and won the Microsoft TextWorld competition.

Employment

  • Freelancer NLP/Machine Learning Developer

    2018 - PRESENT
    Discourse
    • Developed machine learning models for natural language understanding.
    • Implemented models for dialog flow identification and classification.
    • Developed unsupervised models for intent discovery and natural language generation.
    Technologies: PyTorch, Python, spaCy, Chatbots, Watson
  • Freelance Chatbot Developer

    2018 - 2019
    BiggerBrains (via Toptal)
    • Developed a natural-language-understanding model for an educational chatbot.
    • Integrated the chatbot with Slack, Facebook Messenger, and Microsoft Teams platforms.
    • Implemented the chatbot dialog engine using a Rasa machine learning system.
    Technologies: TensorFlow, Scikit-learn, Rasa, Django
  • Freelance NLP/machine learning Developer

    2017 - 2018
    Cargo Chief (via Toptal)
    • Developed a system for data extraction from natural language sources.
    • Implemented both rule-based extractions and machine learning for natural language processing.
    • Developed tools for data curation and data preprocessing.
    Technologies: Python, Natural Language Processing, Machine Learning
  • Freelance Data Scientist

    2017 - 2017
    HumNutrition (via Toptal)
    • Implemented a product recommendation system that was also integrated with the webshop.
    • Built a predictive model for the churning of subscription customers.
    • Performed a clustering analysis of the customer space to extract insights for marketing.
    • Developed REST web services to integrate the models into the main business application.
    Technologies: Python, Machine Learning, TensorFlow
  • Data Analyst (Contract)

    2014 - 2017
    Nespresso
    • Worked on the blueprint design and implementation of a new retail information system with responsibilities on customer service and supply chain flows.
    • Developed web services for the eCommerce integration with supply chain (stock, availability, sales documents).
    • Implemented an outlier detection system for alerts on customer service data entry.
    • Developed an extensive set of automated unit tests for end-to-end supply chain flows.
    • Analyzed the data for performance improvements.
    Technologies: SAP, Python, R
  • Data Analyst | Developer (Contract)

    2013 - 2013
    Syncronic
    • Worked as a core developer of a new SaaS platform for sales forecasting.
    • Created the general architecture design, forecasting model, back-end development, REST API, and integration with the ERP system.
    • Developed a forecasting model by combining ARIMA and ETS with a machine-learning gradient-boosting model.
    Technologies: Python, Machine Learning, Time Series Forecasting
  • Data Analyst (Contract)

    2011 - 2013
    Novozymes
    • Worked on the business blueprint and implementation of a new SAP supply chain solution.
    • Designed and developed new production planning heuristics.
    • Developed automated end-to-end tests.
    Technologies: SAP, Python
  • Web Developer (Contract)

    2011 - 2011
    Unilever
    • Enhanced the Python back-end of Digpedia.net—a multimedia marketing internal web application.
    • Developed, in Django, a system for multimedia management with fine-grained access control.
    • Implemented in the Django application a system for temporary access to resources.
    Technologies: Python, Django, PostgreSQL
  • Data Analyst | Developer (Contract)

    2008 - 2011
    Tetra Pak
    • Designed and developed a block-planning extension to the production planning software.
    • Collaborated on the design of the new available-to-promise solution and product-allocation solution.
    • Designed and implemented enhancements to the SAP CIF interface to support a special VMI scenario.
    • Supported and worked on solution enhancements for rollouts in multiple plants.
    Technologies: SAP, Python
  • Data Analyst (Contract)

    2009 - 2010
    Sony Mobile
    • Implemented a prototype for the available-to-promise system using rules-based functionality for the project proof of concept.
    • Designed and implemented an SAP global available-to-promise system.
    • Designed and built the technical specifications for enhancements in the APO interface, backorder processing, and special reports.
    Technologies: SAP
  • Data Analyst (contract)

    2008 - 2010
    Nestlé
    • Worked on the data analysis for performance improvement and issue resolution.
    • Designed and implemented an algorithm to support products with shelf-life constraints in sales stock allocation.
    • Enhanced the available-to-promise process in the supply chain management system.
    • Designed and developed a simplified user interface for quotas/allocations maintenance; used dynamic source code generation.
    Technologies: SAP, Python
  • Data Analyst (Contract)

    2007 - 2007
    Philips Consumer Electronics
    • Collaborated on the business blueprint design for supply chain quota allocation as a technical expert.
    • Implemented a prototype with the algorithm identified in the blueprint.
    • Worked on the project SWOT analysis.
    Technologies: SAP
  • Data Analyst (Contract)

    2007 - 2007
    Johnson & Johnson Pharmaceutical
    • Conducted a data analysis in order to troubleshoot system issues and implement improvements.
    • Designed and developed enhancements on the sales forecasting and a demand-planning system.
    • Implemented enhancements to the production-planning detailed scheduling system.
    Technologies: SAP, Python
  • Data Analyst | Developer

    2001 - 2006
    Sonae Industria
    • Worked on a new greenfield SAP implementation with a focus on the supply chain system.
    • Analyzed data in order to achieve performance and stability improvement.
    • Developed a real-time asynchronous interface for a SAP integration with shop-floor control.
    • Implemented a web-based performance-monitoring dashboard.
    Technologies: SAP, Python, Oracle, R

Experience

  • First Prize Winner of Microsoft Research TextWorld AI Competition (Other amazing things)
    https://www.microsoft.com/en-us/research/blog/first-textworld-problems-the-competition-using-text-based-games-to-advance-capabilities-of-ai-agents/

    I am the winner of a global competition on reinforcement learning and natural language understanding for building agents capable of solving text-based games.

  • Kaggle Master (Other amazing things)
    https://www.kaggle.com/pvlima

    I achieved the rank of master in a Kaggle machine learning competition platform.

  • Sales Forecasting Platform (Development)

    I worked on the design and development of a SaaS platform for sales forecasting. I also implemented a hybrid time series forecasting algorithm using ARIMA, ETS, and gradient boosting.

  • Production Planning Heuristic (Development)

    I designed and developed a heuristic algorithm for the special cyclic planning of fast-moving products. The planning heuristic is based on standard lots planning with additional leveling of production quantities and special scheduling based on product setup values.

  • Outlier Detection System (Development)

    I implemented an outlier detection system based on a random forest algorithm to provide alerts for suspicious/wrong data in the customer service flow.

Skills

  • Languages

    Python, SQL, ABAP, R, JavaScript
  • Frameworks

    LightGBM, Django, Scrapy, Flask
  • Libraries/APIs

    PyTorch, XGBoost, Scikit-learn, Pandas, Matplotlib, Beautiful Soup, TensorFlow, Keras, SciPy, NumPy, jQuery
  • Paradigms

    Data Science, Test-driven Development (TDD), REST, Scrum
  • Other

    Data Analysis, Statistical Data Analysis, Natural Language Processing (NLP), Data Engineering, Machine Learning, SAP, Natural Language Understanding, Chatbots, Artificial Intelligence (AI), Recommendation Systems, lxml, Web Scraping, Deep Learning, Scientific Computing, Computational Linguistics, Google BigQuery, Computer Vision
  • Tools

    Jupyter, GitHub, Git, Tableau, PyCharm, IBM Watson, Pytest
  • Platforms

    Google App Engine, MacOS, Linux, Google Cloud Platform (GCP), Amazon Web Services (AWS), Kubernetes
  • Storage

    PostgreSQL, MySQL, MongoDB

Education

  • PhD degree in Process Systems Engineering
    1999 - 2007
    Coimbra University - Coimbra, Portugal
  • Master's degree in Chemical Engineering
    1992 - 1997
    Coimbra University - Coimbra, Portugal

To view more profiles

Join Toptal
Share it with others