
Pedro Lima
Software Developer
Pedro is a software developer and architect who specializes in data science and machine learning. He has extensive experience in designing and developing data applications on large information systems for companies such as Nestlé, Tetra Pak, and Sony Mobile.
Experience
- Data Science: 10 years
- Python: 10 years
- Machine Learning: 8 years
- Generative Pre-trained Transformers (GPT): 7 years
- Natural Language Processing (NLP): 7 years
- GPT: 7 years
- Data Pipelines: 6 years
- PyTorch: 5 years
Preferred Environment
Jupyter, Python, MacOS, Linux, Git, Sublime Text
The most amazing...
...project I've done was the development of an AI agent that learned to play text-based games and won the Microsoft TextWorld competition.
Work Experience
Freelance NLP and Machine Learning Developer
Talkmap (formerly Discourse.ai)
- Developed machine learning models for natural language understanding, concept discovery, incremental discovery, and conversational analytics.
- Implemented models for dialog flow identification and classification.
- Built unsupervised models for intent discovery and natural language generation.
- Designed and built the AI platform for incremental discovery of new customer intents in call center conversation data.
Freelance Chatbot Developer
BiggerBrains (via Toptal)
- Developed a natural-language-understanding model for an educational chatbot.
- Integrated the chatbot with Slack, Facebook Messenger, and Microsoft Teams platforms.
- Implemented the chatbot dialog engine using a Rasa machine learning system.
Freelance NLP/Machine Learning Developer
Cargo Chief (via Toptal)
- Developed a system for data extraction from natural language sources.
- Implemented both rule-based extractions and machine learning for natural language processing.
- Developed tools for data curation and data preprocessing.
Freelance Data Scientist
HumNutrition (via Toptal)
- Implemented a product recommendation system and integrated it with the webshop.
- Built a predictive model for subscription customer churn.
- Performed a clustering analysis of the customer space to extract insights for marketing.
- Developed REST web services to integrate the models into the main business application.
Freelance Data Analyst
Nespresso
- Worked on the blueprint design and implementation of a new retail information system, with responsibility for customer service and supply chain flows.
- Developed web services for eCommerce integration with a supply chain (stock, availability, and sales documents).
- Implemented an outlier detection system for alerts on customer service data entry.
- Developed an extensive set of automated unit tests for end-to-end supply chain flows.
- Analyzed data logs to identify bottlenecks and improve performance.
Freelance Data Analyst and Developer
Syncronic
- Worked as a core developer of a new SaaS platform for sales forecasting.
- Handled the general architecture design, forecasting model, back-end development, REST API, and integration with the ERP system.
- Developed a forecasting model by combining ARIMA and ETS with a machine-learning gradient-boosting model.
Data Analyst (Contract)
Novozymes
- Worked on the business blueprint and implementation of a new SAP supply chain solution.
- Designed and developed new production planning heuristics.
- Developed automated end-to-end tests for the business processes.
Web Developer (Contract)
Unilever
- Enhanced the Python back end of Digpedia.net, an internal multimedia marketing web application.
- Developed a Django system for multimedia management with fine-grained access control.
- Implemented a system for temporary access to resources in the Django application.
Data Analyst | Developer (Contract)
Tetra Pak
- Designed and developed a block-planning extension to the production planning software.
- Collaborated on the design of the new available-to-promise solution and product-allocation solution.
- Designed and implemented enhancements to the SAP CIF interface to support a special VMI scenario.
- Supported and worked on solution enhancements for rollouts in multiple plants.
Data Analyst (Contract)
Sony Mobile
- Implemented a prototype for the available-to-promise system using rules-based functionality for the project proof of concept.
- Designed and implemented an SAP global available-to-promise system.
- Designed and built the technical specifications for enhancements in the APO interface, backorder processing, and special reports.
Data Analyst (Contract)
Nestlé
- Performed data analysis to improve performance and resolve issues.
- Designed and implemented an algorithm to support products with shelf-life constraints in sales stock allocation.
- Enhanced the available-to-promise process in the supply chain management system.
- Designed and developed a simplified user interface for quota and allocation maintenance, using dynamic source code generation.
Data Analyst (Contract)
Philips Consumer Electronics
- Collaborated on the business blueprint design for supply chain quota allocation as a technical expert.
- Implemented a prototype with the algorithm identified in the blueprint.
- Worked on the project SWOT analysis.
Data Analyst (Contract)
Johnson & Johnson Pharmaceutical
- Conducted data analysis to troubleshoot system issues and implement improvements.
- Designed and developed enhancements to the sales forecasting and demand-planning system.
- Implemented enhancements to the production-planning detailed scheduling system.
Data Analyst | Developer
Sonae Industria
- Worked on a greenfield SAP implementation with a focus on the supply chain system.
- Analyzed data to improve performance and stability.
- Developed a real-time asynchronous interface for an SAP integration with shop-floor control.
- Implemented a web-based performance-monitoring dashboard.
Experience
First Prize Winner of Microsoft Research TextWorld AI Competition
https://www.microsoft.com/en-us/research/blog/first-textworld-problems-the-competition-using-text-based-games-to-advance-capabilities-of-ai-agents/
Kaggle Master
https://www.kaggle.com/pvlima
Sales Forecasting Platform
Production Planning Heuristic
Outlier Detection System
Game with Generative AI
https://museumof.ai/collection/#bot-poets-14675
Skills
Languages
Python, SQL, ABAP, Java, R, JavaScript, TypeScript, HTML, CSS
Frameworks
LightGBM, Django, Scrapy, Flask
Libraries/APIs
PyTorch, XGBoost, Scikit-learn, Pandas, Matplotlib, Beautiful Soup, TensorFlow, Keras, SciPy, NumPy, SpaCy, jQuery, React
Paradigms
Data Science, ETL, Continuous Integration (CI), Continuous Deployment, Test-driven Development (TDD), REST, Scrum
Other
Data Analytics, Data Analysis, Statistical Data Analysis, Natural Language Processing (NLP), Data Engineering, Machine Learning, SAP, Natural Language Understanding (NLU), GPT, Generative Pre-trained Transformers (GPT), Image Processing, Serverless, Chatbots, Artificial Intelligence (AI), Recommendation Systems, lxml, Web Scraping, Deep Learning, Scientific Computing, Computational Linguistics, Generative Pre-trained Transformer 3 (GPT-3), OpenAI, ChatGPT, LangChain, Forecasting, Time Series, Big Data, Maps, OCR, Data Warehouse Design, Google BigQuery, Computer Vision, HTMX, Text Generation, Games
Tools
Rasa.ai, Apache Airflow, Jupyter, GitHub, Git, Sublime Text, GitLab CI/CD, Tableau, PyCharm, IBM Watson, Pytest
Platforms
Docker, Google App Engine, MacOS, Linux, Google Cloud Platform (GCP), Kubernetes, Oracle, Apache Kafka, AWS Lambda, Amazon Web Services (AWS)
Storage
Data Pipelines, PostgreSQL, MySQL, Elasticsearch, PostGIS, MongoDB
Education
PhD Degree in Process Systems Engineering
Coimbra University - Coimbra, Portugal
Master's Degree in Chemical Engineering
Coimbra University - Coimbra, Portugal