Pedro Lima
Verified Expert in Engineering
Software Developer
Porto, Portugal
Toptal member since September 27, 2017
Pedro is a software developer and architect specializing in data science, machine learning, and AI. He has extensive experience in the end-to-end process of conceiving, designing, developing, and deploying data applications for large companies and startups.
Portfolio
Experience
- Machine Learning - 10 years
- Python - 10 years
- Data Science - 10 years
- Generative Pre-trained Transformers (GPT) - 7 years
- Natural Language Processing (NLP) - 7 years
- Data Pipelines - 6 years
- PyTorch - 5 years
- Chatbots - 5 years
Availability
Preferred Environment
Jupyter, Python, MacOS, Linux, Git, Docker, LangChain, PyTorch
The most amazing...
...thing I've developed was an AI agent that learned to play text-based games and won the Microsoft TextWorld competition.
Work Experience
AI Developer
Law Offices of David A. Bressman, CO. LPA
- Designed and developed an AI assistant to support legal scenarios, including back-end algorithms and front-end user interface.
- Fine-tuned a language vision model to extract diagrams from accident reports.
- Developed algorithms for workflows related to handling medical reports and medical expenses in legal scenarios.
AI Architect
Solvent.Life Ltd
- Designed and developed an LLM pipeline for an AI assistant in financial markets.
- Implemented advanced charting for financial market data, integrated with the AI assistant.
- Defined the architecture for the application and cloud deployment in Google.
LLM and Generative AI Expert
Influencer Marketing Company
- Designed an AI content generation system tailored for the influencer content and marketing scenario.
- Implemented automatic optimization of LLM prompts to maximize content quality.
- Developed and deployed a UI to streamline human interaction with the AI system.
AI/ML Engineer (via Toptal)
Mirzacles
- Fine-tuned several models with client data using QLoRA (Llama 2 70B, Mistral 7B, Mixtral 8x7B, and Yi 34B).
- Developed SFT instructions and trained a judge model for quality evaluation.
- Developed a Gradio prototype app for model testing and evaluation.
- Deployed models with vLLM and FastAPI. Optimized model deployment with Activation-aware Weight Quantization (AWQ).
Machine Learning Engineer
Hum Nutrition Inc
- Developed a conversational AI agent for data analysis from text descriptions. The application has access to information on the internal databases and data structures for retrieval augmented generation and is able to generate and execute SQL queries.
- Developed an AI agent for customer support, using a generative LLM, retrieval augmented generation, and a sequential chain of prompts that implements the theory-of-mind algorithm.
- Deployed the AI agents in Google Cloud and Vercel platforms.
NLP and Machine Learning Developer (Freelance)
Talkmap (formerly Discourse.ai)
- Developed machine learning models for natural language understanding, concept discovery, incremental discovery, and conversational analytics.
- Implemented models for dialog flow identification and classification.
- Built unsupervised models for intent discovery and natural language generation.
- Designed and built the AI platform for incremental discovery of new customer intents in call center conversation data.
Freelance Chatbot Developer
Bigger Brains
- Developed a natural language understanding model for an educational chatbot.
- Integrated the chatbot with Slack, Facebook Messenger, and Microsoft Teams platforms.
- Implemented the chatbot dialog engine using a Rasa machine learning system.
NLP/ML Developer
Cargo Chief
- Developed a system for data extraction from natural language sources.
- Implemented both rule-based extractions and machine learning for natural language processing.
- Developed tools for data curation and data preprocessing.
Freelance Data Scientist
HumNutrition
- Implemented a product recommendation system that was also integrated with the webshop.
- Built a predictive model for the churning of subscription customers.
- Performed a clustering analysis of the customer space to extract insights for marketing.
- Developed REST web services to integrate the models into the main business application.
Freelance Data Analyst
Nespresso
- Worked on the blueprint design and implementation of a new retail information system with responsibilities on customer service and supply chain flows.
- Developed web services for eCommerce integration with a supply chain covering stock, availability, and sales documents.
- Implemented an outlier detection system for alerts on customer service data entry.
- Developed an extensive set of automated unit tests for end-to-end supply chain flows.
- Analyzed data log for bottleneck identification and performance improvement.
Freelance Data Analyst and Developer
Syncronic
- Worked as a core developer of a new SaaS platform for sales forecasting.
- Created the general architecture design, forecasting model, back-end development, REST API, and integration with the ERP system.
- Developed a forecasting model by combining ARIMA and ETS with a machine-learning gradient-boosting model.
Data Analyst (Contract)
Novozymes
- Worked on the business blueprint and implementation of a new SAP supply chain solution.
- Designed and developed new production planning heuristics.
- Developed automated end-to-end tests for the business processes.
Web Developer (Contract)
Unilever
- Enhanced the Python back-end of Digpedia.net—a multimedia marketing internal web application.
- Developed, in Django, a system for multimedia management with fine-grained access control.
- Implemented in the Django application a system for temporary access to resources.
Data Analyst | Developer (Contract)
Tetra Pak
- Designed and developed a block-planning extension to the production planning software.
- Collaborated on the design of the new available-to-promise solution and product-allocation solution.
- Designed and implemented enhancements to the SAP CIF interface to support a special VMI scenario.
- Supported and worked on solution enhancements for rollouts in multiple plants.
Data Analyst (Contract)
Sony Mobile
- Implemented a prototype for the available-to-promise system using rules-based functionality for the project proof of concept.
- Designed and implemented an SAP global available-to-promise system.
- Designed and built the technical specifications for enhancements in the APO interface, backorder processing, and special reports.
Data Analyst (Contract)
Nestlé
- Worked on the data analysis for performance improvement and issue resolution.
- Designed and implemented an algorithm to support products with shelf-life constraints in sales stock allocation.
- Enhanced the available-to-promise process in the supply chain management system.
- Designed and developed a simplified user interface for quotas and allocation maintenance. Used dynamic source code generation.
SCM Consultant
SAP
- Acted as a member of the MaxAttention team and was sent on a mission to Sappi paper company in South Africa to identify and propose solutions for the critical issues in the SAP system.
- Designed the detailed plan for implementation of Available to Promise in Philips Consumer Electronics.
- Analyzed and proposed a solution for a timezones issue in ATP scheduling in the SAP system of Clariant.
Data Analyst (Contract)
Philips Consumer Electronics
- Collaborated on the business blueprint design for supply chain quota allocation as a technical expert.
- Implemented a prototype with the algorithm identified in the blueprint.
- Worked on the project SWOT analysis.
Data Analyst (Contract)
Johnson & Johnson Pharmaceutical
- Conducted a data analysis in order to troubleshoot system issues and implement improvements.
- Designed and developed enhancements on the sales forecasting and a demand-planning system.
- Implemented enhancements to the production-planning detailed scheduling system.
Data Analyst | Developer
Sonae Industria
- Worked on a new greenfield SAP implementation with a focus on the supply chain system.
- Analyzed data in order to achieve performance and stability improvement.
- Developed a real-time asynchronous interface for a SAP integration with shop-floor control.
- Implemented a web-based performance-monitoring dashboard.
Experience
First Prize Winner of Microsoft Research TextWorld AI Competition
https://www.microsoft.com/en-us/research/blog/first-textworld-problems-the-competition-using-text-based-games-to-advance-capabilities-of-ai-agents/Kaggle Master
https://www.kaggle.com/pvlimaSales Forecasting Platform
Production Planning Heuristic
Outlier Detection System
Game with Generative AI
https://museumof.ai/collection/#bot-poets-14675US Patent | US11847422B2
Education
PhD Degree in Machine learning applied to process engineering
Coimbra University - Coimbra, Portugal
Master's Degree in Chemical Engineering
Coimbra University - Coimbra, Portugal
Skills
Libraries/APIs
PyTorch, XGBoost, Scikit-learn, Pandas, Matplotlib, Beautiful Soup, TensorFlow, Keras, SciPy, NumPy, Gradio, SpaCy, jQuery, React, HTMX
Tools
Rasa.ai, Apache Airflow, Jupyter, GitHub, Git, ChatGPT, Haystack, Sublime Text, GitLab CI/CD, Tableau, PyCharm, IBM Watson, Pytest, Yahoo! Finance
Languages
Python, SQL, ABAP, Java, R, JavaScript, TypeScript, HTML, CSS
Frameworks
LightGBM, Django, Scrapy, Flask
Paradigms
ETL, Continuous Integration (CI), Continuous Deployment, Test-driven Development (TDD), REST, Scrum
Platforms
Docker, Google App Engine, MacOS, Linux, Google Cloud Platform (GCP), Kubernetes, Oracle, Apache Kafka, AWS Lambda, Amazon Web Services (AWS), Vercel, Azure, Blockchain
Storage
Data Pipelines, PostgreSQL, MySQL, ClickHouse, Elasticsearch, PostGIS, MongoDB, Data Lakes, Google Cloud
Other
Data Analytics, Chatbots, Artificial Intelligence (AI), Data Analysis, Statistical Data Analysis, Natural Language Processing (NLP), Data Engineering, Data Science, Machine Learning, SAP, Natural Language Understanding (NLU), Generative Pre-trained Transformer 3 (GPT-3), Generative Pre-trained Transformers (GPT), Text Generation, FastAPI, Language Models, Machine Learning Operations (MLOps), Large Language Models (LLMs), Analytics, Data Scientist, Predictive Analytics, Data Modeling, Generative Artificial Intelligence (GenAI), Clustering, Image Processing, Serverless, Recommendation Systems, lxml, Web Scraping, Deep Learning, Scientific Computing, Computational Linguistics, OpenAI, LangChain, Technical Leadership, Software Architecture, Full-stack, Llama 2, OpenAI GPT-4 API, Supervised Learning, Unsupervised Learning, Retrieval-augmented Generation (RAG), Supervised Machine Learning, Data Mining, Web Development, Fraud Prevention, Software as a Service (SaaS), Algorithms, Statistics, A/B Testing, Statistical Modeling, Outlier Detection, Cloud, Hierarchical Clustering, DBSCAN, Forecasting, Time Series, Big Data, Maps, Optical Character Recognition (OCR), Data Warehouse Design, Google BigQuery, Computer Vision, Games, Optimization, AIOps, Supabase, Product Management, Web3, Architecture, Data Visualization, Dashboards, Finance, Model Tuning, Search, Fine-tuning
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring