Founding Engineer
2022 - PRESENTIntramove.ai- Engineered core microservices to process payments, register users, keep track of API credits, and perform text analysis.
- Created a PyPI API client for developers to interact with the server.
- Built a website to demonstrate Intramove.ai, an API product for developers interested in analyzing economic news. The core service is powered by SOTA deep learning models for natural language and has endpoints to analyze headlines and articles.
Technologies: Amazon EC2, Docker, PyTorch, PyPI, Stripe API, MongoDB, FastAPI, Python, GitHub, Amazon Route 53Senior Machine Learning Engineer
2022 - PRESENTRepustate- Developed interpretable, supervised deep learning solutions for text using PyTorch and for implementing and improving research paper algorithms, such as BERT, custom attention layers, and hierarchical attention networks.
- Created interpretable, unsupervised deep learning algorithms that eliminated Repustate's need for tagged data when building custom models, enabling scalable growth and taking the company's core service to the next level.
- Designed and developed a new generation gRPC microservices API, allowing the main application developed in Go to communicate with Python deep learning servers.
- Increased inference speed by 2-3x on average for prediction tasks using different techniques, such as ONNX quantization or serving and chunks batching.
- Reduced the turnaround time for onboarding clients with custom needs from an average of 4 weeks to 1-3 days using the new generation API and ML pipeline.
- Built an in-house multilingual transcription service that replaced Amazon Transcribe, reducing annual expenses by 14x.
- Managed a multi-client portfolio, led technical discussions on sales calls, and aided in landing 10+ clients using the new generation API.
- Devised and developed Amazon S3 schemas for production models and tokenizers.
- Delivered custom deep-learning Docker images and RPM packages to be installed in on-premise servers.
- Architected and developed a custom MLflow tracking server to record and monitor experimentation results and artifacts.
Technologies: Artificial Intelligence (AI), Amazon S3 (AWS S3), Algorithms, Deep Learning, Open Neural Network Exchange (ONNX), Hugging Face, gRPC, Protobuf, Git, Go, Python, PyTorch, Keras, MLflow, Amazon EC2, Docker, NumPy, Scikit-learn, PandasData Scientist
2020 - PRESENTDecathlon- Developed an in-house data-visualization pipeline that replaced a licensed tool saving $60,000 per year. Used SQL, Git, Jenkins, AWS cloud, Google Sheets API, and Google Data Studio.
- Prototyped an NLU solution with customer reviews classification, keyword extraction, and sentiment analysis that outperformed a licensed tool, saving the marketing team $15,000 per year.
- Created a visual search engine that was deployed as a product retrieval API. It's currently being used for product recommendations.
- Built an unsupervised topic modeling solution for customer reviews with visualization, using sentence transformers. Improved original solution using GPT2 and prompt engineering.
- Developed a store turnover forecasting tool using additive models and custom-made regressors (Prophet API).
- Engineered an NLU product-article recommendation solution as part of Decathlon's personalization strategy.
- Worked on data extraction, transformation, and loading tasks for each solution.
- Made a sustainability reporting tool to monitor the performance of second-life and eco-designed products.
- Built a color detection solution using k-means clustering to aid internal object detection models.
- Interviewed new data science candidates, actively contributed to the hiring process, and mentored new interns on various data-related tasks.
Technologies: Python, SQL, Jenkins, Keras, TensorFlow, PyTorch, Scikit-learn, Pandas, NumPy, Flask, Google Data Studio, Redshift, Amazon S3 (AWS S3), Google Cloud Platform (GCP), Jenkins Pipeline, GitHubMachine Learning Intern
2019 - 2019Decathlon- Developed and deployed a deep recommendation model for user-click item prediction using LSTM RNN architecture.
- Surpassed the benchmarked precision, recall, and coverage metrics by improving the solution using attention models.
- Developed and deployed an object detection model using TensorFlow's API for a hockey-brand detection application.
Technologies: Python, Algorithms, Keras, SQL, TensorFlow, Jenkins Pipeline, GitHubMachine Learning Developer Intern
2018 - 2018Societe Generale- Developed a BI reporting tool using MicroStrategy.
- Contributed to data visualization projects using Tableau.
- Helped develop a web application using the Django framework.
Technologies: Tableau, Python, Django, MicroStrategy