Nilan Saha
Verified Expert in Engineering
NLP and Machine Learning Developer
Vancouver, BC, Canada
Toptal member since February 17, 2022
Nilan is a natural language processing and machine learning expert with a bachelor's degree in computer science and a master's degree in data science specializing in computational linguistics. Nilan has vast experience building large-scale recommendation systems, personalization technology, and NLP algorithms. A Kaggle expert, he has well-cited publications in this domain.
Portfolio
Experience
- Software Development - 7 years
- Python - 7 years
- Natural Language Processing (NLP) - 5 years
- Machine Learning - 5 years
- Generative Pre-trained Transformers (GPT) - 5 years
- SpaCy - 5 years
- Deep Learning - 5 years
- Data Science - 3 years
Availability
Preferred Environment
MacOS, Slack, Zoom, Visual Studio Code (VS Code), Jupyter Notebook
The most amazing...
...thing I've ever built is Convex, an NLP library for part-of-speech (POS) tagging using character and word-level embedding neural nets.
Work Experience
Machine Learning Engineer
AAQUA
- Built and deployed multiple personalization ML pipelines to power the main user feed for the app. The pipelines were built using Python, Kafka, AWS S3, scikit-learn, and NLTK and deployed to AWS.
- Designed, architected, and built an end-to-end search solution using Python and Elasticsearch. The analytics pipeline that drives data-driven decisions and better search results' relevance ranking was built using Kafka, Amazon Pinpoint, and AWS S3.
- Designed, architected, and built an end-to-end autocomplete solution using Python and Elasticsearch. The analytics pipeline that drives data-driven decisions and relevance ranking of search results was built using Kafka, Amazon Pinpoint, and AWS S3.
Technical Reviewer
Packt Publishing
- Collaborated with the team to test out all code samples and make sure it was easy for users to replicate the projects from the book. The code consisted of neural networks built in PyTorch and various other pre-processing utilities in NLTK.
- Worked with the editing team to review all the book chapters and make necessary corrections, technical and otherwise.
- Suggested various improvements in terms of the book content.
Machine Learning Engineer
Knowt
- Developed a deep neural net with multiple heads using ELMo embeddings to identify phrases that could be used to generate quizzes and achieve other downstream tasks. The model was built using Python, PyTorch, and Flair and deployed to AWS.
- Developed a pipeline using spaCy and Python to extract triplets from textual data, build relations using them, and represent it in the form of a knowledge graph.
- Led initiatives to build a dataset to train models based on implicit user feedback.
Data Scientist
Cookt
- Developed a heuristic algorithm using named-entity recognition (NER), spaCy, and the natural language toolkit (NLTK) to identify cooking ingredients from recipe instruction data.
- Created an algorithm to use identified ingredients to generate optimal cooking instructions to reduce friction for end-users.
- Bundled the model into an API and worked with the in-house tech team to integrate it into the entire stack.
Experience
Convex
https://github.com/nilansaha/convexMulti-label Classifier for Toxic Comments
Categorical Embedding Encoder
https://github.com/nilansaha/CategoricalEmbeddingEncoderEducation
Master's Degree in Data Science and Computational Linguistics
University of British Columbia - Vancouver, Canada
Bachelor's Degree in Computer Science
Institute of Engineering and Management - Kolkata, India
Skills
Libraries/APIs
SpaCy, PyTorch, Scikit-learn, Natural Language Toolkit (NLTK), Keras
Tools
Slack, Zoom
Languages
Python, Go, GraphQL
Frameworks
Flask
Platforms
Software Design Patterns, Apache Kafka, MacOS, Visual Studio Code (VS Code), Jupyter Notebook, Amazon Web Services (AWS)
Storage
Databases, Elasticsearch, Amazon DynamoDB, Neo4j
Other
Natural Language Processing (NLP), Machine Learning, Generative Pre-trained Transformers (GPT), Data Science, Software, Software Development, Computer Science, Deep Learning, Data Structures, Technical Writing, Computational Linguistics
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring