
Syed Mohsin Ali
Verified Expert in Engineering
AI Developer
Lahore, Punjab, Pakistan
Toptal member since April 8, 2025
Mohsin is a skilled generative AI and machine learning professional with a proven track record of driving innovation through cutting-edge NLP, GenAI, and ML solutions. With over five years of experience fine-tuning large language models and deploying scalable models, he empowers businesses to unlock new possibilities and stay ahead of the curve. Mohsin excels in cross-functional collaboration, working with teams to implement AI-driven solutions that foster technological advancement.
Portfolio
Experience
- Python - 6 years
- Natural Language Processing (NLP) - 6 years
- BERT - 4 years
- Feature Engineering - 4 years
- Retrieval-augmented Generation (RAG) - 3 years
- Prompt Engineering - 3 years
- Fine-tuning - 3 years
- Large Language Models (LLMs) - 3 years
Availability
Preferred Environment
Visual Studio Code (VS Code), Jupyter, PyCharm
The most amazing...
...achievement has been leading the development of an AI-driven BEC detection system, achieving 99% precision using BERT, RAG, and GenAI.
Work Experience
Senior Machine Learning Engineer
SlashNext
- Spearheaded the design and implementation of advanced email security algorithms leveraging BERT and large language models (LLMs), reaching 99.2% precision in identifying business email compromise (BEC), phishing, and social engineering attacks.
- Built machine learning and statistical models for NLP tasks, including sentiment analysis, named entity recognition (NER), text classification, and feature extraction, and utilized LLMs to enhance training data through augmentation and generation.
- Collaborated closely with DevOps, QA, front-end, and UI/UX teams to deliver the BEC solution as a client-facing product, driving business growth and showcasing expertise in addressing domain-specific challenges.
- Drove industry-wide recognition through strategic adoption of the BEC solution by leading multinational corporations, including NVIDIA, P&G, Kingston, and Forbes; implemented a robust Python-based production pipeline to support deployment at scale.
Experience
RAG-based System for Advanced Malicious Email Analysis
Each email is indexed with its sender address and metadata related to the recipient mailbox, allowing context-rich retrieval. When our custom classifier flags a new malicious email, the system performs a vector-based search to retrieve semantically similar historical samples.
An LLM is then prompted with these retrieved samples to generate a comprehensive analytical summary. The response provides insights such as the recurrence of similar scams, behavioral patterns of the sender (e.g., repeated use of the same email address, tone, or scam structure), and intent detection. This enables the identification of scams that follow a similar template with only minor lexical changes, offering deeper threat intelligence and helping to preempt future attacks.
AI-driven Credential Theft Detection
To ensure high precision, I retrained a BERT model using fine-tuning techniques such as parameter-efficient fine-tuning (PEFT) and low-rank adaptation (LoRA). These approaches enabled efficient model adaptation on resource-constrained hardware without sacrificing performance.
I also implemented custom data augmentation strategies and conducted iterative A/B testing to maximize detection accuracy. The resulting classifier significantly strengthened the email security pipeline, enhancing defenses against credential theft and phishing-based attacks.
Text Normalization and Data Standardization Pipeline
To address specific challenges, I developed customized algorithms to identify and normalize complex data patterns accurately. These included handling special cases, such as copyright symbols, which are often difficult to standardize. By incorporating these tailored solutions, I maintained the integrity of the data across different formats and sources.
This initiative significantly optimized data processing workflows, enhancing data quality and more efficient analysis. It streamlined data handling and facilitated faster generation of actionable insights, driving improvements in decision-making processes.
Education
Master's Degree in Computer Science
University of Management and Technology - Lahore, Pakistan
Bachelor's Degree in Electrical Engineering
University of Engineering and Technology - Lahore, Pakistan
Skills
Libraries/APIs
PyTorch, TensorFlow, LSTM, Natural Language Toolkit (NLTK), SpaCy
Tools
Jupyter, PyCharm
Languages
Python, C++, Verilog, Regex
Platforms
Visual Studio Code (VS Code)
Other
Large Language Models (LLMs), Generative Artificial Intelligence (GenAI), Machine Language, Deep Learning, Retrieval-augmented Generation (RAG), LangChain, Prompt Engineering, Fine-tuning, Natural Language Processing (NLP), Feature Engineering, Neural Networks, Recurrent Neural Networks (RNNs), BERT, LoRa, Data Augmentation, A/B Testing
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring