Abhijeet A Mulgund
Verified Expert in Engineering
Machine Learning Developer
Houston, TX, United States
Toptal member since February 1, 2019
Abhijeet is a data scientist and engineer with 3 years of experience working for companies of all sizes, from Google to startups. He specializes in machine learning and deep learning for natural language processing (NLP) and computer vision (CV). In addition, he is familiar with many data processing libraries. Abhijeet takes pride in his clean and maintainable Python code and has also rapidly picked up other languages, including C++, Java, and JavaScript.
Preferred Environment
Python, Git, Linux, Visual Studio Code (VS Code), Integrated Circuits, Circuit Design, Verilog HDL
The most amazing...
...project I've built was a toxicity classifier for Wikipedia forum comments. I designed an ensemble of over 100 models, scoring 0.99 ROC AUC and 99% accuracy.
Work Experience
Software Engineering Intern
Facebook
- Designed and implemented a complex algorithm using FastText and deep learning to detect over 1,000 platforms and technologies used by over 5 million small and medium-sized business websites.
- Processed terabytes of website data using scalable and parallelized unsupervised machine learning algorithms.
- Enabled Facebook to target previously unknown, but popular, platforms and technologies for lucrative ads partnership integrations.
- Used the HDBSCAN clustering algorithm to accurately and efficiently cluster website source keywords that signified newly detected platforms and technologies.
- Built an internal web app in PHP/Hack for Facebook employees to visualize newly detected platforms and technologies along with the websites using these platforms and technologies.
Data Science Intern
CS Disco
- Prototyped state-of-the-art attentional deep learning architectures for legal NLP to help more than 400 law firms.
- Studied and tested deep convolutional and recurrent models on the IMDb Sentiment Classification problem.
- Deployed a Hierarchical Attention Network to classify documents in legal discovery by category and attributes.
- Researched novel data augmentation techniques for natural language data to achieve 92% accuracy (state-of-the-art) on the IMDb Sentiment Classification problem.
- Authored a paper detailing my novel embedding-driven data augmentation technique for natural language data.
Software Engineering Intern
Google
- Developed a MapReduce pipeline to run simulations with Google’s Dynamic Search Ads product in C++ over thousands of nodes.
- Enabled Google’s Dynamic Search Ads to grow and improve through simulations with projected revenue growth of millions.
- Studied TensorFlow under the developers of the library in special classes offered at Google.
- Worked in a complex codebase of nearly 2 billion lines of code without breaking any other functionality of Google Dynamic Search Ads.
Experience
Kaggle Toxic Comment Classification
https://github.com/abhmul/toxic-comments
PyJet
https://github.com/abhmul/PyJet
Deep Learning Chess AI
https://github.com/abhmul/DeepJetChess
Kaggle Leaf Classification Competition
https://www.kaggle.com/abhmul/keras-convnet-lb-0-0052-w-visualization
Char-Word2Vec
https://github.com/abhmul/576FinalProject
Switchboard Binary Neural Network Research
Education
Bachelor of Arts Degree in Mathematics
Rice University - Texas
Bachelor of Arts Degree in Computer Science
Rice University - Texas
Skills
Libraries/APIs
PyTorch, Keras, NumPy, TensorFlow, Pandas, SciPy, Scikit-learn, React
Tools
Git
Languages
Python 3, Verilog HDL, Python 2, Python, Hack, CSS, Java, C, C++, JavaScript, SQL, PHP, HTML
Platforms
Linux, Visual Studio Code (VS Code), NVIDIA CUDA
Paradigms
Functional Programming, Object-oriented Programming (OOP), MapReduce, Agile Software Development
Frameworks
Spark, Presto
Storage
Apache Hive
Other
Machine Learning, Data Science, Deep Learning, Natural Language Processing (NLP), Mathematics, Hackathons, Software Development, Integrated Circuits, Circuit Design, Generative Pre-trained Transformers (GPT), Computer Vision, Deep Reinforcement Learning, Reinforcement Learning, Number Theory, Discrete Mathematics, fastText, HHVM