Siddharth Deshpande
Verified Expert in Engineering
Data Scientist and Developer
Cambridge, United Kingdom
Toptal member since June 27, 2022
Siddharth is an interdisciplinary researcher with unique perspectives derived from translational projects and his combined educational background in materials engineering, biochemistry, healthcare, natural language processing (NLP), and data science. He has extensive experience working with structured and unstructured biological data and using state-of-the-art AI techniques to solve complex healthcare problems.
Preferred Environment
Generative Pre-trained Transformers (GPT), Natural Language Processing (NLP), Biomedical Skills, Machine Learning, Language Models, Unstructured Data Analysis, Data Visualization, Artificial Intelligence (AI), Biochemistry, Amazon Web Services (AWS), Python
The most amazing...
...thing I've developed is an NLP framework that extracts and visualizes biomedical entities from documents as network graphs to discover new biomedical relations.
Work Experience
Chief Technological Officer (Interim)
Immersely
- Worked for Immersely, which was building a platform that lets game developers create hyper-personalized games that adapt in real time to player emotion, boosting engagement and commercial success.
- Took charge of developing ML models that use physiological signals to detect a player's emotions during gameplay, enabling an interactive gaming experience.
- Tasked with developing a technical roadmap and back-end tech infrastructure for the company.
Deep Tech Venture Builder
Post Urban Ventures
- Validated technological feasibility of new startup ideas before funding, built technical prototypes (MVP) for pre-seed and seed round investor pitches, and supported early-stage startups with essential technical infrastructure.
- Worked as an interim CTO of four startups and as a technical advisor for two startups within Post Urban Ventures.
- Contributed to successfully securing £5 million in grant funding for startups.
- Involved in preparing technical pitch decks, offered expert advice and guidance, and helped promote startup success. Designed technical roadmaps for scaling startups after pre-seed and seed rounds.
Senior AI/ML and NLP Chatbot Developer
Richmond Ayirebide
- Developed an accounting chatbot based on the client's requirements using ChatGPT, fine-tuned GPT-3, and Telegram.
- Streamlined the preprocessing and postprocessing to format results into easy-to-view Excel sheets for the client.
- Helped set up a plan for the future deployment of the chatbot into the cloud infrastructure.
Chief Technological Officer (Interim)
Bioleap
- Brought on board to develop the technical framework for Bioleap, a startup focused on developing AI-based single-cell models.
- Managed the building of cloud capabilities in AWS, hired a competent technical team, and improved the current mechanistic models.
- Established several strategic technological partnerships with leading bio-modeling labs. Built a cloud-based automation strategy for Bioleap models.
- Established a technology strategy (tech stack), technical roadmap, and business plan to support the growth strategy.
NLP Data Scientist
Evaluate Ltd
- Developed a press-release classifier that categorizes news articles into 40 technology classes, saving the company around £30,000 per year in third-party API licenses.
- Identified digital health innovations from clinical trials, news articles, and deal documents for a custom analytics project that reduced workforce hours of manual document classification for a Japanese client.
- Created a core NLP framework to extract biomedical entities from unstructured texts and visualize them as a graphical network; the framework became popular for discovering new biomedical relations and was subsequently used in many Evaluate products.
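As a rough illustration of the entity-network idea in the last bullet, the sketch below links entities that appear in the same sentence and weights each edge by how often that happens. It is not the Evaluate framework: the dictionary lookup stands in for a trained NER model, and the entity list, sample sentences, and function names are all hypothetical.

```python
from collections import defaultdict
from itertools import combinations
import re

# Hypothetical entity dictionary; a real pipeline would use a trained NER model.
ENTITY_TYPES = {
    "remdesivir": "DRUG",
    "dexamethasone": "DRUG",
    "covid-19": "DISEASE",
    "ace2": "GENE",
}

# Hypothetical sample sentences, for illustration only.
SAMPLE_SENTENCES = [
    "Remdesivir was trialled in COVID-19 patients.",
    "Dexamethasone reduced mortality in severe COVID-19.",
    "ACE2 mediates viral entry in COVID-19.",
    "Remdesivir and dexamethasone were compared in COVID-19.",
]


def extract_entities(sentence):
    """Return the sorted, de-duplicated known entities found in a sentence."""
    tokens = re.findall(r"[a-z0-9]+(?:-[a-z0-9]+)*", sentence.lower())
    return sorted({token for token in tokens if token in ENTITY_TYPES})


def build_graph(sentences):
    """Build a weighted, undirected co-occurrence graph.

    Keys are (entity_a, entity_b) pairs in alphabetical order; each value is
    the number of sentences in which both entities appear.
    """
    edges = defaultdict(int)
    for sentence in sentences:
        for a, b in combinations(extract_entities(sentence), 2):
            edges[(a, b)] += 1
    return dict(edges)


if __name__ == "__main__":
    for (a, b), weight in sorted(build_graph(SAMPLE_SENTENCES).items()):
        print(f"{a} ({ENTITY_TYPES[a]}) -- {b} ({ENTITY_TYPES[b]}): {weight}")
```

The resulting edge list can be passed to any graph-drawing library for visualization; pairs that co-occur often but are not yet linked in curated databases are candidates for closer review.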
Data Scientist
Patsnap
- Developed PatSnap Bio, a core product that is one of the largest sequence searching platforms and is actively being used by large pharmaceutical companies.
- Created PatSnap Materials, another core product currently in beta testing in China.
- Engaged actively in the product development and client feedback process for PatSnap Bio and PatSnap Materials.
- Filed five patent applications involving my technology.
Experience
COVID-19 Scientific Journals Analysis
https://github.com/siddharth0112358/coronavirus_19
Research papers available on GitHub:
• AutoDetect_COVID_FakeNews - Classification model for detecting fake news about COVID
• BERT_semantic_search - Semantic search that finds sentences in the COVID corpus similar to a query question
• Biorelated_sentence_extraction_COVID - Extraction of bio-related sentences from the COVID corpus
• COVID_19_topic_modelling_Top2Vec - Topic modelling on the COVID_19 corpus using Top2Vec
• COVID_explore_drugs - Exploration of drugs mentioned in the COVID corpus
• CoVID19_Ques_and_Ans - Question-answering system for COVID papers based on doc2vec
• CoVID_19_NER_text_summarization_and_topic_modelling - BART summarization, LDA topic modelling, and NER
• Covid_19_genome_analysis - COVID_19 genome analysis
• Covid_paper_rank_display - NER and topic-based retrieval of COVID papers
• Medical_NER_Corona - NER on a coronavirus dataset
• Mining_COVID_keywords - Keyword mining using bigrams and trigrams
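As a rough illustration of the bigram and trigram keyword mining mentioned in the last item, the sketch below counts n-grams across a few sentences and keeps the most frequent ones. It is not taken from the repository; the stop-word list, sample sentences, and function names are hypothetical and chosen only for the example.

```python
from collections import Counter
import re

# Hypothetical stop-word list; a real project would use a fuller one.
STOP_WORDS = {"the", "of", "in", "a", "an", "and", "to", "is", "for", "with", "on"}

# Hypothetical sample sentences, for illustration only.
SAMPLE_DOCS = [
    "Spike protein binding drives viral entry.",
    "The spike protein is a vaccine target.",
    "Antibodies block spike protein binding.",
]


def tokenize(text):
    """Lowercase the text and split it into alphanumeric (optionally hyphenated) tokens."""
    return re.findall(r"[a-z0-9]+(?:-[a-z0-9]+)*", text.lower())


def ngrams(tokens, n):
    """Return every run of n consecutive tokens as a tuple."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def top_keywords(docs, n, k=5):
    """Count n-grams across docs, skipping any that start or end with a stop word."""
    counts = Counter()
    for doc in docs:
        for gram in ngrams(tokenize(doc), n):
            if gram[0] in STOP_WORDS or gram[-1] in STOP_WORDS:
                continue
            counts[" ".join(gram)] += 1
    return counts.most_common(k)


if __name__ == "__main__":
    print(top_keywords(SAMPLE_DOCS, 2, k=2))  # most frequent bigrams
    print(top_keywords(SAMPLE_DOCS, 3, k=1))  # most frequent trigram
```

Filtering n-grams that begin or end with a stop word is one simple way to keep phrase-like candidates; frequency alone is a crude score, and a weighting scheme such as TF-IDF could replace it.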
Alibaba Cloud Global AI Innovation Challenge
The goal of my project was to analyze the effect of weather on energy generation and demand and find a solution that can predict renewable energy generation and energy demand using weather parameters.
SOLUTION HIGHLIGHTS
• Solar, wind, and hydro energy generation prediction using climate and time parameters.
• Energy demand was predicted using time and energy parameters (Model 1) and using time, energy, and climate parameters (Model 2). Model 2 showed only slightly higher accuracy than Model 1, which suggests that climate parameters affect energy demand less than energy parameters do.
• Energy price was predicted using time and energy parameters (Model 1) and using time, energy, and climate parameters (Model 2). Model 2 showed higher accuracy than Model 1, which suggests that climate parameters affect energy prices significantly.
For all the above cases, 10 million regression algorithms were tested. The ExtraTreeRegressor algorithm showed the best performance and was used to build the regression model.
URL: https://www.alibabacloud.com/blog/project-showcase-%7C-effect-of-weather-on-energy-generation-and-demand_598252
Conversational Chatbots
• Conversation helper - This bot simulates tough conversations so that clients can practice them beforehand. The client is scored on 2-3 conversation skills, and a report generated at the end shows their score and how to improve their conversation ability.
• Fashion assistant - This bot recommends fashion items based on client needs and stock inventory of the business. It uses a combination of GPT-3 and DALL-E.
• Google bot - This bot has Google search capability and acts as an advisor or friend you can ask any question; it runs a Google search in the back end to provide the most up-to-date answers.
Bot previews can be shown during interviews.
Education
Doctorate in Medicine
National University of Singapore - Singapore
Master's Degree in Materials Science and Engineering
National University of Singapore - Singapore
Bachelor's Degree in Metallurgy and Material Science
College of Engineering Pune - Pune, India
Certifications
Healthcare NLP for Data Scientists
John Snow Labs
Spark NLP for Data Scientists
John Snow Labs
TensorFlow: Advanced Techniques Specialization
DeepLearning.AI | via Coursera
Deep Learning for Healthcare Specialization
University of Illinois at Urbana-Champaign | via Coursera
Customizing Your Models with TensorFlow 2
Imperial College London | via Coursera
Generative Adversarial Networks (GANs) Specialization
DeepLearning.AI | via Coursera
Deployment of Machine Learning Models
Udemy
Natural Language Processing in Python
DataCamp
Natural Language Processing Specialization
DeepLearning.AI | via Coursera
AI in Healthcare Specialization
Stanford University | via Coursera
Deep Learning Specialization
DeepLearning.AI | via Coursera
Skills
Libraries/APIs
Spark NLP, TensorFlow, PySpark, Spark ML
Tools
Microsoft Excel, SOLIDWORKS
Languages
Python, Python 3
Industry Expertise
Bioinformatics, Healthcare
Storage
JSON
Platforms
Amazon Web Services (AWS)
Other
Natural Language Processing (NLP), Machine Learning, Data Visualization, Biochemistry, Analytics, Biology, Pharmacology, R&D, Engineering, CSV File Processing, Excel Expert, Interactive Charts, Chatbots, Patents, Generative Pre-trained Transformers (GPT), Biomedical Skills, Language Models, Unstructured Data Analysis, Artificial Intelligence (AI), Biomaterial, Composite Materials, Data Science, Deep Learning, Dash, Deep Neural Networks, Convolutional Neural Networks (CNN), Sequence Models, Entrepreneurship, Web Scraping, Time Series Analysis, Computational Biology, Game AI, Emotion Recognition, Chatbot Conversation Design, LangChain, Weaviate, Pinecone, Cell Biology, Materials Science, 3D Printing, Product Development, Model Deployment, Generative Adversarial Networks (GANs), Single-cell Modeling, CTO, Pitch Preparation, Medical Diagnostics, OpenAI, Generative Pre-trained Transformer 3 (GPT-3), Google Custom Search