
Laurence Cullen
Verified Expert in Engineering
Machine Learning Developer
Cambridge, United Kingdom
Toptal member since April 10, 2019
Laurence is a machine learning engineer and founder of venture-backed startup Vanellus, developing accelerated physics simulations. Laurence combines extensive experience executing and deploying R&D heavy machine learning projects with six years of industrial software engineering. His machine learning experience ranges from LLMs and computer vision to audio synthesis and cutting-edge approaches to accelerate numerical programming problems.
Portfolio
Experience
- Python - 10 years
- Docker - 5 years
- Machine Learning - 5 years
- JAX - 3 years
- FastAPI - 3 years
- Go - 2 years
- Rust - 2 years
- Venture Capital - 1 year
Preferred Environment
Machine Learning, Python, Go, TensorFlow, Keras, Artificial Intelligence (AI), Natural Language Processing (NLP), Computer Vision, JAX, Rust
The most amazing...
...models I've trained run physics simulations faster, help victims of abuse in the Indian Armed Forces, and translate ancient Sumerian into English.
Work Experience
Founder
Vanellus
- Developed a CAE electronics cooling simulation application and physics solver from scratch. Featuring 3D graphics, loading PCB and CAD files, mesh generation, and solving using an internally developed GPU-accelerated CFD engine.
- Hired and trained academic mathematicians into industry-grade Python and Rust software engineers.
- Raised £700.000 of venture capital funding to develop my startup.
- Won a £200,000 grant from the UK R&D funding agency to develop an automatically differentiable fluid simulation engine.
Machine Learning Engineer
Self-employed
- Fine-tuned cutting-edge LLM on a corpus of legal documents to develop a classification model for legal documents for a successful legal tech company.
- Developed a neural singing system implementing the paper Arxiv.org/abs/1704.03809 in Keras.
- Created a structured chatbot to help users automatically navigate questions about NDAs.
- Refactored the UI of weather tech startup to create a better looking and more maintainable codebase.
Machine Learning Engineer
Sensity
- Assembled from scratch a video scraping (Python and youtube-dl), labeling (Vue), and dataset building pipeline for deep fake videos found in the wild.
- Trained CNN models in PyTorch to accurately identify deep fake videos.
- Built the company's MVP comprising a React web app deployed with Firebase and dependent on several async detector services able to identify a variety of media manipulations.
- Productionized the deep fake detection model combining a preprocessing pipeline coordinated with a Pub/Sub architecture and deployed on GCP.
- Built a customer-exposed API in Go using the Go kit toolkit to allow external customers to interface with our detector systems.
- Contributed to the majority of features and infrastructure, leading to the company's successful seed round.
- Developed a named entity recognition pipeline using spaCy to extract the names of people mentioned in video titles.
Data Analyst and Engineer
Owlstone Medical
- Reduced the run time of test suites from 30 minutes to 5 minutes.
- Developed statistical and machine learning methods for detecting and classifying chemical weapons using Keras and Python.
- Audited core Python libraries' performance and drove large-scale improvements using high-performance libraries like Numba.
- Performed data exploration and analysis of the discriminatory power of company spectrometers in telling chemicals apart.
Software Engineer
Fetch.ai
- Integrated LiDAR and high-precision GPS sensors onto a drone flight platform using Arduino and Python.
- Built a simulation environment in Python to test drone navigation strategies.
- Processed imagery collected during drone surveys of agricultural land to provide actionable insights for farmers.
- Prototyped novel cryptocurrency architectures and tested performance under variable network conditions.
Experience
Sumerian to English Translation System
https://github.com/Laurence-Cullen/cuneiformThis involved creating a tokenizing model from the text using Google's SentencePiece tool and building word and word fragment embeddings from the entire corpus to use as a pre-trained embedding layer for the translation model with Facebook's FastText encoding tool.
Finally, I built a sequence-to-sequence model using an encoder-decoder LSTM neural network in Keras and trained it on the fraction of the Sumerian corpus for which translations exist. Some of the better experiments have achieved BLEU scores of 12.8, giving pretty sensible results for a good fraction of untranslated sentences.
Medical Ontology Alignment
https://github.com/Laurence-Cullen/ontology_alignmentI used the recently released Google sentence embedding model BERT to build embeddings of SNOMED and HPO terms and matched them based on cosine similarity. The results were surprisingly effective, and in most cases, the system could correctly translate a term from one ontology to another.
Bellingcat Tech Fellowship
https://github.com/Laurence-Cullen/bellingcat-hackathonI developed a tool to make the army tribunal documents published by the Indian government searchable with keyword search, semantic search, and reverse document search. This was achieved by combining web scraping of tribunal documents with the embedding of these documents using OpenAI API calls and a vector database to calculate distances between document embeddings. I deployed it into a web application with Google Cloud Run as a FastAPI server wrapped in a Docker container.
Education
Bachelor of Science Degree in Astrophysics
University of Exeter - Exeter, UK
Certifications
Improving Deep Neural Networks: Hyperparameter Tuning, Regularization and Optimization
Coursera
Structuring Machine Learning Projects
Coursera
Neural Networks and Deep Learning
Coursera
Machine Learning
Coursera
Skills
Libraries/APIs
OpenCV, JAX, REST APIs, Keras, TensorFlow, Vue, Beautiful Soup
Tools
ChatGPT
Languages
Python, JavaScript, Go, Rust, C
Platforms
Unix, Jupyter Notebook, Linux, Docker, Google Cloud Platform (GCP), Amazon Web Services (AWS)
Other
Artificial Intelligence (AI), Machine Learning, Data Science, Cloud Services, Large Language Models (LLMs), Convolutional Neural Networks (CNNs), Natural Language Processing (NLP), Deep Learning, Generative Pre-trained Transformers (GPT), Computational Fluid Dynamics (CFD), Software Architecture, APIs, WebAssembly (Wasm), Containers, Physics, Neural Networks, LSTM Networks, OpenAI GPT-4 API, Computer Vision, Numerical Analysis, Venture Capital, FastAPI, Technical Leadership, Chatbots
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring