Laurence Cullen, Developer in Cambridge, United Kingdom
Laurence is currently unavailable

Laurence Cullen

Machine Learning Developer

Cambridge, United Kingdom

Toptal member since April 10, 2019

Bio

Laurence is a machine learning engineer and founder of venture-backed startup Vanellus, developing accelerated physics simulations. Laurence combines extensive experience executing and deploying R&D heavy machine learning projects with six years of industrial software engineering. His machine learning experience ranges from LLMs and computer vision to audio synthesis and cutting-edge approaches to accelerate numerical programming problems.

Portfolio

Vanellus
Machine Learning, Numerical Analysis, Venture Capital, Python, FastAPI...
Self-employed
Artificial Intelligence (AI), Machine Learning, Python...
Sensity
Vue, JavaScript, Python, Go, Beautiful Soup, Venture Capital, Computer Vision...

Experience

  • Python - 10 years
  • Docker - 5 years
  • Machine Learning - 5 years
  • JAX - 3 years
  • FastAPI - 3 years
  • Go - 2 years
  • Rust - 2 years
  • Venture Capital - 1 year

Preferred Environment

Machine Learning, Python, Go, TensorFlow, Keras, Artificial Intelligence (AI), Natural Language Processing (NLP), Computer Vision, JAX, Rust

The most amazing...

...models I've trained run physics simulations faster, help victims of abuse in the Indian Armed Forces, and translate ancient Sumerian into English.

Work Experience

Founder

2022 - PRESENT
Vanellus
  • Developed a CAE electronics cooling simulation application and physics solver from scratch. Featuring 3D graphics, loading PCB and CAD files, mesh generation, and solving using an internally developed GPU-accelerated CFD engine.
  • Hired and trained academic mathematicians into industry-grade Python and Rust software engineers.
  • Raised £700.000 of venture capital funding to develop my startup.
  • Won a £200,000 grant from the UK R&D funding agency to develop an automatically differentiable fluid simulation engine.
Technologies: Machine Learning, Numerical Analysis, Venture Capital, Python, FastAPI, Computational Fluid Dynamics (CFD), Google Cloud Platform (GCP), Physics, Linux, Docker, Software Architecture, Technical Leadership, Data Science, APIs, Rust, JAX, WebAssembly (Wasm), REST APIs, Containers, Cloud Services, Large Language Models (LLMs)

Machine Learning Engineer

2020 - 2023
Self-employed
  • Fine-tuned cutting-edge LLM on a corpus of legal documents to develop a classification model for legal documents for a successful legal tech company.
  • Developed a neural singing system implementing the paper Arxiv.org/abs/1704.03809 in Keras.
  • Created a structured chatbot to help users automatically navigate questions about NDAs.
  • Refactored the UI of weather tech startup to create a better looking and more maintainable codebase.
Technologies: Artificial Intelligence (AI), Machine Learning, Python, Google Cloud Platform (GCP), FastAPI, Computer Vision, Linux, OpenCV, Amazon Web Services (AWS), Generative Pre-trained Transformers (GPT), Software Architecture, Chatbots, Data Science, APIs, Unix, Convolutional Neural Networks (CNNs), LSTM Networks, REST APIs, Containers, Cloud Services, Large Language Models (LLMs), Jupyter Notebook

Machine Learning Engineer

2019 - 2020
Sensity
  • Assembled from scratch a video scraping (Python and youtube-dl), labeling (Vue), and dataset building pipeline for deep fake videos found in the wild.
  • Trained CNN models in PyTorch to accurately identify deep fake videos.
  • Built the company's MVP comprising a React web app deployed with Firebase and dependent on several async detector services able to identify a variety of media manipulations.
  • Productionized the deep fake detection model combining a preprocessing pipeline coordinated with a Pub/Sub architecture and deployed on GCP.
  • Built a customer-exposed API in Go using the Go kit toolkit to allow external customers to interface with our detector systems.
  • Contributed to the majority of features and infrastructure, leading to the company's successful seed round.
  • Developed a named entity recognition pipeline using spaCy to extract the names of people mentioned in video titles.
Technologies: Vue, JavaScript, Python, Go, Beautiful Soup, Venture Capital, Computer Vision, Linux, Docker, Google Cloud Platform (GCP), OpenCV, Machine Learning, Amazon Web Services (AWS), Software Architecture, Technical Leadership, Data Science, APIs, Convolutional Neural Networks (CNNs), REST APIs, Containers, Cloud Services, Large Language Models (LLMs), Jupyter Notebook

Data Analyst and Engineer

2018 - 2018
Owlstone Medical
  • Reduced the run time of test suites from 30 minutes to 5 minutes.
  • Developed statistical and machine learning methods for detecting and classifying chemical weapons using Keras and Python.
  • Audited core Python libraries' performance and drove large-scale improvements using high-performance libraries like Numba.
  • Performed data exploration and analysis of the discriminatory power of company spectrometers in telling chemicals apart.
Technologies: Keras, Python, Docker, Machine Learning, Amazon Web Services (AWS), Data Science, Convolutional Neural Networks (CNNs), Jupyter Notebook

Software Engineer

2016 - 2018
Fetch.ai
  • Integrated LiDAR and high-precision GPS sensors onto a drone flight platform using Arduino and Python.
  • Built a simulation environment in Python to test drone navigation strategies.
  • Processed imagery collected during drone surveys of agricultural land to provide actionable insights for farmers.
  • Prototyped novel cryptocurrency architectures and tested performance under variable network conditions.
Technologies: Python, C

Experience

Sumerian to English Translation System

https://github.com/Laurence-Cullen/cuneiform
Built an entire NLP pipeline to translate raw transliterated cuneiform script from the global archive of cuneiform writings, focusing initially on the Sumerian language.

This involved creating a tokenizing model from the text using Google's SentencePiece tool and building word and word fragment embeddings from the entire corpus to use as a pre-trained embedding layer for the translation model with Facebook's FastText encoding tool.

Finally, I built a sequence-to-sequence model using an encoder-decoder LSTM neural network in Keras and trained it on the fraction of the Sumerian corpus for which translations exist. Some of the better experiments have achieved BLEU scores of 12.8, giving pretty sensible results for a good fraction of untranslated sentences.

Medical Ontology Alignment

https://github.com/Laurence-Cullen/ontology_alignment
I undertook experiments aligning the SNOMED and HPO medical ontologies using sentence embeddings.

I used the recently released Google sentence embedding model BERT to build embeddings of SNOMED and HPO terms and matched them based on cosine similarity. The results were surprisingly effective, and in most cases, the system could correctly translate a term from one ontology to another.

Bellingcat Tech Fellowship

https://github.com/Laurence-Cullen/bellingcat-hackathon
Won a £4,000 tech fellowship with open-source research organization Bellingcat by winning their hackathon.

I developed a tool to make the army tribunal documents published by the Indian government searchable with keyword search, semantic search, and reverse document search. This was achieved by combining web scraping of tribunal documents with the embedding of these documents using OpenAI API calls and a vector database to calculate distances between document embeddings. I deployed it into a web application with Google Cloud Run as a FastAPI server wrapped in a Docker container.

Education

2012 - 2015

Bachelor of Science Degree in Astrophysics

University of Exeter - Exeter, UK

Certifications

NOVEMBER 2018 - PRESENT

Improving Deep Neural Networks: Hyperparameter Tuning, Regularization and Optimization

Coursera

NOVEMBER 2018 - PRESENT

Structuring Machine Learning Projects

Coursera

MAY 2018 - PRESENT

Neural Networks and Deep Learning

Coursera

FEBRUARY 2018 - PRESENT

Machine Learning

Coursera

Skills

Libraries/APIs

OpenCV, JAX, REST APIs, Keras, TensorFlow, Vue, Beautiful Soup

Tools

ChatGPT

Languages

Python, JavaScript, Go, Rust, C

Platforms

Unix, Jupyter Notebook, Linux, Docker, Google Cloud Platform (GCP), Amazon Web Services (AWS)

Other

Artificial Intelligence (AI), Machine Learning, Data Science, Cloud Services, Large Language Models (LLMs), Convolutional Neural Networks (CNNs), Natural Language Processing (NLP), Deep Learning, Generative Pre-trained Transformers (GPT), Computational Fluid Dynamics (CFD), Software Architecture, APIs, WebAssembly (Wasm), Containers, Physics, Neural Networks, LSTM Networks, OpenAI GPT-4 API, Computer Vision, Numerical Analysis, Venture Capital, FastAPI, Technical Leadership, Chatbots

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring