Lucas Sinclair, Developer in Geneva, Switzerland

Lucas Sinclair

Verified Expert in Engineering

Data Visualization Developer

Location
Geneva, Switzerland
Toptal Member Since
August 14, 2018

Lucas has a master's degree from the Swiss Federal Institute of Technology in Lausanne (EPFL) and a Ph.D. in bioinformatics from Uppsala University. He specializes in data analysis, mining, and visualization, as well as GUIs and teaching; he often, though not always, works with biological data. With eight years of experience in the field, he automates tasks and builds custom data processing pipelines. Give Lucas any kind of IT problem and he'll solve it.

Availability

Part-time

Preferred Environment

LaTeX, MacOS, Command-line Interface (CLI), Git, PyCharm, Sublime Text

The most amazing...

...and longest-running project I’ve built was an app that would automate the analysis of high-throughput DNA sequencing data from ecological samples.

Work Experience

Application Architect

2019 - PRESENT
The European Commission's Science Hub (JRC@Ispra)
  • Designed and developed an automatic data processing pipeline to simulate the evolution and growth of all forest areas in 26 European countries from 2015 up to 2050.
  • Predicted the state of the forested areas in the coming decades based on the current forest inventory (surface, species, age), as well as projections of demand for wood products from the market.
  • Developed predictions according to several scenarios that are aimed at helping decision-makers in Brussels to establish policies.
Technologies: React, JavaScript, Git, Amazon EC2, R, Python

Freelance Consultant in Data Science, Biology, Education, and IT

2016 - 2018
Sinclair.Bio
  • Consulted on various projects for different partners across Europe.
  • Developed online platforms.
  • Built custom pipelines.
  • Conducted ecological analyses of microbial communities.
  • Worked on other projects that are under NDAs.
Technologies: System Administration, Azure Machine Learning, Flask, Django, LaTeX, Git, C, Python

Post-doctoral Fellow

2017 - 2017
University of Geneva
  • Joined the Laboratory of Molecular Systematics & Environmental Genomics.
  • Performed ecological studies on microbial communities using high-throughput DNA sequencing.
  • Wrote scientific articles for publication.
  • Collaborated with overseas scientific partners.
Technologies: PyCharm, Microsoft Excel, Git, LaTeX, Python

Lead Teacher for Graduate Python Course

2016 - 2016
Uppsala University
  • Designed and obtained funding for a Python course aimed at data analysis in STEM fields.
  • Taught up to 20 PhD students at a time.
  • Taught theory, provided exercises, and graded personal projects during the course.
Technologies: Big Data, Git, Python

Scientific Assistant in the Bioinformatics Core Facility

2010 - 2012
École Polytechnique Fédérale de Lausanne
  • Collaborated as a consultant with different laboratories within the university on research projects.
  • As part of a team, helped develop and apply novel tools for processing and analyzing high-throughput genomic sequencing data, tailored to each laboratory's scientific questions.
  • Developed longer-lasting software and online platforms for our scientific partners to use in their research. One major project was an interactive genome browser.
Technologies: User Experience (UX), Subversion (SVN), Git, Ruby, JavaScript, Python

Linking Sequences to Environments Through Text Mining

https://peerj.com/articles/2690/
A first-author publication in which we analyzed high-throughput sequencing data in novel ways, in particular with text mining.

We performed similarity searches of short DNA sequences against publicly available nucleotide databases and then applied text mining to the metadata associated with every hit.

Classifying the resulting vocabulary with a hierarchical ontology enabled us to better contextualize the results of microbial ecology experiments through comparative statistics and quantitative descriptions.

Python Library for Easy IO with Genomic Tracks

https://github.com/xapple/track
One of the routine operations in medical bioinformatics is to read and write results from "genomic track" files. These files come in a wide variety of formats but all represent either a series of intervals along a one-dimensional genome or a discrete signal varying along the genome.

This library enables easy IO operations in Python in a fashion that is independent of the underlying format.

Python Course for PhDs in STEM Fields

https://sinclair.bio/pdf/python_syllabus.pdf
In this project, I designed, obtained funding for, and led a Python course aimed at teaching data analysis to graduate students at the University of Uppsala.

The course accommodated 20 PhD students at a time (from all the STEM fields) and included theory, exercises, and a personal project.

It focused on writing programs to process, filter, clean, analyze, and visualize scientific data as well as the ability to automate the computational tasks that are used in day-to-day research.

Development of an Online Platform for Data Distribution

In this project, I planned, developed, and deployed a Python web server acting as a public data repository. The results contained in the repository concern microbial community analyses from industrial digesters producing biogas.

Code Sample | Typical Interview Question 1

To avoid uploading 2,000-line projects in this section, I opted to include a simple solution to a problem that I was recently given as an interview question.

The technical product owner asked me to write a function that receives two inputs in a 2D space, a triangle and a point, and answers yes or no as to whether the point lies within the triangle.
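
The problem above can be illustrated with a minimal sketch. This is not the original solution submitted for the interview; it is one common approach, assuming the triangle is given as three (x, y) tuples and that points on an edge count as inside. The names `cross` and `point_in_triangle` are chosen here for illustration only.

```python
from typing import Tuple

Point = Tuple[float, float]
Triangle = Tuple[Point, Point, Point]


def cross(o: Point, a: Point, b: Point) -> float:
    """Z-component of the cross product of vectors o->a and o->b."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def point_in_triangle(triangle: Triangle, point: Point) -> bool:
    """Return True if the point lies inside or on the edge of the triangle.

    The point is inside when it sits on the same side of all three edges,
    i.e. the three cross products never have strictly opposite signs.
    Works for both clockwise and counter-clockwise vertex order.
    Assumption: the triangle is not degenerate (vertices not collinear).
    """
    a, b, c = triangle
    d1 = cross(a, b, point)
    d2 = cross(b, c, point)
    d3 = cross(c, a, point)
    has_negative = d1 < 0 or d2 < 0 or d3 < 0
    has_positive = d1 > 0 or d2 > 0 or d3 > 0
    return not (has_negative and has_positive)


if __name__ == "__main__":
    tri = ((0, 0), (4, 0), (0, 4))
    print(point_in_triangle(tri, (1, 1)))
    print(point_in_triangle(tri, (5, 5)))
```

The sign test avoids divisions and square roots, which keeps it exact for integer coordinates; a stricter version would reject degenerate triangles explicitly.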

Code Sample | Typical Interview Question 2

To avoid uploading 2,000-line projects in this section, I included a simple solution to a typical interview question. These are good for quickly assessing a candidate's coding skills.

This problem, called "Inverse Fizz Buzz," requires the candidate to write a function that finds the shortest sequence of consecutive numbers that produces the required output when run through the fizz_buzz algorithm.
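
A minimal sketch of one way to approach this problem follows. It is not the original code sample; it assumes the usual variant of the puzzle in which plain numbers produce no output, so the required output is a list of words such as ["fizz", "buzz"]. The names `fizz_buzz_word` and `inverse_fizz_buzz` are hypothetical. Because the fizz buzz pattern repeats every 15 numbers, it is enough to try start values from 1 to 15.

```python
from typing import List, Optional, Tuple


def fizz_buzz_word(n: int) -> Optional[str]:
    """Return the fizz buzz word for n, or None for a plain number."""
    if n % 15 == 0:
        return "fizzbuzz"
    if n % 3 == 0:
        return "fizz"
    if n % 5 == 0:
        return "buzz"
    return None


def inverse_fizz_buzz(words: List[str]) -> Optional[Tuple[int, int]]:
    """Find the shortest range (start, end) whose words equal `words`.

    Returns None when no range of consecutive numbers produces the list.
    Ties are resolved in favor of the smallest start value.
    """
    if not words:
        return None
    best = None
    # The pattern has period 15, so every distinct start lies in 1..15.
    for start in range(1, 16):
        if fizz_buzz_word(start) != words[0]:
            continue
        n, matched = start, 0
        while matched < len(words):
            word = fizz_buzz_word(n)
            if word is not None:
                if word != words[matched]:
                    break  # mismatch: this start cannot work
                matched += 1
            n += 1
        else:
            # Loop ended without a mismatch; n was advanced past the end.
            end = n - 1
            if best is None or end - start < best[1] - best[0]:
                best = (start, end)
    return best


if __name__ == "__main__":
    print(inverse_fizz_buzz(["fizz", "buzz"]))
```

For ["fizz", "buzz"] the sketch prefers the range 9 to 10 over 3 to 5, since the former is shorter.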

Pipeline for Automation of Microbial Community Analyses

https://sinclair.bio/pdf/cluster_report.pdf
In this project, I designed and developed a Python pipeline that automates the cleaning, processing, analysis, and visualization of the data obtained from sequencing a series of microbial communities.

The program takes as input N files containing the raw output of the DNA sequencer and outputs a PDF report (example linked above) summarizing the salient characteristics of each sample, along with biodiversity indicators and a comprehensive comparison across samples.

Publication: Bacterial Diversity Along a 2,600-kilometer River

In this publication, I was tasked with performing the required bioinformatics and biostatistics on the ~100 samples sequenced as part of a comprehensive study of the Danube's microbial communities.

My tasks also included producing the various figures that the article needed.

Languages

Bash, Python, LiveCode, R, AppleScript, Ruby, SQL, JavaScript, C, Java

Libraries/APIs

Pandas, NumPy, React, Scikit-learn, Node.js, HDF5

Tools

PyCharm, LaTeX, Biopython, Adobe Illustrator, Sublime Text, Git, Subversion (SVN), Azure Machine Learning, Microsoft Excel, MATLAB, Adobe Photoshop

Platforms

Linux, MacOS, Heroku, Amazon EC2, Windows, Docker

Industry Expertise

Bioinformatics

Other

University Teaching, Data Visualization, Test Case Development, Molecular Biology, Genetics, User Interface (UI), System Administration, Statistical Analysis, Command-line Interface (CLI), Big Data, User Experience (UX), Data Mining, Image Processing, Ecology, Machine Learning, Dash, Functions, Algorithms

Paradigms

Pipeline Programming, Agile Software Development, Functional Programming

Storage

SQLite, PostgreSQL, MongoDB

Frameworks

Django, Flask

2012 - 2016

Doctor of Philosophy (PhD) Degree in Bioinformatics, Genetics, and Aquatic Microbial Ecology

Uppsala University - Uppsala, Sweden

2008 - 2010

Master of Science Degree in Life Sciences and Technologies

École Polytechnique Fédérale de Lausanne - Lausanne, Switzerland

2005 - 2008

Bachelor of Science Degree in Life Sciences and Technologies

École Polytechnique Fédérale de Lausanne - Lausanne, Switzerland

2002 - 2005

Swiss High School Diploma in Specialization: Physics and Applied Mathematics

Collège de Saussure - Geneva, Switzerland

JANUARY 2016 - PRESENT

Academic Teacher Training

Uppsala University
