Francesco Strino, Developer in London, United Kingdom
Francesco is available for hire
Hire Francesco

Francesco Strino

Verified Expert  in Engineering

Software and Algorithm Developer

London, United Kingdom

Toptal member since December 3, 2020

Bio

Francesco is a software engineer and scientist specializing in algorithms. He has a Ph.D. in medical biochemistry and a Master's degree in bioinformatics. He has written algorithms to analyze biological, medical, and real forensic data and developed a part of the CLC Genomics Workbench, one of the most popular software for bioinformatics analysis used by tens of thousands of users. He is a fan of BDD and excels in bioinformatics, statistics, high-performance computing, and financial technology.

Portfolio

PCMGF Limited
Python, TensorFlow, Docker, Git, GitHub, Java, Groovy, Version Control...
Department of Health and Social Care, Joint Biosecurity Centre
Python, Biopython, SQL, Version Control, Data Engineering, Genomics...
PrePay Solutions
Microservices, Spring, SDET, Software QA, Hibernate, Oracle, Cucumber, Java...

Experience

Availability

Part-time

Preferred Environment

Apache Maven, Jenkins, Git, Debian Linux, IntelliJ IDEA

The most amazing...

...algorithms I wrote for QIAGEN are used to analyze biological, medical, and real forensic data.

Work Experience

Founder and CEO

2020 - PRESENT
PCMGF Limited
  • Consulted for a bioinformatics company developing a complete pipeline for single-cell analysis.
  • Developed a system for retrieving data and estimating property values for buy-to-let applications.
  • Set up project infrastructures for version control, continuous integration, and deployment.
  • Development of an assembler for viral data using ML techniques.
Technologies: Python, TensorFlow, Docker, Git, GitHub, Java, Groovy, Version Control, Data Engineering, Biopython, Genomics, Data Science

Genomics Bioinformatician

2021 - 2021
Department of Health and Social Care, Joint Biosecurity Centre
  • Monitored and analyzed SARS‑CoV‑2 lineages outside of the UK using data from GISAID.
  • Reported findings at weekly meetings and contributed to technical reports.
  • Built software to automate a part of the reporting pipeline.
Technologies: Python, Biopython, SQL, Version Control, Data Engineering, Genomics, Data Science

Software Development Engineer in Test (High Tech)

2019 - 2020
PrePay Solutions
  • Developed and improved tests to ensure high quality of deliverables.
  • Worked with payments systems, especially SEPA and MasterCard.
  • Improved the test automation framework using Java, Cucumber, Spring, Hibernate, and Selenium.
Technologies: Microservices, Spring, SDET, Software QA, Hibernate, Oracle, Cucumber, Java, Version Control, Selenium

Research and Development Scientist

2019 - 2019
BioEpic, Ltd.
  • Led the team in charge of creating machine learning and statistical models.
  • Built predictive models using preprocessed in-house biomedical data and public epidemiological data.
  • Trained and mentored junior scientists across several teams and organized book clubs.
Technologies: Machine Learning, Scikit-learn, Python, Version Control, Data Science

Lead Bioinformatics Scientist

2013 - 2019
QIAGEN
  • Developed algorithms and tools for classifying peaks in sequencing data using shape information. The tool is part of the CLC Genomics Workbench, one of the most popular software for bioinformatics analysis used by tens of thousands of users.
  • Developed algorithms and tools for the analysis of metagenomics data, including statistical and visualization tools.
  • Covered the test manager role for the CLC Microbial Genomics Module, checking the code and scientific quality of the product. Created and signed off test plans, strategies, and runs.
  • Disseminated results through conferences and publications and collaborated with academia.
  • Defined software architecture for new features and aligned with other software architects.
Technologies: Docker, Code Architecture, Test Management, Software QA, Bioinformatics, Statistics, Optimization, Machine Learning, Java, Version Control, Genomics, Computational Biology, Data Science

Webmaster and Manager of the Swedish Translation Group

2008 - 2017
Italia dall'estero (pro bono)
  • Served as the main webmaster of the site italiadallestero.info. The website translated news about Italy from the foreign press into Italian and had about 1,000 unique viewers per day and approximately 100 volunteers.
  • Developed WordPress plugins for database management.
  • Managed the Scandinavian translation group and translated articles from Swedish and occasionally Danish into Italian.
Technologies: PHP, WordPress, Version Control

Postdoctoral Associate

2010 - 2013
Yale School of Medicine
  • Built a novel algorithm for cancer subclonal deconvolution using Java.
  • Developed tools to perform analysis of next-generation sequencing data (particularly ChIP-seq, Exome-seq, FAIRE-seq, and 4C-seq) and visualize results, primarily using Java.
  • Created statistical methods for combining predictions from different algorithms.
Technologies: Statistics, Machine Learning, MATLAB, R, Java, Version Control

Bioinformatician

2004 - 2010
Biognos AB
  • Developed machine learning-based methods (genetic algorithms) for predicting oligosaccharides 3D structures in Java.
  • Optimized code and distributed computations over a small cluster using Java RMI.
  • Developed and maintained a ligand-based database for screening purposes by extending PostgreSQL types.
Technologies: Cheminformatics, Genetic Algorithms, Chemistry, Java, Version Control

QIAGEN CLC Microbial Genomics Module

The QIAGEN CLC Microbial Genomics Module module extends the capabilities of the QIAGEN CLC Genomics Workbench to support the analysis of bacterial, viral, and eukaryotic (fungal) genomes and metagenomes. It offers tools and workflows for a broad range of bioinformatics needs for microbiome analysis, isolate characterization, functional metagenomics, and resistance identification.

I developed tools for performing OTU clustering, including demultiplexing, clustering, statistical analyses (i.e., alpha/beta diversity, PERMANOVA, differential abundance), and visualization. I also developed most of the current pipeline in the Microbial Genomics Module or functional annotation of metagenomic sequences, i.e., a gene finder to annotate nucleotide sequences using a variable-order Markov model and differential functional abundance.

CLC Shape-based Peak Caller

https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-016-1042-5
I was the main developer of the CLC shape-based peak caller, a tool used to identify narrow and broad peaks in sequencing data using shape information.

The tool implements all the signal detection steps, quality control, normalization, discovering obvious peaks, learning the peak shape, peak shape score, and peak detection in a single, easy-to-use algorithm. The algorithm delivers a QC report containing metrics about the ChIP-seq experiment's quality, a peak shape score value for every genomic position, and a list of all called peaks.

A performance evaluation showed that the CLC shape-based peak caller ranks well among popular state-of-the-art peak callers while requiring a minimum intervention and parameterization from the user.

TrAp - Tree Approach to Clonality

https://sourceforge.net/projects/klugerlab/files/TrAp/
I was the principal author of TrAp, a tool designed to infer subclonal tumor composition in human patients from sequencing data using a few biologically motivated constraints.

More information is available in the paper at doi.org/10.1093/nar/gkt641.
2005 - 2010

Ph.D. Degree in Medical Biochemistry

Gothenburg University - Göteborg, Sweden

2002 - 2004

Master's Degree in Bioinformatics

Chalmers University of Technology - Göteborg, Sweden

Libraries/APIs

Scikit-learn, TensorFlow

Tools

Cucumber, IntelliJ IDEA, Apache Maven, GitHub, Git, Jenkins, MATLAB, Biopython

Languages

Java, SQL, R, PHP, Python, Groovy

Industry Expertise

Bioinformatics

Storage

PostgreSQL

Paradigms

Distributed Computing, Microservices

Platforms

WordPress, Debian Linux, Oracle, Docker

Frameworks

Hibernate, Spring, Spring Boot, Selenium

Other

Machine Learning, Optimization, Statistics, Software QA, Test Management, Scraping, Version Control, Chemistry, Genetic Algorithms, Cheminformatics, Data Engineering, SDET, Code Architecture, Genomics, Data Science, Computational Biology

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring