Mark Farragher
Verified Expert in Engineering
Statistical Methods Developer
Mark is a highly analytical data scientist and open-source developer who most recently worked in a startup environment. He trained as an economist and specializes in statistical modeling, causal inference, and experimentation. He has a breadth of experience, having worked in contracting and employment on data projects for public, private, and third-sector clients. He is an excellent communicator who often gives talks at conferences and writes blog posts on data topics.
Portfolio
Experience
Availability
Preferred Environment
Google Cloud Platform (GCP), NumPy, Pandas, Linux, MacOS, Windows, STATA, Tableau, Looker, Git, BigQuery, SQL, Python
The most amazing...
...personal project I've worked on is an open-source econometrics library for Python called Appelpy. It reached over 120 stargazers on GitHub.
Work Experience
Performance Analytics Officer
Genomics England
- Drove the development of new data products consumed across Genomics England internally and externally, including analytics used by senior stakeholders in government.
- Established analytics capability in a Center of Excellence team to enable self-serve analytics in late 2021 and shaped the analytics roadmap.
- Developed best practices in analytics to support user needs and set the technical end-to-end ELT pipeline.
Data Scientist
Boutique Health Consultancy (Contract)
- Developed scalable analytics tools on a healthcare data project for a boutique healthcare consultancy's analytics team. Led software engineering and ETL-intensive tasks.
- Moved Python code away from notebooks, with some refactoring, into a more robust system. Introduced best practices in software engineering, e.g., logging, unit testing, or modularization of code.
- Developed ETL pipelines with Parquet files and analytics tools to reduce the team's time spent on ad-hoc requests, with a long-term view of making the analytics more self-serve for the client.
- Automated the generation and sending of data outputs, e.g., with Microsoft Teams Flows or BAT scheduling.
- Automated advanced Excel outputs, with a mix of Python libraries such as Xlwings or Openpyxl.
Data Scientist
Triptease
- Developed logic for systematically managing hotels' online presence on metasearch advertising channels, harnessing their unique characteristics and multiple data sources. Helped to launch the MVP and iterate logic once we had more customers.
- Built dashboards and tooling to help our cross-functional squad monitor the health of the metasearch product, e.g., anomaly detection system, integration health, or model and product performance.
- Developed internal Python libraries to analyze our data more robustly and systematically. Automated dozens of my common analytics tasks and modeling workflow.
- Delivered a talk at the PyData London 2019 conference on the applications of causal inference to hotel websites.
- Undertook research that influenced the sales process like new guard rails, budget optimization, and models tailored to hoteliers' different objectives, like whether to optimize acquisition or the return on ad spend.
- Upskilled and onboarded new colleagues, such as data analysts, on parts of our tech stack.
- Undertook the largest A/B test of on-site messaging across thousands of hotel websites. Wrote the white paper on the test results and discussed the test in a webinar with customers.
Data Analyst
Triptease
- Ran multiple A/B tests to demonstrate the value of Triptease's products to the largest hotel groups. The rigorous process and outputs contributed to reducing client churn in Q1 2018.
- Increased the efficiency and effectiveness of the original A/B testing process via, e.g., reporting templates, automation of tasks, or dashboard and tool development.
- Undertook analysis and research that led to developing new product and platform features.
- Analyzed conversational data with NLP to refine the front desk chatbot product, which led to higher accuracy.
- Assisted in hiring new data scientists in H2 2018 and eventually moved into a data science role in a new squad.
Analyst/Consultant
Self-employed
- Contracted at Triptease as a data analyst for one month before moving into a permanent role.
- Developed strategy and evidence bases for supporting the startup of new universities internationally for consultancy for Susan Jackson Associates.
- Undertook quantitative and qualitative analysis on international higher education policy.
Consultant
Public and Corporate Economic Consultants (PACEC)
- Authored a publication on England's knowledge exchange (KE) policy and developed a database tool for monitoring universities' KE strategies as part of one of our largest consultancy projects (£100,000+). I wrote the proposal.
- Wrote the baseline evaluation report for the Newton Fund, a £735 million fund to develop UK research partnerships in 15 countries. Developed the quantitative analysis methodology and became an SME on international research partnerships.
- Authored dozens of pieces of research on economic policy and higher education policy for clients at different levels of government, e.g., UK central government, local authorities, local enterprise partnerships, public-private consortia, etc.
- Introduced and applied new technology to our analytics, like the use of Tableau.
Experience
The State of the English University Knowledge Exchange Landscape
https://dera.ioe.ac.uk/29698/I was one of the main authors of this publication, and the wider project was one of the largest revenue contracts that the business had secured. The web link above has the full report, so that may be useful for context. Findings from previous PACEC reports on knowledge exchange have been used in the HE sector at workshops and conferences, especially by policymakers, so it is likely to be the same for this report over the next few years.
Appelpy (Applied Econometrics Library for Python)
https://github.com/mfarragher/appelpyIt was a personal project I worked on over some weekends and evenings. I marketed the project, and it became one of the most starred projects on regression modeling on GitHub.
Newton Fund Baseline Evaluation
I undertook quantitative analysis and desk research to develop the baseline evaluation for the Newton Fund by drawing upon international and national literature and datasets encompassing innovation policy. I authored 15 country-level reports assessing the partner countries' innovation capacity and a high-level report.
Skills
Languages
Python, SQL, R, Excel VBA
Libraries/APIs
Pandas, Scikit-learn, NumPy
Tools
Looker, Tableau, Git, STATA, BigQuery, Microsoft Excel
Paradigms
Agile, Scrum, Test-driven Development (TDD)
Other
Google BigQuery, Consulting, Economics, Machine Learning, Statistical Methods, Causal Inference, SaaS, Data Analysis, A/B Testing, Product Analytics, Data Analytics, Bayesian Statistics, Parquet, Biostatistics, Research, Genetics, Product Roadmaps
Platforms
Google Cloud Platform (GCP), Windows, MacOS, Linux
Industry Expertise
Project Management
Education
Master's Degree in Population Health Sciences
University of Cambridge - Cambridge, UK
Bachelor of Science Degree in Economics
London School of Economics & Political Science - London, United Kingdom
Certifications
Chartered Statistician
Royal Statistical Society
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring