Jon Scott, Architect and Database Developer in Newcastle-under-Lyme, United Kingdom
Member since March 15, 2021
Jon is a data consultant dedicated to helping his customers fulfill their goals. He has extensive experience designing and building data solutions, from large-scale big data platforms for enterprises to small systems for start-ups, and he helps clients at every stage of their growth exploit the value held in their data. His technical expertise and business acumen combine to give clients the best experience of design, implementation, and long-term value.




Preferred Environment

Linux, PostgreSQL, AWS, Amazon Athena, Data Lake Design, Data Lakes, Data Warehouse Design, Redshift, Python 3, Apache Airflow

The most amazing...

...product I have designed and built is a SaaS solution for a marketing attribution company. It combined industry-leading analytics with blistering speed.


  • Architect and Data Engineer

    2020 - PRESENT
    • Designed an architecture to capture, analyze, and build dashboards over social media posts and videos, meeting complex data and reporting requirements.
    • Tracked features and bugs using Jira and created technical specifications based upon client requirements.
    • Built complex ETL flows to manage the transformations using Athena.
    • Built user-friendly dashboards in Amazon QuickSight to enable users to explore the data.
    Technologies: Amazon Athena, Apache Airflow, Amazon QuickSight, Python
  • Technical Lead

    2018 - PRESENT
    Marketing Attribution Partners
    • Designed the architecture to meet requirements and iteratively enhanced it, in a managed way, as requirements grew over time.
    • Built the data modeling service using PostgreSQL and Python (PyMC3), a leading-edge technology for marketing analysis.
    • Built the data model and Django back-end API service for a SaaS solution to surface the model.
    • Led the expansion of the team, introducing processes to manage quality and specification. Used Jira to build an Agile CI/CD process. Managed the team on a day-to-day basis.
    • Sped up data processing 100x by using low-level Python (NumPy) to replace an intensive mechanism previously performed in SAS.
    Technologies: Data, Python, PyMC3, Data Warehouse Design, Django, SQL, PostgreSQL, NumPy
  • Lead Architect

    2020 - 2021
    Pharma Data Company
    • Designed and communicated a novel ETL architecture to meet exacting requirements around data provenance, quality, security, and performance.
    • Built the project plan and engaged with the development teams to ensure alignment. Managed the work alongside the program manager using Jira.
    • Developed solutions in SQL and PySpark to overcome complex edge cases, ensuring the system ran smoothly and the project could be completed.
    • Supported implementation and quality assurance while the system was embedded.
    Technologies: Apache Airflow, Code Architecture, SQL, Redshift, Redshift Spectrum, AWS Glue, Data Lake Design, SAS
  • MI Manager

    2013 - 2015
    • Led the development of analytics and business reporting (a team of six data engineers) and the MicroStrategy reporting team.
    • Led the enhancement of ETL processes to meet tighter timescales and deliver more features.
    • Worked on legal and compliance reporting across geographies to help roll out services worldwide.
    Technologies: Data Warehouse Design, ETL Tools, SQL, Kognitio
  • Lead BI Architect

    2009 - 2013
    • Served as senior architect within the Capgemini UK BIM (Business Information Management) practice. Communicated regularly with stakeholders at partner (VP) level within Capgemini and at the senior executive level within the customer organization.
    • Oversaw solution design for Capgemini customers, specializing in the public sector. Carried out significant customer stakeholder management and pre-sales activity, including sales strategy and solution design work for large-scale data solutions.
    • Collaborated with partners including Cloudera to ensure optimal solution design. Fed back into the open-source community where possible.
    Technologies: Data Warehouse Design, SQL, Teradata, Hadoop, Cloudera


  • BCV Social Platform

    Designed and technically managed the build of a data analytics solution for social media reporting for clients of BCV Social. Managed data flows to ensure data quality and low latency. Created a reporting back-end API used by the front-end reporting system.


  • Languages

    Python 3, Python, SQL, SAS
  • Frameworks

    Django, Hadoop
  • Tools

    Amazon Athena, Redshift Spectrum, Amazon QuickSight, Apache Airflow, AWS Glue, Cloudera
  • Storage

    PostgreSQL, Data Lake Design, Data Lakes, Redshift, Teradata
  • Other

    AWS, Data Warehouse Design, Data, Data Analytics, Code Architecture, Dashboard Design, PyMC3, ETL Tools, Kognitio
