Architect and Data Engineer
2020 - PRESENTGoalcast- Designed an architecture to capture, analyze, and dashboard social media posts and videos. Complex data and reporting requirements had to be met.
- Tracked features and bugs using Jira and created technical specifications based upon client requirements.
- Built complex ETL flows to manage the transformations using Athena.
- Built user friendly dashboards in AWS Quicksight to enable users to explore the data.
Technologies: Amazon Athena, Apache Airflow, Amazon QuickSight, Python, Data Engineering, Data Science, Machine Learning, Data Reporting, Data Modeling, Data Analysis, Business Intelligence (BI), Microsoft Excel, ETL, Data Warehousing, Data Architecture, Big Data Architecture, Architecture, RoadmapsTechnical Lead
2018 - PRESENTMarketing Attribution Partners- Designed the architecture to meet requirements. Iteratively enhanced this as requirements increased in a managed way over time.
- Built the data model system service using PostgreSQL and Python (PyMC3), leading edge technology for marketing analysis.
- Built the data model and Django back-end API service for a SaaS solution to surface the model.
- Led the expansion of the team with processes to manage the quality and specification. Used Jira system to build an Agile CI/CD process. Managed team on a day-to-day basis.
- Enhanced the data processing with a 100x speed performance boost using low level Python (NumPy) to replace an intensive mechanism previously performed in SAS.
Technologies: Data, Python, PyMC3, Data Warehouse Design, Django, SQL, PostgreSQL, NumPy, Data Engineering, Data Reporting, Data Modeling, Data Analysis, Business Intelligence (BI), Microsoft Excel, ETL, Data Warehousing, Data Architecture, Architecture, RoadmapsLead Architect
2020 - 2021Pharma Data Company- Designed and communicated a novel ETL architecture to meet exacting requirements around data provenance, quality, security, and performance.
- Built the project plan and engaged with the development teams to ensure alignment. Managed the work alongside the program manager using Jira.
- Developed solutions to overcome complex edge cases to ensure smooth running of the system and to allow the project to be completed. Using SQL and PySpark.
- Supported implementation and quality assurance while the system was embedded.
Technologies: Apache Airflow, Code Architecture, SQL, Redshift, Redshift Spectrum, AWS Glue, Data Lake Design, SAS, Data Engineering, Data Science, Machine Learning, Data Reporting, Data Modeling, Data Analysis, MySQL, Business Intelligence (BI), Microsoft Excel, ETL, Data Warehousing, Data Architecture, Big Data Architecture, ArchitectureMI Manager
2013 - 2015Bet365- Led the development of analytics and business reporting (a team of six data engineers) and the micro-strategy reporting team.
- Led the enhancement of ETL processes to meet tighter timescales and more features.
- Worked on legal and compliance reporting across geographies to help roll out services worldwide.
Technologies: Data Warehouse Design, ETL Tools, SQL, kognitio, Data Engineering, Machine Learning, Data Reporting, Data Modeling, Data Analysis, MySQL, Business Intelligence (BI), ETL, Data Warehousing, Data Architecture, Big Data Architecture, Architecture, RoadmapsLead BI Architect
2009 - 2013Capgemini- Served as senior architect within Capgemini UK BIM (Business Information Management) practice. Communicated regularly with stakeholder management at partner (VP) level within Capgemini and at the senior executive level within the customer organization.
- Oversaw solution design for Capgemini customers, specializing in public sector. Significant customer stakeholder management and pre sales activity, including sales strategy and solution design work for large scale data solutions.
- Collaborated with partners including Cloudera to ensure optimal solution design. Fed back into open source community where possible.
Technologies: Data Warehouse Design, SQL, Teradata, Hadoop, Cloudera, Data Analysis, ETL, Data Warehousing, Data Architecture, Architecture