Clayton Lemons
Verified Expert in Engineering
Software Developer
Ithaca, NY, United States
Toptal member since April 23, 2020
Clayton is a transformative software engineer and leader with over 15 years of experience in the software industry, innovating at the intersection of data, AI/ML, and cloud engineering. As both an individual contributor and a visionary leader, he is a dynamic force in transforming challenges into user-centric, high-quality software solutions. Renowned for his technical mastery, strategic foresight, and principled approach to software, Clayton elevates teams to collaborate and perform their best.
Portfolio
Experience
- Python - 14 years
- Back-end - 10 years
- Software Architecture - 10 years
- C++ - 10 years
- Machine Learning - 7 years
- Spark - 5 years
- Amazon Web Services (AWS) - 4 years
- Kubernetes - 4 years
Availability
Preferred Environment
Full-stack, TypeScript, PostgreSQL, Docker, Scala, Python, Kubernetes, Amazon Web Services (AWS), React, Machine Learning Operations (MLOps)
The most amazing...
...solution I've architected and developed is an internal AI/ML platform deployed on Kubernetes, leveraging EKS, JupyterHub, Spark, MLflow, Feast, and Pachyderm.
Work Experience
Enterprise Strategy Architect
U.S. Department of Veterans Affairs
- Produced a detailed technical strategy document for a large-scale, cloud-based data and analytics platform.
- Revised existing and created new architecture diagrams in support of the technical strategy document.
- Planned the pilot of a novel synthetic data generation system.
Senior Director of AI Technology
Elevance Health
- Managed an 8-member engineering team skilled in AI/ML, data science, back-end API development, and DevOps, fostering a diverse technical environment.
- Mentored eight engineers in technical and leadership skills, enabling two engineers to achieve technical leadership roles.
- Oversaw the maintenance of a cloud-native, elastic AI/ML platform built on JupyterHub and a data pipeline that processed HL7 FHIR resources for over 70 million patients, handling billions of claims and clinical records.
- Headed a cross-functional team to develop a research-focused data science platform on Google Kubernetes Engine, ensuring secure access to deidentified data for over 70 million patients.
- Designed a secure solution using Kasm to prevent data exfiltration from container-based JupyterLab workspaces running on the data science platform.
- Containerized a customized JupyterLab environment for the data science platform equipped with an extensive set of AI/ML tools.
- Architected a secure, multi-tenant persistent storage system to support both user and shared project data in JupyterLab workspaces, employing Google Cloud Storage buckets mounted with gcsfuse.
- Engineered a deployment strategy for an innovative synthetic data generation system.
- Served as the cloud engineering lead for an LLM-powered, internal knowledge management tool for CSR queries, providing solutions for both the LLM's development and its deployment and integration with the tool's front-end application.
- Implemented Ray on Kubernetes with autoscaling and GPU support and showcased its ability to fine-tune a 70 billion parameter Llama 2 LLM using the Fully Sharded Data Parallel (FSDP) technique.
AI Software Engineering Lead
Elevance Health
- Led the architecture and team efforts to build and deploy a cloud-native AI/ML platform on AWS for enhanced data science on scalable infrastructure, integrating technologies such as Kubernetes, JupyterHub, Spark, MLflow, Feast, and Pachyderm.
- Secured cloud accounts for the AI/ML platform deployment, working in close collaboration with internal committees and DevSecOps teams to ensure compliance and governance alignment.
- Spearheaded the deployment of JupyterHub on Amazon EKS in close collaboration with DevSecOps engineers, leveraging EFS, EBS, S3, and custom Docker images to meet specific user environment needs.
- Developed a single sign-on (SSO) solution integrating JupyterHub and MLflow via Auth0, facilitating a seamless authentication experience for users.
- Developed a high-performance computing solution for users of the AI platform, seamlessly integrating on-demand, scalable GPU resources and elastic, Kubernetes-hosted Spark jobs with JupyterHub.
- Proposed and contributed a security solution to Pachyderm, improving the security posture of its "JupyterLab Pachyderm Mount Extension" and making it easier for users to integrate.
- Influenced Pachyderm's development roadmap by identifying numerous performance issues and suggesting new features and optimizations, several of which were implemented.
- Conducted comprehensive training sessions for data scientists and engineers on the effective use of the AI/ML platform, enhancing team capabilities.
- Directed a cross-functional team on the development and operationalization of a predictive model for type 2 diabetes on the AI/ML platform, successfully advocating for the use of Feast and MLflow to achieve MLOps best practices.
- Led the architectural design and development of a new FHIR generation pipeline to replace an old one, cutting processing time from three weeks to 24 hours for 70 million patients and substantially lowering operational costs.
AI Solutions Engineer Executive Advisor
Anthem
- Conducted comprehensive research to identify Pachyderm as an enterprise-grade COTS software solution that met specific needs for pipeline orchestration, distributed processing, incremental processing, and data tracking and provenance.
- Drove the procurement process for Pachyderm, successfully navigating licensing, negotiations, and acquisition.
- Directed and assisted a team of cloud engineers with the deployment of Pachyderm within the enterprise's AWS cloud infrastructure, specifically leveraging Amazon EKS.
- Oversaw the operationalization of Pachyderm, establishing robust processes and best practices for building and executing large-scale data pipelines.
- Standardized and documented data engineering best practices for the organization.
- Designed an internal, patient-focused health trajectory data structure, significantly enhancing data scientists' ability to rapidly analyze data and develop AI-driven health models.
- Led the design and development of a Pachyderm pipeline that hydrated the above health trajectory data structure with data from over 70 million patients, encompassing 2+ TB of data.
AI ETL Solutions Engineer via Toptal
Anthem AI - Telehealth/PIP
- Spearheaded the design and execution of a complex data pipeline that enables the seamless delivery of AI-driven health insights from an on-prem server to a cloud-based application.
- Engineered a Python-based API and storage framework for the storage and retrieval of AI-driven health insights on Amazon S3, utilizing compression, Base64 encoding, and indexing for flexibility and efficiency.
- Enhanced data science operations by providing expert ETL and ML pipeline engineering support to data scientists in the form of code reviews, debugging, pair programming, and performance optimization.
- Maintained on-prem ETL pipeline components built using Hive, PySpark, and Airflow.
- Advocated successfully to leadership for the transformation of an on-prem ETL pipeline to a cloud-native solution, leveraging Snowflake, Kubernetes, PySpark, and Pachyderm.
Research Software Engineer (Machine Learning)
GrammaTech
- Accelerated the static and binary analysis of a large-scale codebase by implementing a data and ML pipeline using Python, MongoDB, and JavaScript.
- Reduced computational costs for a binary analysis program by implementing a pupil-style ML model with scikit-learn.
- Crafted data analysis techniques in Python to detect security issues in JavaScript functions, such as swapped callback and error arguments in higher-order continuation-style functions.
- Leveraged the Doc2Vec model to vectorize function call sites and definitions, streamlining the detection of swapped arguments through semantic similarity of parameter and argument names.
- Developed the back end of a feature for binary scanning in a SaaS binary analysis tool.
Software Engineer I – III
National Instruments
- Earned recognition for outstanding performance, receiving the "Rookie of the Year" award, multiple fast-track promotions, and the opportunity to lead a key project.
- Standardized and streamlined the firmware downloading framework across multiple devices in the NI-DCPower and NI-DMM product families.
- Designed and implemented over 20 features in the NI-DCPower and NI-DMM driver APIs for Windows, with many improvements directly visible to the end user.
- Led the research and definition of three major features for a key product, collaborating with project managers across hardware teams and various stakeholders to ensure alignment and address technical requirements comprehensively.
- Designed and led the development of an internal programming language and compiler that targeted a proprietary instruction set for power supply output control, enabling flexible device behavior reconfiguration and complex output control.
- Implemented a client-server system that enables remote management of NI-DCPower API driver sessions, facilitating debugging and introspection.
- Improved developer workflow efficiency by implementing a Sublime Text plugin to integrate Perforce.
- Developed a VS Code extension that integrates NI's custom build system with Microsoft's C/C++ extension, enabling advanced features like semantic code completion.
- Researched the Tarantula fault localization technique, successfully created a prototype for select NI codebases, and showcased the findings at an internal engineering conference.
- Mentored more than 10 interns and junior engineers.
Web Developer
CleanTelligent Software
- Optimized several database queries and storage layouts, including the file storage system for customer photos, which reduced several API response times to just milliseconds.
- Implemented the back-end API for a customizable report generation tool.
- Applied a new UI theme to several parts of the website.
Software Engineer Intern
National Instruments
- Designed an essential kernel driver feature that streamlined driver communication with embedded storage devices on over 10 commercial products.
- Developed a code generation tool to support the driver feature, which automatically leveraged Python and Mako templates to generate C++ and LabVIEW code from metadata.
- Investigated and presented the advantages and disadvantages of various metadata schema formats for the code generation tool, then led a consensus meeting to select the most suitable one.
Web Developer
CleanTelligent Software
- Pinpointed and documented multiple user experience inconsistencies across related functionalities and addressed the corresponding issues successfully.
- Broadened the capabilities of a crucial job scheduling tool by integrating additional back-end queries and introducing new user interface elements on the front end.
- Resolved over 30 bugs throughout the CleanTelligent website's front end and back end, enhancing overall performance and user experience.
Experience
Capstone Project for the Coursera Course "Functional Programming in Scala"
https://github.com/claytonlemons/fp-in-scala-capstoneNI-DCPower Soft Front Panel Debug
http://www.ni.com/en-us/innovations/white-papers/14/introducing-debug-driver-session-technology.htmlI implemented the project by creating a DLL that initiates an Apache Thrift server for command relay between remote and driver sessions over localhost, enabling seamless integration of debugging tools. Thrift's flexibility was key for future extensions to support connections from non-local processes.
I collaborated closely with LabVIEW and C# developers, who consumed the client API in order to implement the NI-DCPower Soft Front Panel Debug feature.
Subforce
https://github.com/claytonlemons/SubforceGMF Aquatics Website
Education
Master of Science Degree in Software Engineering
The University of Texas at Austin - Austin, TX, USA
Bachelor of Science Degree in Computer Science
Brigham Young University - Provo, UT, USA
Certifications
CKS: Certified Kubernetes Security Specialist
The Linux Foundation
CKA: Certified Kubernetes Administrator
The Linux Foundation
CKAD: Certified Kubernetes Application Developer
The Linux Foundation
Parallel Programming in Scala
Coursera
Functional Programming in Scala Capstone
Coursera
Functional Programming Principles in Scala
Coursera
Functional Program Design in Scala
Coursera
Big Data Analysis with Scala and Spark
Coursera
Advanced Python
Skills
Libraries/APIs
TensorFlow, Keras, NumPy, SciPy, Pandas, Scikit-learn, PySpark, React, PyTorch, Ruby ERB, Backbone.js, Mustache, Windows API
Tools
Sublime Text 3, Visual Studio, Gensim, Git, Perforce, GitLab, GitLab CI/CD, LabVIEW, Apache Airflow, Pachyderm, Amazon EKS, Pytest, Helm, Logging, AWS Step Functions, Bitbucket, Amazon Elastic Block Store (EBS), AWS ELB, AWS IAM, AWS CLI, Artifactory, Docker Hub, Docker Compose, Google Kubernetes Engine (GKE), Terraform, CVS, ChatGPT
Languages
Python, C++, Bash, TypeScript, PHP, CSS, Ruby, Java, Scala, GraphQL, HTML, JavaScript, Bytecode, Python 3, Embedded C, C, Bash Script, SQL, Snowflake, C#
Platforms
Kubernetes, Docker, Amazon Web Services (AWS), Cloud Native, Windows, NetBeans, Software Design Patterns, Linux, AWS Lambda, Jupyter Notebook, OCI Artifact Registry, Director, Google Cloud Platform (GCP), Visual Studio Code (VS Code)
Frameworks
Spark, Flask, Hadoop, Jakarta Server Pages (JSP), Apache Struts 2, CakePHP, Ray, Bootstrap, Apache Thrift
Paradigms
Functional Programming, Concurrent Programming, Compiler Design, Distributed Programming, Software Testing, Parallel Programming, ETL, Pair Programming, Distributed Computing, DevOps, HL7 FHIR Standard, Agile, Scrum
Storage
PostgreSQL, MongoDB, Cloud Deployment, Databases, Apache Hive, Data Pipelines, Amazon S3 (AWS S3), Amazon EFS, Google Cloud Storage, Datadog
Other
Windows Kernel Drivers, Software Architecture, Back-end, Machine Learning, Cloud, Development, Full-stack, Firmware, API Design, Concurrent Computing, Thread Scheduling, Processing & Threading, Code Validation, Data Mining, Software Project Management, Data Engineering, Programming, Operating Systems, Data Structures, Software Design, Security, Asymmetric Encryption, Server Development, Non-blocking I/O, Natural Language Processing (NLP), Compilers, Digital Signal Processing, Computer Engineering, Algorithms, Machine Learning Operations (MLOps), Statistical Analysis, CI/CD Pipelines, Big Data, Data Analytics, Data Modeling, Data Profiling, APIs, Frameworks, Storage, Artificial Intelligence (AI), Code Review, Debugging, Advisory, Flake8, pre-commit, Orchestration, Data Science, Distributed Software, Negotiation, Procurement, COTS, Enterprise SaaS, Provenance, Data Lineage, IT Security, Technical Leadership, Functional Design, Kubernetes Operations (kOps), Open Source, Container Orchestration, Troubleshooting, Scheduling, Site Reliability Engineering (SRE), System Administration, Containerization, JupyterLab, MLflow, Feast, AWS Auto Scaling, GPU Computing, Architecture, Single Sign-on (SSO), Optimization, Coaching, Career Coaching, Feedback Review, Software Engineering, Cross-functional Team Leadership, Strategic Planning & Execution, Idea Synthesization and Application, Staff Management & Development, Goal Management, Project Coordination, Business Requirements, Open-source LLMs, Large Language Models (LLMs), New Product Development, FastAPI, Cross-functional Collaboration, Llama 2, Fine-tuning, Data-level Security, Data Exfiltration Prevention, Mako, Version Control, Plugin Development, Client-server Model, DLL, Device Drivers, Full-stack Development, API Integration, ETL Tools, Data Management, IT Strategy, Leadership, Team Leadership, Remote Team Leadership, Protegrity, Retrieval-augmented Generation (RAG), Chatbots
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring