Lead Data Scientist
2018 - PRESENTFINRA- Led deployment of NLP models in production using Docker and Lambda on AWS.
- Developed supervised and unsupervised models to identify insider trading XGBoost, market manipulation with DBScan, fraud by using Bayesian analysis, and triage external communication with XGBoost, and BERT.
- Gave internal talks on software engineering for data scientists, countering sample bias, measuring model drift, thresholding, and normalizing flows.
- Developed and open-sourced toolkit for validating and monitoring machine learning models and introduced this software at ODSC East 2022.
Technologies: Python, XGBoost, Scikit-learn, Mathematics, Statistics, Machine Learning, Bayesian Statistics, Model Validation, Deep Learning, Explainable Artificial Intelligence (XAI), Keras, TensorFlow, Plotly, ClassificationAnalytics Engineer
2018 - 2019Catalist LLC- Optimized, parallelized, and deployed an NLP model with Keras.
- Wrote SQL parser using Python that refactored over one million lines of legacy SQL scripts.
- Designed and wrote a data processing pipeline for election results as they became available the night of an election.
- Wrote internal technical guides on parallel processing.
Technologies: Python, SQL, Bash, Linux, Git, Machine Learning, Keras, ClassificationDeveloper
2016 - 2017Comsol- Researched models and techniques to simulate physical phenomena of interest to engineers and scientists.
- Wrote technical specifications of new front and back-end components.
- Implemented algorithms used for numerical simulations and user interfaces in Java.
Technologies: Java, C++Freelance Developer
2011 - 2016Self-Employed- Used dynamic programming to reduce the run time of quantum computing simulation from five days to 50 minutes (UMBC Physics Department).
- Performed data visualization and image processing with Python, named the second author in publication summarizing results (American Dental Association Foundation).
- Wrote code to tunnel citizens of countries with internet censorship to uncensored internet via Google Chat and Tor (Tor).
- Helped build initial versions of iCARE, a cancer research and networking nonprofit.
Technologies: Python, PHP, JavaScript, HTML, Matplotlib, Mathematics