Derek Owens-Oas
Verified Expert in Engineering
Data Scientist and Developer
Ashland, OR, United States
Toptal member since January 21, 2020
Derek has a PhD in statistical science from Duke and has worked as a data scientist and software developer at Xylem. He's the founder of Tech Smart Magic. Published in the Journal of Classification and a TA of the year, he's an expert in research and teaching. He provides interactive apps, visualizations, and reports. Derek's Python and R programming, top AI and ML methods, and ability to communicate make him an asset to any company. His specialties include text data and online social networks.
Portfolio
Experience
Availability
Preferred Environment
GitHub, Python, R, GoDaddy, Social Networks, ChatGPT, Facebook, Google, Microsoft, SQL
The most amazing...
...contribution I made at Xylem was an interactive app to help city utilities visualize water-pipe-network quality in Dallas, DC, and Howard County.
Work Experience
Tutor | Consultant
Varsity Tutors
- Developed a web application to visualize cost distribution with health insurance claims data.
- Used machine learning and labeled data to estimate the sentiment of tweets on Twitter.
- Quantified wound volume reduction for treated and control groups of patients.
- Estimated username from internet session activity data.
- Edited code on programming and statistics homework assignments with high school, college, and graduate students.
Data Scientist
Toptal Client
- Consulted with the company chairman and CEO about the sale of health test kits.
- Analyzed spreadsheet of customer communication for patterns.
- Discussed plan to provide an automated solution by the chatbot.
Data Scientist
Shopper Media Group
- Developed code to estimate the number of visitors at shopping centers with WiFi data.
- Implemented methods for predicting shopper visits using a proxy center.
- Imported table with visitation frequency charts into Redshift warehouse.
- Gave video and audio reports with a daily status.
- Typed up documentation about the process from surveying to a presentation on the web application.
Data Scientist | Software Developer
SureTint Technologies
- Integrated customer relationship management software for a beauty salon application.
- Continued the development of a Python package about color combination.
- Reorganized the data and code file folder structure.
- Gathered and added new data into the existing pipeline.
- Tested and ensured the good quality of the program performance.
- Deployed a basic Django app and experimented with an alternate methodology.
- Typed code in the AWS SageMaker computing environment.
- Trained multiple linear models to estimate hair color with products.
- Applied nearest neighbor method to convert a hair formula product line.
Data Scientist
Xylem, Inc.
- Developed a predictive model and application to efficiently prioritize water pipe inspection for major US city utilities.
- Recruited talent to Xylem at an American Statistical Association event.
- Wrote technical reports with data graphics and statistical language to inform management and a company executive.
- Composed blog posts to emphasize and clarify company impacts.
- Created and presented an interactive visualization of water quality and algae levels in Lake Erie.
Experience
Online Social Network Report and Application
https://github.com/dmo11/political_blog_posts/blob/master/link_block_lda_results.pdfHere is a link to the video showing this application:
https://drive.google.com/file/d/1-Goo7OjKdGs9cvYxDfAu58GUuzDNSQg3/view?usp=sharing
Water Pipe Inspection Prioritizing Application
Lake Erie Water Quality Assessment
Health Procedure Cost Explorer | Web App
https://drive.google.com/file/d/1IwtWOAObd1aBcfm2IukvtzqNQaR_PjiP/viewA second bar-graph version allows the user to mouse-over various procedural choices for treating osteoarthritis. Here is the link:
https://drive.google.com/file/d/10gVQWka51w0RA5wmO4_BPIeEt3nt-ZRr/view?usp=sharing
A healthcare provider can view the patient outcomes to guide the choice going forward.
Learning Topics and Communities in Political Blog Posts
https://arxiv.org/pdf/1610.05756.pdfLearning Original Poster in Group Conversation Data
https://arxiv.org/pdf/1809.03648.pdfStatistics Web Blog
Learning to Make a Tableau Dashboard
https://drive.google.com/file/d/1ygKMZlXeIxfsyl8YjEJPGQGrVphbpYUg/view?usp=sharingSalon Customer Brand Converter
https://drive.google.com/file/d/1uVhkJSdCEioSStJNuitvSPb9NVxnSdJ7/view?usp=sharingI continued developing application which converts formulas from one product line to another. Data are on AWS, code is Python, and a statistical model was used.
Features developed include a filter to ensure products conform to manufacturer recommendations.
Education
Master of Science Degree and PhD in Statistical Science
Duke University - Durham, NC, USA
Bachelor of Arts Degree in Mathematics
Pomona College - Claremont, CA, USA
Skills
Libraries/APIs
Pandas, Scikit-learn, Caret, Facebook API, Matplotlib, NumPy, PySpark, PyTorch, Node.js, TensorFlow Deep Learning Library (TFLearn), Facebook API, X (formerly Twitter) API, TensorFlow, Keras, Python
Tools
Jira, Confluence, Jupyter, Excel Development, Microsoft Word, Git, GitHub, Tableau Development, Data Science, Amazon SageMaker, ARIMA, Zoom Development, ChatGPT
Languages
R, Python, SQL, JavaScript, HTML, SAS, CSS, Java
Frameworks
RStudio Shiny, Django, Spark
Paradigms
ETL, Automation, App Development, Microservices Development, Quantitative Research, Business Intelligence Development
Industry Expertise
Virtual Coaching, Healthcare App Design, Marketing Design
Storage
Database, Database, SQL, JSON, Amazon S3, Redshift, PostgreSQL, AWS, MySQL
Platforms
RStudio, Amazon EC2, WordPress Development, Docker, AWS Lambda, Google Chrome, AWS, Shopify, Microsoft Development
Other
Data Science, Data Science, Data Visualization, Data Cleaning, Analytics Development, Algorithms, NLP, Data Architecture, Data Modeling, Data Engineering, Analysis, Statistical Modeling, Excel Development, Artificial Intelligence, Quantitative Development, A/B Testing, Topic Modeling, Classification, Visualization, Predictive Analytics, SaaS, Big Data Architecture, Machine Learning, Technical Reports, Applied Mathematics, Statistics, Data Science, Data Science, Data Analysis, Mathematics, Bayesian Inference & Modeling, Bayesian Statistics, Regression Modeling, Generative Pre-trained Transformers (GPT), Design Consulting, Time Series, Data Matching, Higher Education, E-commerce marketing, Scraping, Web Scraping, Video Production, Predictive Modeling, Text Mining, Kalman Filtering, Time Series Analysis, Financial Modeling, UI Development, Web Development, Dashboard, APIs, Scheduling, Custom Audio Embedding, Deep Learning, Advertising Management, Serverless, K-nearest Neighbors (KNN), Computer Science, Amazon Redshift, Data Science, Data Handling, Software Development, Publishing, Blogging, Neural Network, Finance, Consumer Products, Surveying, Compliance, Documentation, OCR, GoDaddy, Social Networks, Facebook, Google Software
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring