
Ian Roy
Verified Expert in Engineering
Data Engineer and Developer
Location
Cleveland, OH, United States
Toptal Member Since
December 21, 2020
Ian is a data engineer with six years of experience providing scalable and verifiable solutions. His primary focus has been on large-scale data collection operations across several industries. Ian also has extensive experience working in the Docker ecosystem, so he excels at smoothly and efficiently integrating his code into an existing DevOps system.
Portfolio
TPA Stream
Web Scraping, Data, Python
Trilogy Education Services
Machine Learning, APIs, D3.js, JavaScript, NoSQL, SQL, Python
Dakota Software
JavaScript, SQL, Python
Availability
Part-time
Preferred Environment
Python, Linux
The most amazing...
...thing I've created would be my solutions for dealing with many different anti-bot networks when scraping data.
Work Experience
Data Engineer
2019 - PRESENT
TPA Stream
- Created tools that simplified the process of writing web scrapers.
- Containerized the web scrapers so that they could be part of the DevOps pipeline.
- Maintained the existing data pipelines by troubleshooting bugs as they occurred.
Technologies: Web Scraping, Data, Python
Data Science Instructor
2018 - PRESENT
Trilogy Education Services
- Instructed students who were learning basic programming and data analysis.
- Troubleshot student configuration issues in a variety of hardware and software environments.
- Provided 1-on-1 tutoring to students in need of further instruction.
Technologies: Machine Learning, APIs, D3.js, JavaScript, NoSQL, SQL, Python
Software Engineer
2015 - 2018
Dakota Software
- Maintained web scrapers for government regulations.
- Created Selenium-based automation tools for web app testing.
- Deployed tested code to the production environment.
Technologies: JavaScript, SQL, Python
Experience
Stock Data Pipeline
https://github.com/Ian-Roy/data_pipeline_stock
I built a distributed Python desk application that gathers and analyzes data on current stock trends. The data is stored in a document database and is designed to scale as needed for many different stock interests.
Skills
Languages
Python, SQL, JavaScript
Other
Data Engineering, Web Scraping, Big Data, Data Analysis, Data, APIs, Machine Learning
Tools
Git, DataViz
Libraries/APIs
D3.js
Paradigms
Distributed Computing
Platforms
Docker, Linux
Storage
Data Pipelines, NoSQL
Education
2012 - 2017
Bachelor of Science Degree in Mathematics
Cleveland State University - Cleveland, OH, United States