Ian Roy, Data Engineer and Developer in Cleveland, OH, United States
Ian Roy

Data Engineer and Developer in Cleveland, OH, United States

Member since December 21, 2020
Ian is a data engineer with six years of experience providing scalable and verifiable solutions. His primary focus has been on large scale data collection operations across a few different industries. Ian also has abundant experience working in the Docker ecosystem, so he excels at smoothly and efficiently integrating his code into an existing DevOps system.
Ian is now available for hire

Portfolio

Experience

Location

Cleveland, OH, United States

Availability

Part-time

Preferred Environment

Python, Linux

The most amazing...

...thing I've created would be my solutions to for dealing with many different anti-bot networks when scraping data.

Employment

  • Data Engineer

    2019 - PRESENT
    TPA Stream
    • Created tools that simplified the process of writing web scrapers.
    • Containerized the web scrapers so that they could be part of the DevOps pipeline.
    • Maintained the existing data pipelines by troubleshooting bugs as they occured.
    Technologies: Web Scraping, Data, Python
  • Data Science Instructor

    2018 - PRESENT
    Trilogy Education Services
    • Instructed students who were learning basic programming and data analysis.
    • Troubleshot student configuration issues in a variety of different hardware and software enviorments.
    • Provided 1-on-1 tutoring to students in need of further instruction.
    Technologies: Machine Learning, APIs, D3.js, JavaScript, NoSQL, SQL, Python
  • Software Engineer

    2015 - 2018
    Dakota Software
    • Maintained web scrapers for government regulations.
    • Created Selenium-based automation tools for web app testing.
    • Deployed tested code to the production environment.
    Technologies: JavaScript, SQL, Python

Experience

  • Stock Data Pipeline
    https://github.com/Ian-Roy/data_pipeline_stock

    I built a distributed Python desk application that gathers and analyzes data on current stock trends. The data is stored in a document database and is designed to scale as needed for many different stock interests.

Skills

  • Languages

    Python, SQL, JavaScript
  • Other

    Data Engineering, Web Scraping, Big Data, Data Analysis, Data, APIs, Machine Learning
  • Tools

    Git, DataViz
  • Libraries/APIs

    D3.js
  • Paradigms

    Distributed Computing
  • Platforms

    Docker, Linux
  • Storage

    Data Pipelines, NoSQL

Education

  • Bachelor of Science Degree in Mathematics
    2012 - 2017
    Cleveland State University - Cleveland, OH, United States

To view more profiles

Join Toptal
Share it with others