Hiro Shioi, Data Scientist Developer in Pleasanton, CA, United States

Member since February 22, 2022
Hiro is a customer-facing data scientist. He excels across the full range of data science work, from high-level problems (ideation, road mapping, ROI estimation, solution and data architecting) to hands-on execution (data cleansing, feature engineering, modeling, operationalization). At General Electric, Hiro led a million-dollar digital transformation advisory project and developed 15 data science products across the healthcare, mining, telecommunications, manufacturing, transportation, power, and financial industries.

Portfolio

  • dotData Inc.
    Python, PySpark, Tableau, Microsoft Power BI, SQL, Snowflake, Databricks, AWS...
  • General Electric
    Python, Anomaly Detection, PySpark, Machine Learning, Digital Signal Processing

Location

Pleasanton, CA, United States

Availability

Part-time

Preferred Environment

Python, Jupyter Notebook, AWS, Azure

The most amazing...

...customer-focused business outcome I executed and delivered was worth $8 million to the client and was completed within 60 days.

Employment

  • Senior Data Scientist

    2020 - PRESENT
    dotData Inc.
    • Served as the sole customer-facing data scientist and closed the deal with the happiest customer by validating an $8 million monthly revenue increase and operationalizing the ML model within 60 days.
    • Developed automated data science (automated feature engineering and AutoML) use cases for businesses across industries, e.g., eCommerce, manufacturing, retail, and finance.
    • Acted as a product manager, directing engineers to improve the products and add-on features based on customer-facing knowledge.
    Technologies: Python, PySpark, Tableau, Microsoft Power BI, SQL, Snowflake, Databricks, AWS, Azure
  • Senior Data Scientist

    2016 - 2020
    General Electric
    • Developed 15 data science products and solutions for customers across verticals such as healthcare, mining, telecommunications, manufacturing, transportation, power, and finance.
    • Led a million-dollar digital transformation advisory contract for a financial institution.
    • Developed anomaly detection models in Python that combined machine learning with physics-based models, applying signal processing to data from 14 time-series sensors.
    • Delivered an analytical report covering millions of time-series log and service data records.
    Technologies: Python, Anomaly Detection, PySpark, Machine Learning, Digital Signal Processing

Experience

  • Flask App for Anomaly Detection Using User Session Logs

    During a hackathon, I developed a RESTful API service with Flask (a Python framework) that consumed user session logs and detected unusual behavior using machine learning techniques. For this application, I wrote the parsing and preprocessing script, the machine learning models (logistic regression, decision tree, XGBoost, etc.), and the training and prediction pipeline.

Skills

  • Languages

    Python, SQL, Snowflake
  • Paradigms

    Anomaly Detection
  • Platforms

    Jupyter Notebook, Databricks, Azure
  • Other

    Machine Learning, Client Presentations, AWS, Object Detection, RESTful APIs
  • Frameworks

    Flask
  • Libraries/APIs

    PySpark, XGBoost
  • Tools

    Tableau, Microsoft Power BI

Education

  • Master's Degree in Aerospace Engineering
    2011 - 2014
    The University of Tokyo - Tokyo, Japan
  • Research Towards a Degree in Computer Science
    2012 - 2013
    ETH Zurich (Swiss Federal Institute of Technology in Zurich) - Zurich, Switzerland
  • Bachelor's Degree in Aerospace Engineering
    2007 - 2011
    The University of Tokyo - Tokyo, Japan
