Matthew Mitsui, Developer in New York City, NY, United States
Matthew is available for hire
Hire Matthew

Matthew Mitsui

Verified Expert  in Engineering

Software Developer

Location
New York City, NY, United States
Toptal Member Since
November 30, 2017

Matthew is a diligent data scientist with over eight years of experience in machine learning, research, statistics, natural language processing, behavioral modeling, and software development. He helps companies build data-driven products to analyze user behavior and enable users to drive their brands and success.

Portfolio

VidIQ
Python 3, TensorFlow, Jupyter Notebook, Amazon Web Services (AWS), Amazon EC2...
Digitalware, Inc.
JavaScript, Python 3, Django
Rutgers University
Amazon Web Services (AWS), Bootstrap, JavaScript, jQuery, PHP, Scikit-learn...

Experience

Availability

Part-time

Preferred Environment

GitHub, Python, Scikit-learn, Jupyter Notebook, Redshift, SQL, Amazon Web Services (AWS), Pandas, TensorFlow, NumPy

The most amazing...

...thing I've worked on was build a SQL pipeline for a new consumer-facing TikTok app and taking qualitative feedback from non-technical stakeholders.

Work Experience

Head of Data Research

2020 - 2022
VidIQ
  • Performed data science and data analysis individual contributor duties for a tech startup.
  • Developed a YouTube video title recommender system for over two million creators.
  • Developed a regression model of YouTube search traffic for millions of daily keywords.
  • Expanded search traffic and video idea product coverage by up to 50% of corner cases.
  • Aggregated daily trends of popular sounds in TikTok for 50K+ users of the TikTok app.
  • Communicated results with managers, technical stakeholders, and content creators.
  • Assisted in hiring and onboarded five new data scientists.
  • Executed Agile and Waterfall projects with engineers and fellow data scientists.
Technologies: Python 3, TensorFlow, Jupyter Notebook, Amazon Web Services (AWS), Amazon EC2, Data Science, Spark, Regression, Unsupervised Learning, Supervised Learning, Data Analysis, Recommendation Systems, Deep Learning, Generative Pre-trained Transformers (GPT), Natural Language Processing (NLP), GPT, Data Visualization, SQL, Statistics, GitHub, Pandas, NumPy, Matplotlib, PyCharm, Natural Language Toolkit (NLTK), Scikit-learn, Asana, Keras, Statistical Analysis, Redshift, Sisense, Data Analytics, Git, Jupyter, Predictive Modeling, Machine Learning

Python and GraphQL Middleware API Engineer

2019 - 2020
Digitalware, Inc.
  • Wrote APIs and data handlers for various product platforms.
  • Designed and implemented data parsers for data ingestion.
  • Wrote APIs and services using JavaScript, Python, and Django.
Technologies: JavaScript, Python 3, Django

Postdoctoral Associate

2018 - 2019
Rutgers University
  • Predicted problems encountered by web searchers using supervised learning.
  • Designed a master's level course in research methods, teaching statistics and R in the library and information science department.
  • Assisted with grant writing on emerging big data infrastructure, deep learning, bias in information retrieval, and explainable recommendations.
  • Mentored research projects of undergraduate and graduate students.
Technologies: Amazon Web Services (AWS), Bootstrap, JavaScript, jQuery, PHP, Scikit-learn, MySQL, R, Python, Predictive Modeling, Machine Learning, Eye Tracking, Sequel Pro, REST

Graduate Student Research Assistant

2014 - 2018
Rutgers University
  • Predicted searcher intentions from web search patterns, improving accuracy by 1.5-8x over naïve baseline with logistic regression.
  • Mapped relationships between searcher demographics, search behaviors, and their search tasks using structural equation modeling, improving six of seven goodness-of-fit metrics.
  • Developed a structured learning approach to discover unknown relationships between searcher characteristics, search tasks, and behavior.
  • Designed a user-friendly interface to collect web browsing data using Python, HTML, and JavaScript in a Model-View-Controller framework.
  • Extracted crowd annotations of search logs using Amazon Mechanical Turk.
Technologies: Bootstrap, JavaScript, jQuery, PHP, Scikit-learn, Python, PHP 5, SQLite, PhpStorm, Predictive Modeling, Machine Learning, Eye Tracking, Sequel Pro, REST

Teaching Assistant

2012 - 2014
Rutgers University
  • Engaged undergraduate students in Computer Science courses.
  • Served as a teaching assistant for an undergraduate algorithm course for one year.
  • Acted as a teaching assistant for an undergraduate course in C programming for one year.
  • Held regular office hours on weekly bases for each course.
  • Assisted in the grading and designing of tests, periodic homework assignments, and group projects.
Technologies: Python, C

Graduate Research Fellow

2011 - 2012
Rutgers University
  • Attended regular multidisciplinary seminars intersecting computer science and psychology.
  • Involved in multidisciplinary coursework intersecting the aforementioned fields.
  • Took part in research in computer graphics regularly.
  • Engaged in coursework required by the computer science curriculum as required by my fellowship.
Technologies: Python, Python 2, Python 3, Scikit-learn

Computer Science Intern

2011 - 2011
National Security Agency
  • Participated in a 12-week internship program for computer science.
  • Contributed to an NSA's mission through the project.
  • Completed a technical paper as part of the project.
  • Worked in a collaborative group with a senior project on the project.
Technologies: Java

Other

Writing & Editing, Data Structures, Regression, Unsupervised Learning, Supervised Learning, Data Analysis, Recommendation Systems, Natural Language Processing (NLP), Data Visualization, Statistics, Data Analytics, Predictive Modeling, Machine Learning, Algorithms, Statistical Data Analysis, GPT, Generative Pre-trained Transformers (GPT), SFTP, SSH, Eye Tracking, Deep Learning, Statistical Analysis, Neural Networks, Bayesian Statistics, Time Series Analysis

Languages

Python 3, Python 2, JavaScript, Python, SQL, PHP 5, R, PHP, C, Java

Libraries/APIs

Pandas, Scikit-learn, jQuery, NumPy, Matplotlib, Natural Language Toolkit (NLTK), SciPy, TensorFlow, Keras

Tools

PyCharm, PhpStorm, Jupyter, Asana, Sequel Pro, GitHub, Terminal, Git, Sisense, LaTeX

Paradigms

Data Science, Object-oriented Programming (OOP), REST

Platforms

MacOS, Jupyter Notebook, OS X, Amazon Web Services (AWS), Linux, Amazon EC2, Docker

Storage

MySQL, SQLite, Redshift

Frameworks

Bootstrap, Laravel, Django, Spark

2011 - 2018

Ph.D. Degree in Computer Science

Rutgers University - New Brunswick, NJ, USA

2007 - 2011

Bachelor of Science Degree in Computer Science

Rutgers University - New Brunswick, NJ, USA

APRIL 2019 - PRESENT

Neural Networks and Deep Learning

Coursera

JUNE 2017 - PRESENT

Bayesian Statistics

Coursera

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring