Tianshi Zhu, Software Developer in San Francisco, CA, United States
Tianshi Zhu

Software Developer in San Francisco, CA, United States

Member since April 3, 2020
Tianshi loves optimizing distributed systems, big data pipelines, and real-time streaming applications. It thrills him when an out-of-memory issue is fixed or a Spark job becomes 50% faster, or a streaming application's throughput is increased by 10x. Tianshi looks forward to building the next reliable, scalable, and highly available distributed system.
Tianshi is now available for hire

Portfolio

  • Stripe
    Redis, Apache Kafka, Hadoop, Scalding, Spark, Python, Java, Scala
  • LinkedIn
    Python, Apache Pig, Apache Avro, Apache ZooKeeper, Apache Lucene...
  • Oracle
    OpenStack, Apache ZooKeeper, Java, C++

Experience

Location

San Francisco, CA, United States

Availability

Part-time

Preferred Environment

G Suite, Zoom, Slack, Windows

The most amazing...

...optimization I've done is a 100-line change that makes Redis transactions faster by 20x.

Employment

  • Staff Software Engineer

    2017 - PRESENT
    Stripe
    • Made the ML feature pipeline faster and more reliable which saved $3 million annually.
    • Optimized a data pipeline's performance by 20x so a product can be launched on time.
    • Rebuilt an online feature store based on a Redis cluster and Lua script that cuts latency from 100 milliseconds to 5 milliseconds (ms).
    • Led a team to design/implement a multivariate experimentation service that can handle 3,000 QPS (queries per second) per node and a client-side p50 of 5 ms.
    Technologies: Redis, Apache Kafka, Hadoop, Scalding, Spark, Python, Java, Scala
  • Senior Software Engineer

    2014 - 2017
    LinkedIn
    • Implemented a serialization system which reduced the p50 latency by 30% and CPU usage by 15%, and achieved $2.5M annual saving.
    • Mentored an intern to design and implement a prototype for facet search, and boosted exploratory search CTR by 15%.
    • Migrated 80% of Linkedin's search traffic from a legacy search system to the new APIs.
    Technologies: Python, Apache Pig, Apache Avro, Apache ZooKeeper, Apache Lucene, Apache Kafka, Java
  • Member of Technical Staff

    2013 - 2014
    Oracle
    • Developed an OpenStack Neutron plugin for Oracle Virtual Network Controller.
    • Built a UI to visualize and automate virtual network setup.
    • Developed an API back end to interact with OpenStack and virtual machines.
    Technologies: OpenStack, Apache ZooKeeper, Java, C++

Experience

  • An ML Feature System That Supports Batch and Real-time Processing

    An ML feature system that allows users to define the business logic in one place and compiled it to multiple back ends, including Spark, Scalding, and real-time streaming processing. The machine learning (ML) features are guaranteed to not have time-traveling issues.

Skills

  • Frameworks

    Spark, Hadoop, Django
  • Libraries/APIs

    Scalding, Apache Lucene
  • Platforms

    Apache Kafka, Windows, Apache Pig, OpenStack, Amazon Web Services (AWS), Kubernetes
  • Other

    Big Data Architecture, Distributed Systems, AWS
  • Languages

    Scala, Java, Python, C++
  • Paradigms

    MapReduce, Functional Programming
  • Storage

    Redis, PostgreSQL
  • Tools

    Slack, Zoom, G Suite, Apache ZooKeeper, Apache Avro

Education

  • Master's degree in Telecommunications
    2011 - 2013
    University of Pennsylvania - Philadephia, PA, USA
  • Bachelor's degree in Telecommunications
    2007 - 2011
    Fudan University - Shanghai, China

To view more profiles

Join Toptal
Share it with others