Chen-Kuo Chen
Verified Expert in Engineering
Data Engineering Developer
Hong Kong, Taiwan
Toptal member since July 28, 2020
Chen-Kuo is a data engineer exceptionally good at web scraping, big data ETL (Python/SQL/Hadoop), and data model design. He is also a full-stack engineer who builds high-quality web applications via React and Django/Node.js. He built an entire data ETL infrastructure that integrates APIs and beautiful pulls data using a multi-process method. His strategies have reduced legacy processing times by 1/10 and have generated valuable insights for clients via data modeling and analytics.
Portfolio
Experience
Availability
Preferred Environment
Slack, MacOS, Linux
The most amazing...
...data infrastructure I delivered as a one-man engineering team included data sourcing, ETL, data visualization, and data analytics.
Work Experience
Full-stack Engineer
Self-employed
- Developed various web applications via React/Vue and Django.
- Created an online exam system for 400+ student simultaneously taking online exams. This is one example of the type of applications I've built.
- Optimized back-end performance via refined SQL queries. Designed load balancing server architecture.
Data Engineer
Tybource Capital Managemnt, Ltd.
- Developed an entire data infrastructure from scratch using Python and AWS infrastructure, including EC2, S3, Redshift, and EMR.
- Built various web scraping projects via Scrapy and Crawlera as the proxy pool.
- Deployed data analytics and modeling to generate valuable data insights.
Software Engineer
Machine Zone
- Created a web application via React, Node.js, and Django.
- Built a big data process via Hadoop and a real-time data streaming pipeline via Spark.
- Reduced the time of a legacy process to 1/10 by refactoring it to multi-processes.
Software Engineer
Synology
- Served as a full-stack web developer on Synology NAS using JavaScript and C/C++.
- Used TCP/IP networking programming in C/C++ for Synology DSM operating system.
- Built and launched three standalone apps on Synology DSM.
Experience
Data ETL Infrastructure
I built it using Python, AWS, S3, and Redshift EC2. I used Python to integrate with APIs and pull data via a multi-process method. Completed data cleaning with Python and SQL, put raw files into S3, and created Redshift tables on top of that. I ran Apache Airflow and Jupyter Notebook on EC2 to schedule tasks and provide a data analytic/data visualization platform.
Complete Web Application for Data Visualization
Online Web Application for Exams
The web application was built with Vue.js and Django and is capable of handling 400+ students online simultaneously. I was the full-stack engineer in charge of both front-end and back-end development.
Skills
Libraries/APIs
React, Node.js, Vue, REST APIs
Tools
Apache Airflow, Slack
Languages
SQL, Python, JavaScript, C++, C, Python 3
Paradigms
ETL, RESTful Development
Storage
Databases, MySQL, PostgreSQL, Redshift
Frameworks
Django, Spark
Platforms
Amazon Web Services (AWS), Linux, MacOS, Jupyter Notebook
Other
Web Scraping, Data Engineering, APIs, Data Visualization
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring