Gonzalo Andres Diaz
Verified Expert in Engineering
Data Engineer and Developer
Gonzales has around 10 years of experience learning and developing solutions with multiple languages and technologies. In the data science and big data scene, he became a data engineer, as it is his strongest suit. It's not rare to find him learning topics related to machine learning as he learns about the data engineering ecosystem and state-of-the-art big data platforms.
Portfolio
Experience
Availability
Preferred Environment
Amazon Web Services (AWS), GitHub, MacOS, PyCharm
The most amazing...
...thing I've evolved is the data infrastructure to handle real-world big data and customer needs.
Work Experience
Data Engineer
Rappi
- Embedded in a data science team that is recently adopting data engineering practices. In charge of the communication with platform and infrastructure teams. Set up the tooling, documentation, and data modeling, and planned the shape of the data pipelines moving forward.
- Set up a machine learning and data engineering orchestration tool for the company, using Airflow in collaboration with DevOps.
Data Engineer
Olapic
- Served as back-end analytics engineer, supporting the maintenance of the data infrastructure and ClickStream API.
- Authored an anomaly detector tool to identify problems in customer data to enable early detection and fixing of the underlying data and tooling.
- Collaborated on the design and implementation of the new version of the data pipeline, from Jenkins and SQL scripts to Apache Airflow.
- Collaborated with the design and implementation of the ETL to sync data from operational databases in RDS/Aurora (MySQL) to the data warehouse in Redshift.
- Supported the business intelligence team to write and maintain the funnels and reducers needed to fuel the reporting of the company.
Full-stack Developer
Bytelion
- Operated as back-end Node.js developer using ES6. Collaborated to write the data ingestion framework to provide a third-party news provider.
- Wrote and maintained the data pipelines that ingested the LawIQ proprietary information.
- Set up the codebase of the project. Delivered the MVP from scratch to the initial group of clients.
- Collaborated to improve the API and help improve the performance of the machine learning algorithm at SameGrain (iOS Social Network).
Software Developer
Santex America
- Developed and maintained several tools used to convert from DOCX and HTML to a proprietary XML variation and the other way around.
- Developed an HTML importer for proprietary XML uses JTidy, CssToXslfo, XSLFO, and XSLT. After months of improving this module, I took charge of the migration from XSLT to Java, using Jsoup to marshall HTML and Jaxb to unmarshal the proprietary XML.
- Uses Agile methodologies with a remote team in Argentina and the customer HQ in Ames, Iowa.
- Built in-house CI tool integrated with GitHub.
Software Development
Globant
- Developed and enhanced the insurance module of Orbitz World Wide and its entire platform.
- Built an internal tool to manage configuration files of the entire platform (XML files). Written an interface to Accurev (SCM) and Atlassian Jira.
- Coordinated with European product owners the products to include on each release.
- Developed the configuration change files (XML) requested by the product owners and coordinated with the release management team to include these changes in the next deployment. The goal was to deliver a configuration bundle every two weeks without production issues.
- Led a successful migration from Ant2 to Gradle.
- Prepared and maintained VMware virtual machines.
Experience
Insurance Vertical for Orbitz World Wide
MVP of a Social Media
Next Gen Data Infrastructure
LawIQ Ingestion Pipeline
https://www.lawiq.com/Skills
Languages
Python, SQL, Snowflake, Java, XML, XSD, XSLT, PHP
Tools
GitHub, PyCharm, Apache Airflow, Amazon Simple Queue Service (SQS), Apache, Celery, Jenkins, Vagrant, Periscope, Chartio, Amazon Simple Notification Service (Amazon SNS), Gradle, Apache Maven
Paradigms
Continuous Integration (CI), Object-oriented Programming (OOP), Functional Programming, Agile Software Development, Dimensional Modeling, Distributed Computing, Agile, Scrum
Platforms
MacOS, Linux, Amazon EC2, AWS Lambda, Heroku, Rakam, Amazon Web Services (AWS), Docker
Storage
JSON, Redshift, SQLite, MySQL, PostgreSQL, Distributed Databases, MongoDB, Amazon S3 (AWS S3)
Other
Amazon Kinesis Data Firehose, Data Engineering, Amazon Kinesis, MVP Design, Data Modeling, Streaming
Libraries/APIs
Pandas, Node.js, PySpark
Frameworks
Ruby on Rails (RoR), Ant Design, GAE, Express.js, Django, AngularJS
Education
Information System Engineer (Incomplete) Degree in Information System Engineering
UTN National University of Technology - Cordoba
Certifications
Reproducible Research
Coursera Course Certificates
Hadoop Starter Kit
Udemy
Data Analysis and Statistical Inference
Coursera
Exploratory Data Analysis
Coursera Verified Certificates
Getting and Cleaning Data
Coursera Verified Certificates
R Programming
Coursera Verified Certificates
The Data Scientist's Toolbox
Coursera Verified Certificates
Machine Learning
Coursera
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring