Luis Villavicencio, Data Engineering Developer in Montreal, QC, Canada

Member since June 24, 2020
Luis has more than nine years of technical experience as a software engineer, data developer, and data analyst. He has been focusing on big data technologies and machine learning models for the past three years. Luis is a versatile, analytical, and methodical professional.


  • Intelcom
    Snowflake, Python, SQL, JavaScript, Data Governance, Data Engineering...
  • Consultant
    Google Data Studio, Google Cloud Platform (GCP), Python, Keras, Flume...
  • Muraflex
    Pandas, NumPy, Scikit-learn, Python, SAP HANA, SQL, Microsoft Power BI



Montreal, QC, Canada



Preferred Environment

Google Cloud Platform (GCP), Snowflake, Data Building Tool (DBT), AWS

The most amazing project I have worked on is a data and ML solution to predict fire events in Montreal.


  • Senior Data Engineer | Lead Analytics

    2020 - PRESENT
    • Designed data integration flows using available platforms such as AWS (Lambda, Kinesis Firehose, S3, and more), Azure (Data Factory, Azure SQL DB, etc.), Snowflake (SQL, Tasks, Stream, and Pipe), BigQuery, Apache Pulsar (streaming), and DBT.
    • Fed data into the various analytical environments, such as OPS datamarts and finance.
    • Searched continuously for possible improvements to the analytical platform to better support data analytics needs, such as orchestration, architecture, data models, data quality, performance, DataOps, and MLOps.
    • Contributed to both technical and user-oriented documentation of reports and metrics.
    Technologies: Snowflake, Python, SQL, JavaScript, Data Governance, Data Engineering, Data Building Tool (DBT), ETL, ELT, Microsoft Power BI
  • Data Engineer Consultant

    2016 - PRESENT
    • Designed a proof of concept for Twitter data analyses and data ingestion using Apache Flume to a Hadoop environment.
    • Developed a machine learning model to predict fire events in the city of Montreal. The models were built in Python (Keras) using neural networks.
    • Designed a proof of concept for an IoT project with data ingestion from different IoT devices to GCP and data visualization using Google Data Studio.
    Technologies: Google Data Studio, Google Cloud Platform (GCP), Python, Keras, Flume, Apache Pig, Impala, Apache Hive, Hadoop
  • Principal Data Engineer

    2017 - 2021
    • Managed all data and analytics projects. Provided data-architecture design leadership and direction to reach the realization of enterprise goals. Served as a leader for data-transformation initiatives from on-premise to cloud-data technologies.
    • Built robust, measurable processes and tools, and engineered data-management solutions to support all data platforms, such as data warehouses, data lakes, reporting/BI tools, data science, data catalogs, and data visualization.
    • Worked with developers and analysts to understand their daily workloads and suggest techniques and tools for optimization.
    • Managed relationships with IT partners and vendors, security, and other teams across the company.
    • Worked with data from different sources (SQL, NoSQL, text, Excel, and CSV files) to get business insights and build ETL pipelines.
    • Designed and created data models as needed. Served as an expert in SQL and different data-management systems.
    • Designed, built, and deployed various Power BI applications, from data extraction to data visualization, providing advanced dashboards and insights about user activity logs, CRM, and ERP systems.
    • Worked alongside other team members and stakeholders to gather business requirements and design KPIs.
    Technologies: Pandas, NumPy, Scikit-learn, Python, SAP HANA, SQL, Microsoft Power BI
  • BI Consultant | Power Platform

    2020 - 2020
    • Created a cost-effective solution to access key reporting and analytics. The solution was based on Microsoft Power Apps with secure, multi-device access. It involved dashboards in Power BI and data connections to SharePoint lists.
    • Trained the client to use Microsoft Power Apps and the admin portal so that they could manage the solution independently.
    • Helped improve Power BI reports and provided guidelines for the overall analytics solution.
    Technologies: SharePoint, Microsoft Power BI, Microsoft Power Apps
  • Manager BI

    2015 - 2016
    • Managed the BI initiatives, ensured task execution, and controlled the requirements. Managed stakeholders' expectations and led the team that developed a BI platform.
    • Coordinated activities throughout the software development life cycle. Prioritized requests from the central operations department.
    • Designed a datamart and visualization tools for the business operation of the company.
    • Optimized the data architecture, recognized system deficiencies, and implemented effective solutions, including data warehouse creation.
    Technologies: JavaScript, Microsoft SQL Server, MySQL, Oracle, HTML, PHP, Java
  • Data Analyst

    2012 - 2015
    • Created different dashboards and reports to provide insights about the status of the business operation.
    • Presented the insights generated through analyses to all types of audiences in a storytelling format.
    • Worked with management to prioritize business and information needs. Proposed and implemented various improvements in business processes, data architecture, and definition of optimal KPIs and visual types.
    Technologies: Microsoft SQL Server, MySQL, Oracle, JavaScript, CSS, HTML, PHP


  • Bixi Montreal

    BIXI Montréal is a public bicycle sharing system serving Montréal, Quebec, Canada.
    The project was required to provide insights into user behavior and peak usage. This analysis helped to optimize bike availability and service in the city.
    These are some of the points that were addressed:
    - When do we see the highest number of rides? (season, day of the week, time)
    - What are the busiest bike stations?
    - What are the most common starting and destination areas? (popular routes)
    - How does behavior differ between members and non-members, in terms of route duration and user speed?
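
    As a rough illustration of how questions like these could be answered, here is a minimal sketch in plain Python. It is not the code used in the project: the `RIDES` records, their field names, and the `summarize` helper are all hypothetical, and a real analysis would run on the full ride history rather than four sample rows.

```python
from collections import Counter, defaultdict
from datetime import datetime
from statistics import mean

# Hypothetical sample rides; field names and values are illustrative only.
RIDES = [
    {"start": "2019-07-01 08:15", "from": "A", "to": "B", "member": True, "minutes": 12},
    {"start": "2019-07-01 08:40", "from": "A", "to": "C", "member": True, "minutes": 9},
    {"start": "2019-07-01 17:05", "from": "B", "to": "A", "member": False, "minutes": 25},
    {"start": "2019-07-06 14:30", "from": "A", "to": "B", "member": False, "minutes": 31},
]


def summarize(rides):
    """Count rides by hour, weekday, start station, and route,
    and average the ride duration for members vs. non-members."""
    by_hour = Counter()
    by_weekday = Counter()  # Monday is 0, Sunday is 6
    by_station = Counter()
    by_route = Counter()
    durations = defaultdict(list)

    for ride in rides:
        started = datetime.strptime(ride["start"], "%Y-%m-%d %H:%M")
        by_hour[started.hour] += 1
        by_weekday[started.weekday()] += 1
        by_station[ride["from"]] += 1
        by_route[(ride["from"], ride["to"])] += 1
        group = "member" if ride["member"] else "non-member"
        durations[group].append(ride["minutes"])

    return {
        "peak_hour": by_hour.most_common(1)[0][0],
        "peak_weekday": by_weekday.most_common(1)[0][0],
        "busiest_station": by_station.most_common(1)[0][0],
        "top_route": by_route.most_common(1)[0][0],
        "avg_minutes": {group: mean(values) for group, values in durations.items()},
    }


if __name__ == "__main__":
    print(summarize(RIDES))
```

    The same grouping logic carries over to a dataframe or SQL `GROUP BY` once the data volume grows; the sketch only shows the shape of the aggregation.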


  • Languages

    SQL, Python 3, Python, Snowflake, PHP, Java, HTML, JavaScript, CSS
  • Tools

    Microsoft Power BI, Git, Apache Impala, Tableau, Apache Sqoop, BigQuery, Microsoft Excel, Tableau Desktop Pro, Microsoft Power Apps, Impala, Flume
  • Storage

    MySQL, Databases, Apache Hive, HBase, NoSQL, PostgreSQL, Microsoft SQL Server
  • Other

    Data Visualization, Data Modeling, Reporting, Machine Learning, Data Engineering, Apache Flume, Google Data Studio, ETL Tools, Data Cleaning, Big Data, Data Migration, Excel Macros, Tableau Server, Data Analysis, Data Building Tool (DBT), Pulsar, Data Governance, AWS, ELT
  • Frameworks

  • Libraries/APIs

    JSON API, REST APIs, PySpark, Scikit-learn, NumPy, Pandas, Keras
  • Paradigms

    Database Design, ETL, Business Intelligence (BI), Data Science
  • Platforms

    Apache Pig, Amazon Web Services (AWS), Jupyter Notebook, Docker, SAP HANA, Visual Studio Code, Google Cloud Platform (GCP), Eclipse, SharePoint, Oracle


  • Bachelor's Degree in Engineering
    2006 - 2010
    UDEP - Lima - Peru


  • Data Science and Machine Learning
    McGill University
