Fernando Melchor, Data Scientist and Software Developer in Denver, CO, United States
Fernando Melchor

Data Scientist and Software Developer in Denver, CO, United States

Member since June 2, 2021
Fernando is a data scientist with a background in mechatronics engineering. He is passionate about cities and all the data that describes them, especially geospatial data. During his career, he designed, prototyped, and launched data products used worldwide in diverse sectors, such as automotive, government, on-demand streaming, and real estate. He has experience deploying machine learning models, data science projects, ETL processes in Airflow, and KPI initiatives in Tableau and Metabase.
Fernando is now available for hire

Portfolio

  • Flat.mx
    Python 3, PostgreSQL, Dash, Plotly, Scikit-learn, TensorFlow, Docker, AWS ECS...
  • Mexico City Government
    Python 3, Pandas, SQL, NetworkX, Plotly, Spark, SVMs, Scikit-learn, Neo4j
  • Discovery, Inc.
    Pandas, Python 3, SQL, Apache Hive, NetworkX, Plotly, Dash, Tableau, Cron...

Experience

Location

Denver, CO, United States

Availability

Part-time

Preferred Environment

Jupyter Notebook, Python 3, Apache Airflow, AWS, Tableau, Metabase, PostgreSQL, Neo4j, Scikit-learn, TensorFlow

The most amazing...

...thing I've developed is an ROI marketing campaigns tool (Tableau) at Discovery, Inc. The tool helps understand the campaign lifecycle and forecasts the ROI.

Employment

  • Chief Data Scientist

    2020 - PRESENT
    Flat.mx
    • Developed machine learning predictive models for real estate properties in Mexico City, including long and short-term rent prices and selling prices.
    • Built a machine learning DevOps framework to easily deploy and update models on ECS.
    • Developed an automated offer system that reduced visits to offer lead time from 13 days to minutes.
    • Created data visualizations dashboards to democratize data and insights inside the company. Led the KPI efforts to measure every business unit with continuous QA and improvement activities.
    • Developed complex Airflow pipelines to curate data from multiple sources and formats to feed the acquisition team with high conversion rates, leading to scaling the business.
    • Modeled the public transportation network of Mexico City to understand the access and centrality of different areas.
    Technologies: Python 3, PostgreSQL, Dash, Plotly, Scikit-learn, TensorFlow, Docker, AWS ECS, GitHub
  • Director of Data Architecture and Data Analysis

    2019 - 2020
    Mexico City Government
    • Developed the city’s security dashboard for the city’s police department for everyday reporting and crime tracking. This dashboard is a tool that is used daily for crime tracking and decision-making.
    • Created optimization algorithms for police distribution on the subway system, including a visualization tool with metrics and a graphic scheduler.
    • Diagnosed emergency response time of ambulances identifying the three main root causes of delays. The actions implemented reduced ten minutes the average response time.
    • Mentored a team of five data analysts and scientists with best practices and product development methodologies. Implemented on-hands training to develop ETL pipeline and database capabilities on the team.
    • Created a data analytics team to assist the mayor and other city departments with decision-making.
    • Presented and produced multiple exploratory data analyses to inform decision-makers regarding security, emergency response, and mobility.
    Technologies: Python 3, Pandas, SQL, NetworkX, Plotly, Spark, SVMs, Scikit-learn, Neo4j
  • Data Scientist | Researcher | Digital Analytics and Insights

    2018 - 2019
    Discovery, Inc.
    • Developed a data-product dashboard (Tableau) based on app reviews, including an automated ETL pipeline that gets the new reviews and ratings. At the core, I implemented a review text classification model using SVM, achieving 80% accuracy.
    • Developed the alarm dashboard in Tableau that helped to track performance across different platforms and the discovery ecosystem. It allowed exploring the daily performance of eight variables with an anomaly detection system.
    • Created a bipartite graph of streamers and shows for audience clustering. Performed network analysis to understand how audiences overlap in each channel. This data product is used by marketing and programming to guide their business strategies.
    • Implemented a network analysis framework to analyze a market research survey creating a visualization tool for the team to explore the survey results adding demographics filters, helping them design future products.
    • Developed the ROI marketing campaigns dashboard (Tableau) to measure marketing campaign performance. The dashboard helps visually understand the campaign lifecycle and forecasts the expected ROI.
    Technologies: Pandas, Python 3, SQL, Apache Hive, NetworkX, Plotly, Dash, Tableau, Cron, Natural Language Processing (NLP), Gensim, SpaCy, Support Vector Machines (SVM), Scikit-learn
  • Data Scientist

    2017 - 2018
    ARGO Labs, California Data Collaborative
    • Created an ETL pipeline that combines data from water utilities, a web scraper, and public APIs to identify the business type of a water user (commercial or institutional).
    • Designed a classification method that uses data from APIs and NLP to assign a business type to each customer.
    • Created an automated process to aggregate the census data from the block-group level to the water districts level. This allowed the water utilities to understand their customers and the research team to create an analysis using demographics.
    • Led a team of four to develop a water usage benchmark in CA by using publicly available data and water utility data.
    Technologies: Dash, Python 3, Natural Language Processing (NLP), Statistics, Benchmarking

Experience

  • MTA Subway Network and the L Train Closure Impact
    https://github.com/fernandomelchor/L-Train_Project

    Led the strategy, identification, and characterization of the affected areas to design a better contingency plan, measure the impact, and simulate subway users' behavior in case of a disruption using census data and spatial analysis.

    Simulation:
    https://github.com/fernandomelchor/NYC_MTA_Subway_Network

  • Automated Classification of Airbnb Listings in NYC
    https://github.com/fernandomelchor/Airbnb_Project/blob/master/Airbnb_Paper.pdf

    Developed an innovative Airbnb classification method based on cost, transportation connectivity, and businesses in the area.

    • Developed a geospatial scan algorithm that gathered characteristics of the area surrounding the Airbnb listing.
    • Created a transportation connectivity index based on the subway network using graph theory and spatial analysis.
    • Implemented clustering techniques and created visualization maps.

  • Analysis of the Mexican Senate — Published by Nexos
    https://parentesis.nexos.com.mx/?p=231

    By web scraping, I created a database of the senators and their votes. I applied PCA to visualize the senators' distribution identified by name and party. Then, I compared each senator to each party to uncover real tendencies beyond the official parliament groups. The data helped me to create a compelling story about the parliament and intra-party dynamics.

Skills

  • Libraries/APIs

    Shapely, Pandas, NetworkX, Scikit-learn, TensorFlow, SpaCy
  • Tools

    Tableau, Plotly, Apache Airflow, Cron, Gensim, AWS ECS, GitHub
  • Other

    Data Wrangling, EDA, Geospatial Analytics, GeoPandas, Data Analyst, Metabase, Spatial Analysis, Data Visualization, Dash, Statistics, AWS, Machine Learning, RESTful APIs, FastAPI, QGIS, Time Series, System Design, Principal Component Analysis (PCA), Web Scraping, Geospatial Data, K-means Clustering, Simulations, Natural Language Processing (NLP), Support Vector Machines (SVM), SVMs, Benchmarking
  • Languages

    Python 3, SQL
  • Paradigms

    ETL, Data Science
  • Platforms

    Jupyter Notebook, Docker
  • Storage

    PostgreSQL, Neo4j, Apache Hive
  • Frameworks

    Spark

Education

  • Master of Science Degree in Data Science and Urban Informatics
    2016 - 2017
    New York University - New York, USA
  • Bachelor's Degree in Mechatronics Engineering
    2006 - 2010
    Tecnológico de Monterrey - Mexico City, Mexico

Certifications

  • Data Science For All: Latin America 2020
    MAY 2020 - PRESENT
    Correlation One and SoftBank Group

To view more profiles

Join Toptal
Share it with others