Juan Manuel Ortiz de Zarate, Data Scientist and Developer in Ciudad de Buenos Aires, Buenos Aires, Argentina

Member since October 2, 2019
Currently, Juan is a PhD candidate at the University of Buenos Aires, researching the subjects of AI, NLP, and social networks. He has over a decade of professional development experience under his belt. For the last few years, he’s been immersing himself in various types of data science projects and loving every minute of it. Juan relishes taking on data problems, building prediction models, and learning state-of-the-art techniques.

Preferred Environment

RStudio, Jupyter Notebook

The most amazing...

...thing I've coded is an entire web app to monitor social networks. It has a back end in R for statistics and a PHP front end.


  • Head Teaching Assistant

    2020 - PRESENT
    University of Buenos Aires
    • Acted as the head teaching assistant for the Data Organization subject, which introduces students to data science. This subject is a mandatory part of the informatics engineering degree.
    • Designed the whole content of the subject together with another professor. Because of the lockdown, all online classes (in Spanish) are available at https://www.youtube.com/channel/UCWCkHWCIlFbzRSkKc363irw/videos.
    • Taught half of the theory classes and half of the practical lessons. Also coordinated the teachers of the practical lessons and prepared and corrected the final exams and practical works.
    • Designed the final test and the practical work that students must pass to complete the subject.
    Technologies: University Teaching, Data Science, Education, Machine Learning
  • Teaching Assistant

    2020 - PRESENT
    Universidad Católica Argentina
    • Composed practical lessons for the Data Science class.
    • Corrected exams and practical homework for the Data Science class.
    • Tutored and answered questions from students on data science topics.
    • Explained machine learning techniques, regularization methods, feature extraction and selection, data visualization methods, and other tasks related to data science.
    Technologies: Data Science
  • Ph.D. Student Researcher

    2017 - PRESENT
    Universidad de Buenos Aires
    • Created new techniques to analyze discussions on social networks with R and Python.
    • Predicted movie reviews using the IMDb database with R.
    • Predicted implication clauses with NLP through Python models.
    • Developed new techniques to predict controversy with NLP techniques on social networks with R and Python.
    • Created new techniques to graph clusters with NLP techniques on social networks with Python and R.
    Technologies: R, Python
  • Freelance Data Scientist Advisor

    2016 - PRESENT
    Massomedia S.A.
    • Predicted presidential votes via telephone surveys using Python.
    • Analyzed several discussions on Twitter and Facebook using R.
    • Developed a web app using R, PHP, and MySQL to monitor social networks and media.
    • Built a product with Python that could analyze telephone surveys about product sales.
    • Presented the results, conclusions, and explanations for each task to the client.
    Technologies: Python, R
  • Data Scientist

    2021 - 2021
    Carrie Beam Consulting
    • Created new functionality in R that, given a graph of financial relations, suggests new unions that improve trade between the actors.
    • Optimized algorithms on graphs so that they can work on large data sets.
    • Found bugs in existing R code and suggested fixes.
    Technologies: Graphs, igraph, R, Networks, Computer Science
  • Statistical Developer

    2020 - 2020
    Decentral Park Advisors LLC (via Toptal)
    • Predicted Bitcoin 1D, 3D, and 7D returns with multinomial regressions and GAM models.
    • Predicted whether Bitcoin 1D, 3D, and 7D returns would be positive or negative using classification models such as random forest, XGBoost, and bagging.
    • Reported the results through Jupyter Notebooks and R Shiny dynamic graphs.
    Technologies: RStudio Shiny, R, Jupyter, Matplotlib, Pandas, Scikit-learn, Python
  • Data Analyst

    2020 - 2020
    LL Media, LLC (via Toptal)
    • Standardized multiple information sources about leads using Python and Pandas.
    • Scored the performance of each source over different kinds of campaigns using Python, Pandas, and Matplotlib.
    • Predicted good leads by demographic data using machine learning classifiers (sklearn).
    • Predicted bad leads by demographic data using machine learning classifiers (sklearn).
    • Analyzed lead data to find simple correlations between good and bad lead performance.
    Technologies: Matplotlib, Pandas, Scikit-learn, Python
  • Teaching Assistant

    2017 - 2018
    Universidad de Buenos Aires
    • Composed practical lessons for the Computer Structures 1 class.
    • Corrected exams and practical homework for the Computer Structures 1 class.
    • Tutored and answered questions from students in the Computer Structures 1 class.
    Technologies: Structure, Computers
  • Teaching Assistant

    2016 - 2017
    Universidad de Buenos Aires
    • Composed and gave practical lessons for the Network Theory class.
    • Corrected exams and homework for the Network Theory class.
    • Tutored and answered questions from students in the Network Theory class.
    Technologies: Network Theory
  • Senior Front-end Developer

    2014 - 2015
    • Developed and maintained a system for journalists that allowed them to write different types of articles and publish them to the news site.
    • Built and maintained a system to manage advertising revenue.
    • Developed a REST API to connect with other news media sites.
    Technologies: JavaScript, MySQL, PHP
  • Senior Front-end Developer

    2010 - 2014
    • Built and maintained a system for call-center assistance.
    • Developed and maintained features to communicate with the STB systems and reset them.
    • Constructed and supported a dynamic decision tree to give the best answers to clients based on their specific problems and configurations.
    • Created and maintained an internal ticket system to organize tasks and assign them to different teams.
    Technologies: jQuery, JavaScript, SQL, PHP
  • Principal Developer

    2008 - 2010
    • Developed the company's administrative system.
    • Built a system to manage technical services.
    • Maintained the stock system.
    • Implemented a system to print digital photos from a Kodak machine.
    • Developed the company's financial system.
    Technologies: JavaScript, MySQL, PHP


  • Application to Monitor Social Networks

    I developed, on my own, an entire application to monitor social networks like Instagram, Facebook, and Twitter. It provides statistics about any public account needed and also sends messages and alarms to the client through Telegram if something important is happening.

    It has a back end in R to download information, process it, and calculate statistics and a PHP front end to show the data. I also used C to manage sessions, create, delete and modify searches, and more.

  • Stars Realigned: Improving the IMDb Rating System

    IMDb ratings have genre bias: Dramas tend to score higher, for example. Is there a way to remove such biases and discover what makes a movie unique?

    In this article, I show you how to refine IMDb scores and create a better ranking system through data science and machine learning techniques.

  • A Glimpse Into the Future of Data Science

    Data science is changing the world; it is at the heart of the fourth technological revolution. But how did we get here? How is the world changing? What else does this future hold?
    In this article, I discuss the arrival of data science in our lives, how we got here, some representative cases, and where we are going.

  • 10 Best Data Science Development Frameworks to Use in 2021

    In a world where data is more valuable than oil, the demand for data scientists and analysts is skyrocketing. In this article, I present the best tools for tapping into these data reserves. Hands down, Python is the clear choice for any aspiring developer trying to break into the field of data analysis.

  • Hiring Data Scientists — Best Practices and Job Description Template

    Hiring an IT candidate is one of the hardest tasks that human resources professionals have to accomplish. Demand for IT professionals exceeds the number of individuals available on the market, which produces competition between companies for the scarce qualified developers.
    In this article, I advise you on how to improve your candidate search and hire the best profiles for your team.

  • Stars Realigned: Improving the IMDb Rating System (Publication)
    IMDb ratings have genre bias: For example, dramas tend to score higher. Removing common feature bias and keeping unique characteristics, it's possible to create a new, refined score based on IMDb information.


  • Languages

    PHP 7, Python 3, R, SQL, JavaScript, CSS, Python, PHP
  • Frameworks

    RStudio Shiny, CodeIgniter
  • Libraries/APIs

    igraph, Scikit-learn, Matplotlib, Pandas, Keras, Ggplot2, NumPy, jQuery, Caret
  • Paradigms

    Data Science, Test-driven Development (TDD)
  • Platforms

    RStudio, Jupyter Notebook, Linux, WordPress, Oracle
  • Storage

  • Other

    Data Visualization, Charts, Social Networks, Social Network Analysis, Visualization Tools, OOP Designs, Machine Learning, Data Analytics, Data Analysis, Big Data, Time Series, Time Series Analysis, Clustering, Statistics, Network Theory, Computers, Structure, Education, University Teaching, Stock Market, Stock Trading, Graphs, Networks, Computer Science, Writing & Editing, Hiring
  • Tools

    Dplyr, Seaborn, Jupyter


  • Ph.D. degree (in progress) in Computer Science
    2017 - 2022
    Universidad de Buenos Aires - Buenos Aires, Argentina
  • Master's degree in Computer Science
    2010 - 2016
    Universidad de Buenos Aires - Buenos Aires, Argentina


  • Deep Learning
  • Machine Learning
  • Laboratory of Machine Learning
    ITBA | Instituto Tecnológico de Buenos Aires
