Edgar Rootalu, Developer in Tallinn, Estonia
Edgar is available for hire
Hire Edgar

Edgar Rootalu

Verified Expert  in Engineering

Bio

Edgar is a data engineer with eight years of experience working with teams in California, the UK, Germany, and Singapore. He's proficient in Python, SQL, Google Cloud, and Clickhouse technologies. He specializes in data warehousing, analytics, and LLM integrations. For the past three years, Edgar has been building the internal data platform of Nansen.ai, which is the leading blockchain data provider in the web3 space.

Portfolio

Nansen
Python, SQL, ClickHouse, Google Cloud, Google BigQuery
Springboard
Google Cloud Platform (GCP), Google BigQuery, Chartio, Segment, Pandas, Python
Freelance Consultant to Digital Peace Talks
Scikit-learn, Natural Language Processing (NLP)...

Experience

  • Data Analysis - 5 years
  • SQL - 5 years
  • Tableau - 5 years
  • Python - 4 years
  • Data Science - 4 years
  • Pandas - 4 years
  • Google Cloud Platform (GCP) - 3 years
  • Natural Language Processing (NLP) - 2 years

Availability

Part-time

Preferred Environment

Google Cloud Platform (GCP), SQL, Python, LangChain, ClickHouse, OpenAI API, Segment, Business Intelligence (BI), Data Build Tool (dbt), OpenAI

The most amazing...

...project I've delivered is a comprehensive current and historical wallet PnL monitor of all blockchain wallets spanning 10+ blockchains.

Work Experience

Senior Data Engineer

2022 - 2025
Nansen
  • Built several crucial user-facing analytics dashboards that crunched 100+ TB data and contributed tens of thousands in recurring revenue.
  • Managed dbt repositories of 2,000+ data models, materializing tens of blockchain activity metrics for our users.
  • Managed a complex project of migrating hundreds of data models from a BigQuery back end to a ClickHouse back end, improving query latency from tens of seconds to sub one second.
Technologies: Python, SQL, ClickHouse, Google Cloud, Google BigQuery

Data Scientist

2018 - 2022
Springboard
  • Defined granular user event data collection with segment.com.
  • Built user models with Python to understand customer LTV.
  • Implemented a Markov chains-based conversion credit attribution model to understand the USD value of every marketing activity.
  • Built interactive dashboards for user acquisition and content teams to advise strategy based on customer journey data.
  • Used Python geographical data tools to predict conversion across US and Canadian cities.
Technologies: Google Cloud Platform (GCP), Google BigQuery, Chartio, Segment, Pandas, Python

NLP Engineer

2018 - 2019
Freelance Consultant to Digital Peace Talks
  • Used LDA topic modeling to find structure in free text opinions about climate change.
  • Implemented t-SNE clustering algorithm to cluster and visualize opinions on climate change.
  • Built an interactive Google Data Studio dashboard to allow stakeholders to explore modeling results.
Technologies: Scikit-learn, Generative Pre-trained Transformers (GPT), Natural Language Processing (NLP), Google Cloud Platform (GCP), Python

Data Scientist

2016 - 2018
Ventura TRAVEL
  • Developed a predictive product demand and capacity alert system communicated through personalized interactive dashboards using R, Python, and Tableau.
  • Created a real-time profit forecasting solution of each sale with regression models in Python and R.
  • Implemented a Markov chains-based conversion credit attribution model for analyzing returns on marketing spend (RoAS).
  • Developed a daily updating competitor market share analysis tool using web scraping, R, and Google Data Studio.
  • Integrated operations, finance, and marketing departments' data to create a unified source of truth for data science.
Technologies: Amazon Web Services (AWS), Google Data Studio, Tableau, Snowplow Analytics, Google BigQuery, R, Python

Experience

Opinion Modeling and Visualization

https://github.com/The-Edgar/Experiment-2-Clustering
In order to improve the quality of public online debate, in collaboration with the NGO Digital Peace Talks, I used LDA topic modeling and t-SNE clustering algorithms to find structure in free text opinions about climate change. I created an interactive dashboard that would allow the stakeholders to play around with parameters and find clusters in opinions.

Population Data Exploration Dashboard

https://datastudio.google.com/u/0/reporting/1qXpSwr99bAYhKVUu6LBQiT8MQlQ3egv1/page/KlT7
A Google Data Studio dashboard for exploring multiple years of census data in 54 countries. In order to understand whether there's a trend of people moving from smaller cities to bigger ones, I calculated the Gini coefficient of cities' populations per country over time. The results can be explored in the link displayed.

Education

2012 - 2015

Bachelor's Degree in Economics and Statistics

University of Amsterdam - Amsterdam, The Netherlands

Certifications

SEPTEMBER 2018 - PRESENT

Achieving Advanced Insights with BigQuery

Coursera

Skills

Libraries/APIs

Scikit-learn, Pandas, PyData, SpaCy, Natural Language Toolkit (NLTK), Web3.js, OpenAI API

Tools

Tableau, Jupyter, Git, Snowplow Analytics, Stitch Data, Microsoft Excel, GitHub, Chartio, Looker, AutoML, Plotly, Microsoft Power BI

Languages

SQL, Python, R, Python 3

Paradigms

Business Intelligence (BI), ETL

Platforms

Jupyter Notebook, Google Cloud Platform (GCP), Blockchain, Amazon Web Services (AWS)

Storage

MySQL, Data Validation, PostgreSQL, MongoDB, Redshift, ClickHouse, Google Cloud

Frameworks

Jinja

Other

Google BigQuery, Data Analysis, Segment, Data Analytics, Data Science, Data Mining, Statistics, Education Technology (Edtech), Tourism, Neural Networks, Data Engineering, Google Data Studio, Machine Learning, Natural Language Processing (NLP), Dash, Data Build Tool (dbt), geopy, Fintech, Generative Pre-trained Transformers (GPT), LangChain, OpenAI

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring