Valentin Nikotin, Developer in Lahti, Finland

Valentin Nikotin

Verified Expert in Engineering

Data Engineering Developer

Location
Lahti, Finland
Toptal Member Since
September 15, 2021

Valentin is a skillful data engineer with over 10 years of experience in the information technology and services industries. He's skilled in building scalable performance-critical distributed systems, data pipelines, and data visualization. Valentin is currently leading a team of data and ML engineers.

Portfolio

Quantori
Google Cloud Platform (GCP), TensorFlow, PyTorch, Python, Java, Scala...
Pythian
Scala, Spark, Apache Beam, SQL, Java, Python, Hadoop...

Experience

Availability

Part-time

Preferred Environment

Windows 10, Slack, Google Cloud Platform (GCP), PyCharm, IntelliJ IDEA, Git, Shell, PL/SQL, Oracle, Shell Scripting, Spark, SQL, Hadoop, Tableau, Social Media, Data Visualization, ETL, Data Modeling, Snowflake, Databricks, Analytics, BigQuery, Looker, Google BigQuery, Google Data Studio, OCR, Data Pipelines, Data Architecture, Data Warehouse Design, Leadership

The most amazing...

...project I've developed is a reusable pipeline for streaming data ingestion with transparent schema inference and schema evolution.

Work Experience

Senior Director

2021 - PRESENT
Quantori
  • Supervised ML and data engineers and improved the team's competence and expertise.
  • Developed automated batch and streaming data pipelines and libraries that infer and merge schemas to process semi-structured data.
  • Extended the existing functionality of Google Cloud SDK libraries.
  • Developed an ensemble of models to predict a molecular graph from its graphical representation. Created transformer-based algorithms for object detection. Developed a chart recognition architecture.
Technologies: Google Cloud Platform (GCP), TensorFlow, PyTorch, Python, Java, Scala, Apache Beam, Akka, Management, Big Data, Streaming, Amazon Web Services (AWS), Data Engineering, Oracle, Shell Scripting, Spark, SQL, Hadoop, Social Media, Data Visualization, ETL, Data Modeling, Snowflake, Databricks, Analytics, BigQuery, Looker, Google BigQuery, Google Data Studio, OCR, Data Pipelines, Data Architecture, Data Warehouse Design, Leadership

Cloud Data Platform Developer

2012 - 2021
Pythian
  • Developed and delivered solutions for ingesting and processing performance counter data from thousands of sources.
  • Developed a framework for gaming companies to process gaming events in a cloud warehouse that can generate real-time predictions through deployed ML models.
  • Performed analysis and tuning for existing systems.
Technologies: Scala, Spark, Apache Beam, SQL, Java, Python, Hadoop, Google Cloud Platform (GCP), Apache Airflow, Pub/Sub, Event-driven Architecture, PL/SQL, Oracle, Shell Scripting, Tableau, Social Media, Data Visualization, ETL, Data Modeling, Databricks, Analytics, BigQuery, Looker, Google BigQuery, Google Data Studio, OCR, Data Pipelines, Data Architecture, Data Warehouse Design, Leadership

Technical Review of Oracle PL/SQL Programming, Sixth Edition

I performed a technical review of Oracle PL/SQL Programming, Sixth Edition, by Steven Feuerstein, acting as the primary technical reviewer for this edition. I verified that the new 12.1 content was accurate, went through the "stable" chapters from past editions, and found many ways to improve them.

2000 - 2005

Master's Degree in Mathematics and Computer Science

Saint Petersburg University - Saint Petersburg, Russia

Libraries/APIs

TensorFlow, PyTorch

Tools

Apache Beam, BigQuery, Apache Avro, Tableau, Looker, PyCharm, IntelliJ IDEA, Git, Shell, Google Cloud Composer, Apache Airflow, Slack, AWS Glue, Amazon Athena

Frameworks

Spark, Hadoop, Akka

Paradigms

Functional Analysis, ETL, Event-driven Architecture, Management

Languages

Scala, SQL, Java, Python, Snowflake

Platforms

Google Cloud Platform (GCP), Oracle, Databricks, Apache Kafka, Amazon Web Services (AWS)

Industry Expertise

Social Media

Storage

PL/SQL, Data Pipelines, Redshift

Other

Mathematical Analysis, Computer Science, Data Engineering, Big Data, Big Data Architecture, Google Pub/Sub, Data Visualization, Data Modeling, Analytics, Google BigQuery, Google Data Studio, Data Architecture, Data Warehouse Design, Parquet, Pub/Sub, Shell Scripting, OCR, Leadership, Windows 10, Google Cloud Build, Streaming
