Toni Cebrián, Machine Learning Developer in Barcelona, Spain
Toni Cebrián

Machine Learning Developer in Barcelona, Spain

Member since October 7, 2018
A rare mixture of data scientist and data engineer, Toni is able to lead projects from conception and prototyping to deploying at scale in the cloud.
Toni is now available for hire

Portfolio

  • Coinfi
    Airflow, Python, Apache Beam, Dataflow, PubSub
  • Stuart
    Python, Scala, Airflow, Kafka, Redshift, Akka
  • Enerbyte
    Python, Scala, Spark, Spark Streaming, Kafka

Experience

  • SQL, 10 years
  • Machine Learning, 10 years
  • Scala, 8 years
  • Python 3, 6 years
  • Spark, 4 years
  • Akka, 4 years

Location

Barcelona, Spain

Availability

Part-time

Preferred Environment

Linux

The most amazing...

...experience has been teaching a typeclasses talk using Scala at a local Scala meetup group.

Employment

  • Lead Data Engineer

    2018 - 2018
    Coinfi
    • Created the ETL orchestration systems using Airflow with Composer in Google Cloud.
    • Created scrapping services for getting Crypto data (prices, events, news.) to ingest into the platform.
    Technologies: Airflow, Python, Apache Beam, Dataflow, PubSub
  • Head of Data Science

    2016 - 2018
    Stuart
    • Designed the company's data warehouse using Redshift.
    • Created a forecasting model for predicting drivers login into the platform and deliveries to be served.
    • Architected an event sourcing system for complex event processing.
    • Deployed a route optimization algorithm for picking drivers based on route and package size.
    • Created the data science team from scratch.
    Technologies: Python, Scala, Airflow, Kafka, Redshift, Akka
  • Chief Data Officer

    2014 - 2016
    Enerbyte
    • Architected the infrastructure for ingesting data from IoT devices.
    • Researched algorithms for energy disaggregation from a single point of measure.
    • Created the data science team from scratch.
    Technologies: Python, Scala, Spark, Spark Streaming, Kafka
  • Head of Data Science

    2012 - 2014
    Softonic
    • Created different add placement algorithms.
    • Created a recommender system based on textual content from app reviews.
    • Created an improved search engine using machine learning and Solr.
    • Created the data science team from scratch.
    Technologies: Python, Scala, Hadoop, Spark, Recommender Systems, Solr, Word2vec, RDF, Semantic Web, Triple Stores

Experience

Skills

  • Languages

    Python 3, Scala, SQL, Haskell, C++, Java
  • Frameworks

    Spark, Akka, Hadoop
  • Libraries/APIs

    Spark Streaming, Pandas, Scikit-learn, NumPy, Python Asyncio, TensorFlow, XGBoost
  • Tools

    Apache Airflow, Cloud Dataflow, Apache Beam, Apache Avro
  • Paradigms

    Functional Programming, Reactive Programming
  • Other

    Machine Learning, Akka HTTP, Data Mining
  • Platforms

    Apache Kafka
  • Storage

    Redshift, Cassandra, Redis

Education

  • Master's degree in Artificial Intelligence
    2009 - 2012
    Universitat Politecnica de Catalunya - Barcelona, Spain
  • Postgraduate degree in Quantitative Techniques for Financial Products
    2009 - 2011
    Universitat Politecnica de Catalunya - Barcelona, Spain
Certifications
  • Cloudera Certified Hadoop Professional
    MAY 2012 - PRESENT
    Cloudera

To view more profiles

Join Toptal
I really like this profile
Share it with others