Toni Cebrián, Machine Learning Developer in Barcelona, Spain

Member since January 16, 2019
A rare mixture of data scientist and data engineer, Toni is able to lead projects from conception and prototyping to deployment at scale in the cloud.

Portfolio

  • D5.ai
    Google Cloud, NEO, Crypto, Python, Scala, Data Science
  • Coinfi
    PubSubJS, Data Flows, Apache Beam, Python, Apache Airflow, Data Science
  • Stuart
    Akka, Redshift, Apache Kafka, Apache Airflow, Scala, Python, Data Science

Experience

Location

Barcelona, Spain

Availability

Part-time

Preferred Environment

Linux

The most amazing...

...experience has been giving a talk on type classes in Scala at a local Scala meetup group.

Employment

  • Founder

    2019 - PRESENT
    D5.ai
    • Ingested the Bitcoin transaction graph into a Neo4j database, using Airflow to periodically crawl BigQuery tables of Bitcoin transactions.
    • Created asyncio web crawlers in Python to scrape websites with newsworthy content.
    • Maintained and evolved an SDK in Scala and Haskell for customers accessing web APIs from those languages.
    Technologies: Google Cloud, NEO, Crypto, Python, Scala, Data Science
  • Lead Data Engineer

    2018 - 2018
    Coinfi
    • Created the ETL orchestration systems using Airflow with Composer in Google Cloud.
    • Created scraping services for getting crypto data (prices, events, news) to ingest into the platform.
    Technologies: PubSubJS, Data Flows, Apache Beam, Python, Apache Airflow, Data Science
  • Head of Data Science

    2016 - 2018
    Stuart
    • Designed the company's data warehouse using Redshift.
    • Created a forecasting model for predicting driver logins to the platform and the deliveries to be served.
    • Architected an event sourcing system for complex event processing.
    • Deployed a route optimization algorithm for picking drivers based on route and package size.
    • Created the data science team from scratch.
    Technologies: Akka, Redshift, Apache Kafka, Apache Airflow, Scala, Python, Data Science
  • Chief Data Officer

    2014 - 2016
    Enerbyte
    • Architected the infrastructure for ingesting data from IoT devices.
    • Researched algorithms for energy disaggregation from a single point of measure.
    • Created the data science team from scratch.
    Technologies: Apache Kafka, Spark Streaming, Spark, Scala, Python, Data Science
  • Head of Data Science

    2012 - 2014
    Softonic
    • Created different ad placement algorithms.
    • Created a recommender system based on textual content from app reviews.
    • Created an improved search engine using machine learning and Solr.
    • Created the data science team from scratch.
    Technologies: Semantic Web, RDF, Word2Vec, Solr, Recommendation Systems, Spark, Hadoop, Scala, Python, Data Science

Skills

  • Languages

    Python 3, Scala, SQL, Haskell, C++, Java, Python, RDF
  • Frameworks

    Spark, Akka, Hadoop
  • Libraries/APIs

    Spark Streaming, Pandas, Scikit-learn, NumPy, PubSubJS, Python Asyncio, TensorFlow, XGBoost
  • Tools

    Apache Airflow, Cloud Dataflow, Apache Beam, Solr, Apache Avro
  • Paradigms

    Functional Programming, Data Science, Reactive Programming
  • Other

    Machine Learning, Akka HTTP, Data Mining, Crypto, NEO, Data Flows, Recommendation Systems, Word2Vec, Semantic Web
  • Platforms

    Apache Kafka, Linux
  • Storage

    Redshift, Cassandra, Google Cloud, Redis

Education

  • Master's Degree in Artificial Intelligence
    2009 - 2012
    Universitat Politecnica de Catalunya - Barcelona, Spain
  • Postgraduate Degree in Quantitative Techniques for Financial Products
    2009 - 2011
    Universitat Politecnica de Catalunya - Barcelona, Spain

Certifications

  • Cloudera Certified Hadoop Professional
    MAY 2012 - PRESENT
    Cloudera
