Diego Fustes Villadoniga, Developer in A Coruña, Spain
Diego is available for hire
Hire Diego

Diego Fustes Villadoniga

Verified Expert  in Engineering

Software Developer

Location
A Coruña, Spain
Toptal Member Since
September 18, 2015

Diego is a Ph.D. in Computer Science and has worked for several global companies as a Back-end Engineer. He combines the skills of a scientist and a developer, making him aptly suited for complex projects. Diego specializes in Big Data and Machine Learning, the topics focused on for his Ph.D. Despite his many years of experience, Diego is still always looking for new challenges.

Portfolio

Toptal Clients
Keras, PyTorch, Angular, Gensim, Scikit-learn, MongoDB, Elasticsearch, Python
Oesia Networks
Cloudera, Impala, NiFi, Flink, Solr, Spark, Hadoop
NDT GLobal
Machine Learning, HBase, Spark, Python, Scala, Java

Experience

Availability

Part-time

Preferred Environment

Apache Maven, IntelliJ IDEA, Jenkins, Git, Linux

The most amazing...

...thing I've implemented is a set of both supervised and unsupervised classification algorithms on top of Hadoop to classify one billion stars.

Work Experience

Big Data Developer

2017 - PRESENT
Toptal Clients
  • Developed several prototypes to automate customer service activities.
  • Worked with Elasticsearch advanced capabilites to create a state-of-the-art search engine for technical documentation.
  • Created a self-service machine learning application to categorize customer requests based on text descriptions.
  • Developed a speech analytics app with full audio processing capabilities: speech-to-text, classification, diarization, topics discovering, etc.
  • Used Docker to ensure app portability and OpenAPI to define REST services.
Technologies: Keras, PyTorch, Angular, Gensim, Scikit-learn, MongoDB, Elasticsearch, Python

Big Data Software Architect

2016 - 2017
Oesia Networks
  • Designed and developed a full cybersecurity application, based on both real-time and batch big data technologies.
  • Architected and implemented an analytic data lake based on the Cloudera distribution for Hadoop, with enhanced security and integration.
Technologies: Cloudera, Impala, NiFi, Flink, Solr, Spark, Hadoop

Big Data and Machine Learning Expert

2014 - 2016
NDT GLobal
  • Developed a big data back-end to store and analyze all ultrasonic pipeline inspection files produced by the company in its history (estimated 1 petabyte).
  • Implemented a REST service to access and manipulate inspection data, stored in HBase, from different applications. Developed clients for the REST service in C# and Python for language integration.
  • Enabled data mining and business intelligence by integrating several data sources and performing batch Spark processing.
  • Fulfilled the role of product owner for the back-end, defining the product functionality, interacting with stakeholders and participating in management.
  • Implemented a complex scientific algorithm to perform automated data analysis.
Technologies: Machine Learning, HBase, Spark, Python, Scala, Java

Java Developer

2014 - 2014
eDreams Odigeo
  • Developed the eDreams eCommerce platform for online travel booking.
  • Collaborated with one hundred developers.
  • Gained experience with critical release processes.
  • Worked for the private sector.
  • Gained knowledge about the eCommerce world.
Technologies: Jenkins, Mercurial, Ant Design, Java

R&D Software Engineer

2010 - 2014
University of A Coruña
  • Designed and implemented AI techniques for massive astronomical data analysis.
  • Involved in the Gaia satellite Data Processing and Consortium (DPAC).
  • Worked on astrophysical parameter estimation and object classification.
  • Developed algorithms running on top of Hadoop.
  • Worked on AI for big data such as neural networks, genetic algorithms, and clustering techniques.
  • Published four papers in international journals.
Technologies: Mahout, Hadoop, Python, MATLAB, Java

Software Engineer

2009 - 2010
EASIS S.L.
  • Designed and developed the web application hosted at https://xestor.fic.udc.es/horarios/.
  • Worked with the Administration of Faculties in the University of Coruña.
  • Started the company with three co-founders.
  • Worked on full-stack development.
  • Developed a product that is currently in use with thousands of users, including teachers and students of the computer science department.
Technologies: MySQL, Tapestry, Apache Maven, Spring, Java

R&D Software Engineer

2009 - 2010
University of A Coruña
  • Analyzed remote-sensed images (SAR images from ESA Envisat).
  • Developed AI techniques to isolate dark spots in SAR images.
  • Developed a geographic information system to locate oil spills in the ocean.
  • Created a unified, web-based interface.
  • Integrated between Java and Matlab.
  • Published one paper in an international journal.
Technologies: Apache Struts, MATLAB, GIS, PostgreSQL, Java

Gaia

http://www.cosmos.esa.int/web/gaia
Gaia is an ambitious mission to chart a three-dimensional map of our galaxy, the Milky Way, in the process revealing the composition, formation, and evolution of the Galaxy. Gaia will provide unprecedented positional and radial velocity measurements with the accuracy needed to produce a stereoscopic and kinematic census of about one billion stars in our galaxy and throughout the Local Group. This amounts to about 1% of the galactic stellar population.

PyAnnote Audio

https://github.com/pyannote/pyannote-audio
Audio processing in Python with neural networks. I've contributed some features.

Scientific Paper on the Sentinazos System

This paper describes the goals of this system and how it achieves them.

Scientific Paper on Gaia

This paper describes the work done to detect and analyze novel objects from the billion starts observed by Gaia.

JavaCK

Rule-based programming language to develop expert systems. The syntax is high level, based on CLIPS, from NASA, and it compiles to Java. The language takes advantage of the CommonKADS methodology to provide several templates which represent the most common expert systems.

Xestor FIC

Web application to administrate the Faculty of Computer Science in the University of A Coruña. It allows users to visualize their custom timetables, book rooms, upload academic projects, and much more

Languages

Java, SQL, Scala, Python, C++, C#

Frameworks

Apache Spark, Hadoop, Hibernate, Apache Struts, Ant Design, Spark, Angular, Spring MVC, Swing, Tapestry, Spring

Tools

Cloudera, Solr, Apache Sqoop, Apache Maven, Git, Weka, Eclipse IDE, Jenkins, IntelliJ IDEA, GIS, Flink, Impala, Gensim, Mercurial, MATLAB

Storage

Elasticsearch, MongoDB, HBase, PostgreSQL, MySQL, PostGIS

Other

Image Processing Software, NiFi, Data Mining, Big Data, Machine Learning, Protocol Buffers, Image Processing

Libraries/APIs

Pandas, Mahout, Scikit-learn, Keras, PyTorch, SWT

Paradigms

Data Science, Scrum

Platforms

Apache Kafka, Apache Flink, Java EE, Linux

2009 - 2014

PhD Degree in Computer Science

University of A Coruña - A Coruña, Spain

2002 - 2009

Master's Degree in Computer Science

University of A Coruña - A Coruña

AUGUST 2018 - PRESENT

Introduction to Deep Learning (with Honors)

Coursera

APRIL 2018 - PRESENT

Natural Language Processing

Coursera

MAY 2017 - PRESENT

Functional Programming Principles in Scala

Coursera

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring