Tafsuth Boumali, Data Engineering Developer in Fonsorbes, France
Tafsuth Boumali

Data Engineering Developer in Fonsorbes, France

Member since August 16, 2021
Tafsuth is a highly efficient and dedicated professional who possesses a broad software and data engineering skillset. Her career assignments have ranged from building real-time prediction pipelines for startups to leading project teams and designing and maintaining large data lakes for Fortune 500 companies. Tafsuth is interested in helping businesses make data-driven decisions and enjoys sharing her knowledge by mentoring engineers.
Tafsuth is now available for hire

Portfolio

  • Heetch
    Scala, Java, Apache Kafka, Kafka Streams, Dataiku, Apache Airflow, Kubernetes...
  • Audela
    Apache Kafka, Spark, Apache Avro, PostgreSQL
  • Societe Generale
    Spark, Spark SQL, Apache Ignite, Cloudera, HDFS

Experience

Location

Fonsorbes, France

Availability

Part-time

Preferred Environment

MacOS, GitHub

The most amazing...

...thing I created is a pipeline to validate the format and the content of millions of events coming from a mobile app in real-time.

Employment

  • Data Engineering Manager | Tech Lead

    2019 - PRESENT
    Heetch
    • Designed, wrote, and maintained real-time streaming pipelines, enriching and aggregating raw data coming from multiple different sources.
    • Implemented and designed a service to be able to predict and prevent users' bad behaviors in real-time.
    • Got rid of third-party data synchronization and transformation pipelines by writing our own custom pipelines.
    • Managed a team of four data engineers and did carrier reviews and handling rituals.
    • Communicated with stakeholders and data analysts and data scientists to monitor the success of new initiatives.
    • Advocated on schema management across the whole company.
    • Provided tools to generate schemas, used them to validate the mobile messages; handled enrichment, storage, and exposition.
    Technologies: Scala, Java, Apache Kafka, Kafka Streams, Dataiku, Apache Airflow, Kubernetes, Marathon, Redshift, Redshift Spectrum
  • Senior Software Engineer

    2017 - 2019
    Audela
    • Built real-time accurate network state view of physical, logical, and service topologies for telco operators.
    • Oversaw the full CI/CD infrastructure automation, deploying to Kubernetes.
    • Leveraged graph databases for storage and exposition.
    Technologies: Apache Kafka, Spark, Apache Avro, PostgreSQL
  • Data Engineer

    2016 - 2017
    Societe Generale
    • Designed and built a data lake that would store daily stock exchange orders.
    • Built pipelines allowing real-time data enrichment with Apache Ignite.
    • Met with stakeholders and translated their business needs into technical specifications.
    Technologies: Spark, Spark SQL, Apache Ignite, Cloudera, HDFS
  • BI Engineer

    2012 - 2016
    Carrefour
    • Managed high-volume databases (SQLServer and Oracle).
    • Worked on the development and evolution of BI infrastructures (SSIS).
    • Did data analysis and development of reporting exposed on a web portal (SSRS).
    Technologies: Scala, Java, SQL, Shell, Oracle, SSIS Custom Components, SQL Server 2000

Experience

  • Real-time Event Validation

    A streaming application that validates the content and the format of the event produced by the mobile application.

    My responsibility was to replicate what the Confluent Schema registry does but with JSON (at that time the JSON validation wasn't handled by the Confluent tool).

Skills

  • Languages

    SQL, Java 8, Scala, Java, Go, Python
  • Tools

    Kafka Streams, Apache Avro, Redshift Spectrum, ScalaTest, Apache Airflow, GitHub, Maven, SBT, Mesos, Spark SQL, Apache Ignite, Cloudera, Shell
  • Paradigms

    Business Intelligence (BI)
  • Platforms

    Apache Kafka, Docker, MacOS, JVM, Dataiku, Kubernetes, Oracle
  • Storage

    Redshift, Databases, AWS S3, Relational Databases, Database Management, PostgreSQL, HDFS, SQL Server 2000, Cassandra
  • Other

    Data, Big Data, Data Engineering, AWS, Data Analysis, Data Modeling, Data Analytics, Modeling, Data Warehousing, RESTful APIs, CI/CD Pipelines, Shell Scripting, Data Architecture, Finance, SSIS Custom Components
  • Frameworks

    Spark, Marathon, Hadoop
  • Libraries/APIs

    Circe

Education

  • Master's Degree in Information Systems
    2007 - 2012
    Paris Dauphine University - Paris

To view more profiles

Join Toptal
Share it with others