Herve Roussel, Developer in Singapore, Singapore
Herve is currently unavailable

Herve Roussel

Bio

Hervé is a CTO-level AI and engineering leader helping startups and enterprises turn ambiguous ideas into production systems. He specializes in AI workflows, LLM products, automation platforms, SaaS architecture, and rapid MVPs. Hervé is a former CTO/founder who scaled products from 5,000 to 127,000 users, built platforms serving 250+ million API calls/month, and shipped production systems across the full stack, from LLM back ends to data pipelines.

Portfolio

Enterprise AI & Digital Services Agency
Amazon Web Services (AWS), TensorFlow, React, Business Process Modeling, Python...
Indeed
Amazon Web Services (AWS), Java, Apache Airflow, Python, A/B Testing...
Quod AI
Java, Machine Learning, Big Data, SQL, Elasticsearch, Amazon Web Services (AWS)...

Experience

  • System Architecture - 10 years
  • Data Engineering - 10 years
  • API Design - 10 years
  • Java - 8 years
  • SQL - 8 years
  • Amazon Web Services (AWS) - 8 years
  • Apache Beam - 6 years

Preferred Environment

Redis, JavaScript, Python 3, PostgreSQL, Supabase, Next.js, LLM Integration, API Integration, TypeScript, Agentic AI

The most amazing...

...project I've done was pioneering one of the 1st Foundation LLMs for source code (validated by a Turing Award winner) and scaling to 127,000 users in 18 months.

Work Experience

Engineering Lead, AI & Machine Learning

2023 - 2026
Enterprise AI & Digital Services Agency
  • Delivered production ML classification system, achieving 0.98 precision and 0.94 recall, improving automated resolution by 40%. Exceeded SOTA benchmarks with 9x more detections at 4x lower error rate.
  • Built and shipped a multi-agent LLM framework, improving response quality by more than 36% and scaling automated output 11x; delivered the supporting ops portal from zero to production adoption.
  • Designed architecture and technical roadmap across 4 concurrent AI/ML products spanning URL classification, fraud detection, conversational AI, and fraud analytics, defining system design, data pipelines, and integration across multiple teams.
Technologies: Amazon Web Services (AWS), TensorFlow, React, Business Process Modeling, Python, FastAPI, LangGraph, LLM Integration, Software Architecture, Artificial Intelligence (AI), SQL, Playwright, Architecture, Large Language Models (LLMs), Technical Leadership, PostgreSQL, Code Auditing, Document Processing, Node.js, Team Leadership, Data Architecture, Technical Architecture

Engineering & ML Lead

2022 - 2023
Indeed
  • Co-captained the Pay Transparency program across 6+ product areas (SMB, Taxonomy, SCP, Metadata Salary, Salary Estimation), securing CTO approval for global rollout across Indeed's job seeker and employer platforms.
  • Managed 8 engineers directly and influenced 14 across adjacent data science and engineering teams; increased SWE-1 and SWE-2 engagement 25% by transitioning teams to Agile rituals and restructuring sprint planning.
  • Unblocked the Pay Transparency roadmap by running 3 cross-functional workshops and producing alignment documentation with contributions from 10+ engineers, 1 SPM, and 3 senior UX designers.
  • Coordinated 20+ stakeholders across 6 partner teams (Dradis, Salary Estimation, Metadata Salary, Butterfly Matching, and Ranking) to deliver program architecture and roadmap alignment on an accelerated timeline.
  • Worked across Java, Python, AWS, and Airflow on high-scale job matching and salary intelligence infrastructure serving hundreds of millions of job seekers globally.
Technologies: Amazon Web Services (AWS), Java, Apache Airflow, Python, A/B Testing, Software Architecture, Artificial Intelligence (AI), Supervised Learning, SQL, Architecture, Technical Leadership, Team Leadership, Data Architecture, Technical Architecture

CTO | Co-founder

2018 - 2022
Quod AI
  • Co-founded an AI code intelligence platform and pioneered one of the first foundation LLMs for source code, scaling from 5,000 to 127,000 users and securing Standard Chartered as an enterprise client.
  • Architected and coded the core AI search engine using TensorFlow, Elasticsearch, and Apache Beam, serving 100,000+ users with 78% month-on-month growth.
  • Hired, built, and created a productive environment for nine engineers over four teams using Apache Beam/Flink, TensorFlow, MySQL, Elasticsearch, MongoDB, Redis, Node.js, React, and Kubernetes on AWS and GCP.
  • Scaled indexing to 400,000 characters/min with 11 proprietary AI models on a restricted budget.
  • Set up training and processes for hiring, release, and the architectural board.
  • Published blogs, talks, and podcasts for engineers with 20,000+ views combined.
Technologies: Java, Machine Learning, Big Data, SQL, Elasticsearch, Amazon Web Services (AWS), DevOps, CTO, Software Architecture, REST APIs, Artificial Intelligence (AI), Generative Pre-trained Transformers (GPT), Foundation Models, RAG Architecture, Vector Databases, Unsupervised Machine Learning, Supervised Machine Learning, Web Scraping, Data Scraping, Playwright, Architecture, Large Language Models (LLMs), Technical Leadership, Venture Capital, PostgreSQL, RAG Systems, Node.js, Google Cloud Platform (GCP), Team Leadership, Data Architecture, Technical Architecture

Head of Data Engineering | Back-end Engineer | Architect

2016 - 2018
Sentifi
  • Architected and coded fintech platform ingesting 1+ billion messages, serving 250+ million API calls per month with NLP and deep learning.
  • Led and managed a team of 20 engineers using Apache Beam, Google Dataflow, PostgreSQL, Elasticsearch, MongoDB, and Spring Boot on Google Cloud.
  • Set up processes for engineering training, code review, and release.
Technologies: Redis, MongoDB, Elasticsearch, Cloud Dataflow, Apache Beam, Java, API Design, Software Architecture, REST APIs, Web Scraping, Data Scraping, Selenium, Architecture, Technical Leadership, PostgreSQL, Code Auditing, Full-stack, Node.js, Google Cloud Platform (GCP), Fintech, Team Leadership, Data Architecture, Technical Architecture

CTO

2014 - 2014
Astoria Road
  • Maintained the existing architecture developed using PHP and MySQL on AWS.
  • Implemented an admin CMS using a Yii PHP Framework.
  • Attended the Plug-and-Play Silicon Valley Accelerator.
  • Implemented an automated purchase robot as a proof of concept using Watir.
Technologies: Amazon Web Services (AWS), CTO, Software Architecture, Venture Capital, Code Auditing, Full-stack

CTO | Co-founder

2007 - 2013
Linked Senior
  • Architected a 3-tier web-based media distribution platform in Amazon Cloud (EC2, RDS, S3) using JBoss Seam, RichFaces, Hibernate, MySQL, HTML, CSS, and JavaScript.
  • Implemented and designed a CMS for managing content delivery and hardware inventory using YouTube, Wikipedia, and Flickr API.
  • Implemented and designed an analytics reporting module using the Google Analytics API and Google Charts.
  • Implemented and designed a public user authentication API using SOAP XML web service for a product partner.
  • Designed and implemented client software with expanded browser capabilities using Java Webstart, Java Swing, Jetty Server, Google Chrome extensions, JavaScript, and HTML5 WebSockets.
  • Led and managed QA using Jenkins, JUnit, Mockito, Selenium WebDriver, and Jasmine.
Technologies: Amazon Web Services (AWS), Java, CTO, Software Architecture, Venture Capital, Full-stack

Engineering Manager

2006 - 2008
CA
  • Led the creation of new architecture and the design of a unified threat management platform.
  • Managed and coordinated multiple simultaneous projects that included R&D, builds, QA, and product management teams located in America, Asia, Australia, and Europe involving over 30 people.
  • Maintained and contributed to the development of an anti­spyware research platform delivering critical security signature protection to more than 20 million customers.
Technologies: Java

Experience

ClauseCheck: AI Contract Review

https://clausecheck.pro
Built an AI-assisted contract review platform that transforms uploaded legal documents into structured clause-by-clause analysis workflows. Users can upload PDF/DOCX contracts, receive plain-English explanations, risk scoring, AI-generated rewrite suggestions, and track inline edits with version history. I designed and implemented the full-stack architecture, including document ingestion, clause extraction, AI review pipelines, rich-text editing workflows, authentication, storage, and serverless back-end services. The system focuses on workflow usability and explainability rather than generic legal chat interfaces.

Sentio: Agentic Technology Scouting Platform

https://sentio.mildly.ai/
Built a full-stack AI research workflow that helps investment and strategy teams turn fragmented market signals into structured intelligence reports. Users can define company priorities, launch AI research agents, and generate evidence-backed newsletter-style reports with attribution and quality scoring. I designed and implemented the end-to-end architecture, including AI workflows, background orchestration, Supabase/Postgres data modeling, LLM integrations, observability, and the operator-facing UI. The platform combines AI-assisted extraction, semantic workflows, and structured reporting rather than generic chatbot interactions.

RiskLens: AI Due Diligence Research Platform

https://risklens.netlify.app/
Built an AI-assisted due diligence platform that helps analysts and consulting teams transform public documents, PDFs, and web sources into source-grounded company research reports. The platform supports semantic search, cited Q&A, automated report generation, and multi-stage AI research workflows. I designed and implemented the full-stack system, including retrieval pipelines, vector search architecture, asynchronous job orchestration, OCR/document ingestion, AI synthesis workflows, and the research-focused user experience. The platform combines AI automation with structured workflows to accelerate business and investment research.

Panko: Browser-based Workflow Recorder

https://bit.ly/49LLghW
Built a full-stack workflow capture platform that records live browser interactions and converts them into reusable guided product tours. The system combines a Chrome extension, web dashboard, back-end APIs, and cloud storage to help teams document, share, and replay operational workflows inside real web applications. Users can record clicks and text inputs, automatically capture screenshots and UI targets, edit step instructions, and replay tours with contextual guidance overlays. I designed and implemented the end-to-end architecture, including browser extension logic, DOM instrumentation, front-end UX, authentication, back-end services, database modeling, and cloud deployment. The platform demonstrates workflow automation and operational tooling beyond simple documentation or screen recordings.

StreetSense: AI Traffic Monitoring Dashboard

https://bit.ly/4v3NW2u
Built an AI-assisted traffic operations platform that converts live video feeds into structured monitoring and alert workflows. The system performs real-time vehicle detection, congestion monitoring, throughput analysis, and incident alerting while providing operators with review and dispatch workflows. I designed and implemented the architecture spanning computer vision services, real-time dashboards, WebSocket infrastructure, AI-assisted frame analysis, and workflow state management. The platform demonstrates how AI and computer vision can be integrated into operational decision-support systems.

TalkWise: AI Voice Conversation Coaching Prototype

https://talkwise.live/
Built a full-stack AI coaching prototype that helps users practice high-stakes conversations through live voice simulations with AI personas. Users select a scenario, engage in a real-time voice conversation powered by ElevenLabs Conversational AI, and receive structured coaching feedback afterward based on transcript analysis.

I designed and built the end-to-end product workflow, including front-end UX, voice AI integration, serverless back-end functions, transcript evaluation pipelines, and rubric-based scoring logic. The system evaluates communication quality across dimensions like empathy, curiosity, clarity, and engagement balance, then generates actionable coaching feedback and key conversational moments.

The project demonstrates how AI can be embedded into a practical training workflow rather than a generic chatbot experience. It also showcases rapid MVP development, AI workflow orchestration, prompt engineering, and full-stack product prototyping for commercially relevant SaaS applications.

Education

2004 - 2005

Master of Engineering Degree in Computer Science

Cornell University - Ithaca, NY, USA

2000 - 2003

Bachelor of Science Degree in Computer Science

George Washington University - Washington D.C., USA

Certifications

SEPTEMBER 2017 - PRESENT

Improving Deep Neural Networks

Coursera

Skills

Libraries/APIs

REST APIs, Node.js, React, Playwright, API Development, Google Cloud API, TensorFlow, Vue 3, OpenAI API

Tools

Apache Beam, Cloud Dataflow, Apache Airflow

Languages

Java, SQL, Python, JavaScript, Python 3, TypeScript

Storage

PostgreSQL, Elasticsearch, Redis, Google Cloud, MongoDB, Databases, Database Management Systems (DBMS)

Frameworks

Next.js, Selenium, Spring Boot, LangGraph

Platforms

Amazon Web Services (AWS), Netlify, Google Cloud Platform (GCP), Docker

Paradigms

ETL, DevOps, Real-time Systems, Serverless Architecture, Foundation Models

Other

API Design, Data Engineering, Chrome Extensions, Artificial Intelligence (AI), Supabase, Full-stack Development, System Architecture, CTO, Software Architecture, Data Scraping, Web Scraping, Architecture, Large Language Models (LLMs), Technical Leadership, Venture Capital, Full-stack, Team Leadership, Data Architecture, Technical Architecture, Vector Databases, Open-source LLMs, Qwen, Code Auditing, Document Processing, RAG Systems, Fintech, Big Data, Machine Learning, Deep Learning, LLM Integration, API Integration, A/B Testing, Business Process Modeling, Workflow Automation, Agentic AI, Optical Character Recognition (OCR), Computer Vision, WebSockets, YOLOv5, Computer Security, Data Mining, Algorithms, Networks, FastAPI, Conversational AI, Supervised Learning, Generative Pre-trained Transformers (GPT), RAG Architecture, Unsupervised Machine Learning, Supervised Machine Learning

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring