Harsha H S, Developer in Cambridge, United Kingdom
Harsha is available for hire
Hire Harsha

Harsha H S

Verified Expert  in Engineering

Bio

Harsha has nearly two decades of experience in software design, architecture, development, and testing across a broad spectrum, from low-level silicon validation, board bring-ups, and device drivers to scalable distributed databases, route planning for autonomous vehicles, and AI applications. He has worked at startups alongside co-founders to bring ideas to life and at MNCs with geographically distributed cross-functional teams. Harsha believes in "simplicity well-tested and -documented."

Portfolio

Order In Chaos Technology LLP
Python, Asyncio, C++, C, Transport Layer Security (TLS), Mesos...
Amazon UK
Python, Robot Operating System (ROS), Xen, Linux Kernel, Quick EMUlator (QEMU)...
Untangle AI
PyTorch, Python, Cython, Kdb+, Q, MySQL, NumPy, SciPy, Pandas, Linux...

Experience

Availability

Part-time

Preferred Environment

Linux, C++, C, Python, Distributed Systems, Hypervisors, Algorithms, PyTorch, Cryptography, Assembly

The most amazing...

...thing I've developed is a distributed and hybrid routing service for a fleet of autonomous robotaxis under various weather, traffic, and road constraints.

Work Experience

Designated Partner/Director

2015 - PRESENT
Order In Chaos Technology LLP
  • Implemented a customized key exchange protocol on Gemalto's hardware security module (HSM) using elliptic-curve cryptography.
  • Contributed to carving our required compute and memory resources for the workload on a distributed PaaS for AI/machine learning applications, specifically on middleware, similar to how Mesos abstracts all the compute, storage, and memory resources.
  • Integrated a transport layer security on top of an HTTP server, written using proprietary eventing and the asyncio framework.
Technologies: Python, Asyncio, C++, C, Transport Layer Security (TLS), Mesos, Distributed Systems, HSM, PyTorch, Keras, TensorFlow, NumPy, Linux, Algorithms, Cryptography, Assembly, Functional Programming, SQLite, Agile Software Development, Complexity Theory, Open MPI, NVIDIA CUDA, Open Source, Object-oriented Programming (OOP), Amazon EC2, GPU Computing, SciPy, Pandas, Java, GIS, CMake, JavaScript, ARM, Graphics Processing Unit (GPU), BIOS, x64 Assembly, MIPS, AWS IoT, C#.NET, Amazon Web Services (AWS), MPI, Multiprocessing, JSON, Databases, Deep Neural Networks, Machine Learning, Linux Servers, NGINX, Cloud, Architecture, Embedded C++, GNU Debugger (GDB), Valgrind, Elliptic Curve Cryptography, Neural Networks, Statistics, Leadership, Computer Vision, Project Management, Embedded Systems, Networking, GitLab, Data Extraction, Team Leadership, Docker, APIs, SQLAlchemy, PostgreSQL, Asynchronous I/O, Protobuf, GitHub, Jira, Cloud Storage, AWS IAM, SQL, Virtualization Technology, Virtualization, DevOps, Linux Server Administration, Mathematics, Oracle PL/SQL, Cloud Architecture, AWS Lambda, Serverless, CI/CD Pipelines, Node.js, REST APIs, AWS Cloud Architecture, Amazon SageMaker, Scripting, Git, Command-line Interface (CLI), Data Engineering, Lint, High Code Quality, C#, Boost, Artificial Intelligence (AI), Deep Learning, Automation, Embedded Linux, API Integration, TypeScript, Board Support Packages, Embedded C

Senior Kernel/Hypervisor and Robotics Engineer

2019 - 2022
Amazon UK
  • Developed a proof of concept for an NX network card emulation in QEMU to remove hardware dependency for the Xen on Nitro program.
  • Supervised a small team to deliver the same to include TCP checksum and segmentation offloading.
  • Maintained the health of accelerated compute infrastructure on Amazon EC2.
  • Managed cross-functional stakeholders in hardware, kernel, hypervisor, and GPU compute teams.
  • Contributed to the overall quality improvement and upgrade of Xen fleets and, eventually, the migration of the Xen fleet to Xen on Nitro architecture.
  • Collected and stored metrics to understand robot behavior and improve operational robustness of scout robots middleware, using the robot operating system to extend battery life.
Technologies: Python, Robot Operating System (ROS), Xen, Linux Kernel, Quick EMUlator (QEMU), Amazon EC2, Ruby, C++, C, GPU Computing, Linux, Hypervisors, Algorithms, Distributed Systems, Agile Software Development, Open Source, ARM, Graphics Processing Unit (GPU), Amazon Web Services (AWS), Multiprocessing, JSON, Amazon S3 (AWS S3), Linux Servers, Cloud, Architecture, Embedded C++, GNU Debugger (GDB), Valgrind, Leadership, Project Management, Embedded Systems, Networking, GitLab, Data Extraction, Team Leadership, Docker, APIs, PostgreSQL, Asynchronous I/O, Protobuf, GitHub, Jira, Cloud Storage, AWS IAM, SQL, Virtualization Technology, Virtualization, DevOps, Linux Server Administration, Mathematics, Oracle PL/SQL, Cloud Architecture, AWS Lambda, Serverless, CI/CD Pipelines, REST APIs, AWS Cloud Architecture, Amazon SageMaker, Internet of Things (IoT), ETL, Scripting, Git, Command-line Interface (CLI), Data Engineering, Lint, High Code Quality, C#, Deep Learning, Automation, PLC, Embedded Linux, Board Support Packages, Embedded C

Senior Software Engineer

2019 - 2019
Untangle AI
  • Delivered products as a core founding team member, aimed at explainable AI for convolution neural networks using signal estimation, uncertainty modeling, and concept extraction, and using Cython to package into SDK and integrated keygen licensing.
  • Designed an active learning process using uncertainty modeling to feed right and limited data sets to a deep neural network to reduce training time without compromising accuracy.
  • Developed and delivered a back-end service using tornado, MySQL, and layer-wise relevance propagation algorithm to explain manufacturing failures, trained using long short-term memory (LSTM) recurrent neural network architecture.
Technologies: PyTorch, Python, Cython, Kdb+, Q, MySQL, NumPy, SciPy, Pandas, Linux, Algorithms, Agile Software Development, NVIDIA CUDA, Object-oriented Programming (OOP), Amazon EC2, GPU Computing, Graphics Processing Unit (GPU), Amazon Web Services (AWS), JSON, Data Science, Deep Neural Networks, Machine Learning, Linux Servers, NGINX, Cloud, Architecture, GNU Debugger (GDB), Neural Networks, Leadership, Computer Vision, Project Management, GitLab, Data Extraction, Team Leadership, Docker, APIs, SQLAlchemy, PostgreSQL, Asynchronous I/O, Protobuf, GitHub, Jira, Cloud Storage, SQL, DevOps, Linux Server Administration, Mathematics, Oracle PL/SQL, Cloud Architecture, AWS Lambda, Serverless, CI/CD Pipelines, REST APIs, AWS Cloud Architecture, Amazon SageMaker, Internet of Things (IoT), ETL, Scripting, Git, Command-line Interface (CLI), Data Engineering, Lint, High Code Quality, C#, Boost, Artificial Intelligence (AI), Deep Learning, Automation, Serial Communication, API Integration, TypeScript

Senior Software/Research Engineer

2017 - 2019
Nutonomy
  • Designed, developed, and tested a fleet management system for thousands of autonomous taxis plying on Singapore's city-scale using dynamic shortest path algorithms to route the taxis under various constraints.
  • Researched and created constrained assignment algorithms to reduce customer wait time and maximize robotaxi utilization, exploring topics such as ride-sharing and a hybrid mode of transport.
  • Developed a grid matching algorithm and worked on GeoJSON data to match customers to available robotaxis in constant time.
  • Scaled the service by working on hybrid mode of motion planning using coarse graphs to reduce graph complexity.
Technologies: C++, Java, Go, Python, Neo4j, GIS, Linux, Distributed Systems, SQLite, Agile Software Development, Complexity Theory, Operations Research, NVIDIA CUDA, Object-oriented Programming (OOP), Asyncio, Amazon EC2, NumPy, SciPy, Pandas, CMake, JavaScript, AWS IoT, C#.NET, Conan, Amazon Web Services (AWS), Multiprocessing, JSON, Data Science, Deep Neural Networks, Machine Learning, Linux Servers, C++17, Firmware over the Air (FOTA), Architecture, Embedded C++, GNU Debugger (GDB), Valgrind, Leadership, Computer Vision, Project Management, Embedded Systems, Networking, GitLab, Team Leadership, Docker, APIs, PostgreSQL, Asynchronous I/O, Protobuf, GitHub, Jira, Cloud Storage, SQL, DevOps, Linux Server Administration, Mathematics, Cloud Architecture, AWS Lambda, CI/CD Pipelines, Node.js, REST APIs, AWS Cloud Architecture, Internet of Things (IoT), ETL, Scripting, Git, Command-line Interface (CLI), Data Engineering, Lint, High Code Quality, Boost, Automation, TypeScript, Embedded C

Senior Software Engineer

2014 - 2017
Couchbase
  • Reduced network traffic overhead and indexing throughput with a solution designed and developed to optimize MapReduce indexes when document fields are unused.
  • Simplified required efforts for the horizontal scaling of eventing nodes by architecting and mentoring on developing and implementing its consensus-free sharding mechanism.
  • Developed and open-sourced v8-inspector to debug embedded JavaScript applications with WebSocket server on the back end, integrating it with Chrome DevTools on the front end to provide debugging functionality for user-written events.
  • Applied various improvements and bug fixes in B+ tree implementation and MapReduce indexing using async networking primitives in Erlang.
Technologies: Erlang, Go, Python, C, CMake, Distributed Systems, JavaScript, Google V8, Storage, Open Source, Linux, Algorithms, Functional Programming, Agile Software Development, Object-oriented Programming (OOP), Asyncio, Amazon EC2, Amazon Web Services (AWS), JSON, Databases, Linux Servers, Cloud, Architecture, GNU Debugger (GDB), Leadership, Project Management, Redis, Networking, GitLab, Team Leadership, Docker, APIs, Protobuf, GitHub, Jira, Linux Server Administration, Cloud Architecture, AWS Lambda, CI/CD Pipelines, Node.js, REST APIs, AWS Cloud Architecture, Scripting, Git, Command-line Interface (CLI), Data Engineering, Lint, High Code Quality, C#, Automation, TypeScript, Embedded C

Senior ASIC Engineer

2012 - 2014
NVIDIA
  • Developed microcode for ARMv8 instruction set architecture (ISA) to gain out-of-order benefits with a custom in-order VLIW engine, fusing operations and optimizing hot code block using branch prediction performance metrics.
  • Created and maintained the code coverage infrastructure for a management translation software on the microcode engine and a software interpreter handling hypervisor exceptions and slow interrupt paths.
  • Improved the random instruction generator to uncover and fix bugs in the simulator and register-transfer level for ARMv8 ISA.
Technologies: C++, C, NVIDIA CUDA, Assembly, ARM, Graphics Processing Unit (GPU), Linux, Algorithms, CMake, Firmware, Microcode, Bootloaders, MPI, Hardware Drivers, Device Drivers, Linux Servers, Architecture, Embedded C++, GNU Debugger (GDB), Leadership, Embedded Systems, SQL, Linux Server Administration, Scripting, Command-line Interface (CLI), High Code Quality, Automation, PLC, Board Support Packages, Embedded C

Graphics Software Engineer

2009 - 2012
Intel
  • Contributed to the board bring-up process of Ivy Bridge and a next-gen Intel processor as part of the Legacy Video BIOS team, catering to display interfaces such as HDMI, DisplayPort, VGA, and LCD.
  • Managed a small team of system admins, video basic input/output system (BIOS) experts, and testing officers to deliver an extended desktop feature in video BIOS for Acer's Iconia line of laptops, which had a dual display in place of a keyboard.
  • Collaborated with the system administration team to move our codebase from Rational ClearCase to Git and evangelized embracing Agile practices within the system and video BIOS teams.
Technologies: BIOS, Firmware, x64 Assembly, C, Assembly, Bootloaders, MPI, Hardware Drivers, Device Drivers, Architecture, Embedded C++, GNU Debugger (GDB), Leadership, Embedded Systems, Linux Server Administration, Scripting, Command-line Interface (CLI), High Code Quality, Automation, PLC, Board Support Packages, Embedded C

System Technologist

2008 - 2009
Tandberg (Acquired by Cisco)
  • Developed a test framework for qualifying a hardware board that supports a 720-pixel camera over USB and worked towards obtaining hardware certifications.
  • Created and implemented firmware upgrade mechanisms over USB and universal asynchronous receiver-transmitter interfaces.
  • Designed and developed direct memory access drivers for a pulse-width modulation module to deliver the sinusoidal wave over the trapezoidal wave to the bipolar stepper motor used for autofocus and exposure, combatting motor noise.
Technologies: C, C++, ARM, USB, Linux, Assembly, Linux Kernel, Open Source, BIOS, Firmware, Bootloaders, Hardware Drivers, Device Drivers, Linux Servers, Embedded C++, GNU Debugger (GDB), Embedded Systems, Linux Server Administration, Scripting, Command-line Interface (CLI), High Code Quality, Automation, PLC, Embedded Linux, Board Support Packages, Embedded C

Design Engineer 2

2007 - 2008
Montalvo Computer Systems India Pvt. Ltd.
  • Implemented microcode for an in-order VLIW machine to mimic an x86_64 architecture's hardware task switching mechanism by saving and restoring register files, the system management mode, and various interrupt paths based on control register entries.
  • Added diagnostics for various FP, SSE, and MMX instructions crossing with control register sensitivities and x86 modes, implementing identical behavior in microcode.
  • Contributed to enhancing various simulator features.
Technologies: Microcode, C++, C, Linux, Algorithms, Assembly, Quick EMUlator (QEMU), BIOS, Firmware, x64 Assembly, Bootloaders, MPI, Device Drivers, Embedded C++, GNU Debugger (GDB), Embedded Systems, Linux Server Administration, Scripting, Command-line Interface (CLI), High Code Quality, Automation, Board Support Packages, Embedded C

Software Engineer

2003 - 2007
RMI Corporation (Formerly Raza Microelectronics, Inc.)
  • Developed a flash mode for an XLR MIPS simulator to simulate the boot process from flash.
  • Collaborated with the team to develop a comprehensive benchmark and stress test suite for testing XLR processors.
  • Created a GDB stub and core dump utility in the bootloader to debug crashes of multithreaded applications.
  • Ported OpenSSL to bypass the software cryptographic algorithms and use hardware accelerators.
  • Complied with Federal Information Processing Standards (FIPS) for the HSM that was part of the XLR system on a chip.
Technologies: C++, C, Firmware, MIPS, OpenSSH, OpenSSL, HSM, Linux, Algorithms, Cryptography, Assembly, Linux Kernel, Open Source, BIOS, Bootloaders, Hardware Drivers, Device Drivers, Linux Servers, Embedded C++, GNU Debugger (GDB), Elliptic Curve Cryptography, Embedded Systems, Linux Server Administration, Scripting, High Code Quality, Embedded Linux, Board Support Packages, Embedded C

Distributed Eventing Framework for Couchbase Events

An eventing framework for Couchbase, a distributed key-value JSON document store supporting 100,000 operations per second.

The eventing framework was a post-trigger mechanism to hook in user-specified functions or operations on every database event. It also provided users to connect functionalities for non-database events such as timers.

I was primarily involved with architecting a consensus-free sharding mechanism for eventing nodes to claim ownership of a set of shards by each node without involving leader election, simplifying the design to a large extent.

I was also involved with various integrations, including:

• The Google V8 engine to parse and compile user-supplied functionality using JavaScript functions.
• The v8-inspector on the back end and Chrome DevTools on the front end to debug user functions, then later open-sourcing the same to debug any embedded JavaScript applications.
• The eventing functionality with the Couchbase multi-dimension scaling paradigm, scaling each service independently depending on its workload.

Routing and Assignment Microservices for Robotaxis

A distributed hybrid routing engine to route thousands of robotaxis around Singapore's city-scale.

The project involved graph pruning for scalability and developing a parallel dynamic shortest path graph algorithm for the coarse-grained routing of taxis from source to destination. A fine-grained map was also downloaded onto the taxi on-demand, enabling the robotaxi to perform motion planning, obstruction avoidance, and lane switching to achieve the final goal, which was to provide customers with an optimal assignment service under various constraints.

I was involved with:

• Conducting a research survey to understand various state-of-the-art shortest path algorithms, decide upon the Ramalingam-Reps algorithm, and implement the back end in C++.
• Developing a simple HTTP server in C++ to receive various goal positions, traffic conditions, and other constraints to be applied to the road network graph.
• Creating an extract, transform, and load (ETL) pipeline to scrape GeoJSON data from OpenStreetMap APIs and remove the fine-grained Uni node to get a scalable graph for the cities the robotaxis operated.
• Implementing assignment microservices using operation research techniques to perform constrained optimization.

Hardware Security Modules and Cryptography for RMI

I was involved with various cryptographic algorithms in hardware and software for various HSMs at RMI Corporation.

I worked towards getting an FIPS certification for HSMs. I also developed OpenSSL and OpenSSH integrations and drivers and implemented an elliptic-curve key exchange algorithm for the Gemalto HSM.
1999 - 2003

Bachelor's Degree in Computer Science and Engineering

Bapuji Institute of Engineering and Technology - Davanagere, Karnataka, India

Libraries/APIs

OpenSSL, MPI, Protobuf, REST APIs, PyTorch, Open MPI, NumPy, SciPy, Pandas, Keras, SQLAlchemy, Asyncio, TensorFlow, Node.js

Tools

CMake, OpenSSH, GNU Debugger (GDB), GitLab, GitHub, Jira, Git, GIS, NGINX, Valgrind, Amazon SageMaker, Mesos, Conan, AWS IAM

Languages

C++, C, Python, x64 Assembly, Microcode, Embedded C++, Embedded C, Assembly, Erlang, Go, Java, JavaScript, MIPS, C#.NET, SQL, C#, Ruby, Q, Lua, C++17, TypeScript

Paradigms

ETL, Agile Software Development, Object-oriented Programming (OOP), DevOps, Automation, Functional Programming

Platforms

Linux, Amazon Web Services (AWS), Docker, NVIDIA CUDA, Xen, Quick EMUlator (QEMU), Amazon EC2, AWS IoT, AWS Lambda, Embedded Linux

Industry Expertise

Project Management

Frameworks

Boost

Storage

JSON, Databases, Amazon S3 (AWS S3), PostgreSQL, Oracle PL/SQL, MySQL, SQLite, Kdb+, Neo4j, Redis

Other

Distributed Systems, Hypervisors, Algorithms, Linux Kernel, Open Source, HSM, ARM, BIOS, Firmware, Bootloaders, Multithreading, Multiprocessing, Deep Neural Networks, Machine Learning, Linux Servers, Cloud, Architecture, Leadership, Embedded Systems, Data Extraction, Team Leadership, APIs, Linux Server Administration, Mathematics, Scripting, Command-line Interface (CLI), Data Engineering, Lint, High Code Quality, Board Support Packages, Cryptography, Complexity Theory, Operations Research, Robot Operating System (ROS), GPU Computing, Cython, Google V8, Graphics Processing Unit (GPU), Boost.Asio, Internet of Things (IoT), Compilers, Hardware Drivers, Device Drivers, Data Science, Elliptic Curve Cryptography, Neural Networks, Computer Vision, Networking, Asynchronous I/O, Cloud Storage, Virtualization Technology, Virtualization, Cloud Architecture, Serverless, CI/CD Pipelines, AWS Cloud Architecture, Artificial Intelligence (AI), Deep Learning, PLC, API Integration, Containerization, Transport Layer Security (TLS), Storage, USB, Groovy Scripting, Firmware over the Air (FOTA), Statistics, Serial Communication

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring