
Harilal Orunkara Poyil
Verified Expert in Engineering
Data Strategist and Developer
Espoo, Finland
Toptal member since March 11, 2025
Harilal is a data engineer with a strong DevOps background and over a decade of experience building and optimizing data platforms, automating data workflows, and enhancing system reliability. He specializes in Python, Databricks, Spark, Apache Airflow, and Terraform, designing scalable analytics and machine learning solutions. Harilal excels at collaborating with cross-functional teams to deliver efficient, high-performance data solutions.
Experience
- Data Engineering - 12 years
- Python - 8 years
- Data Architecture - 8 years
- Databases - 7 years
- Infrastructure as Code (IaC) - 5 years
- Apache Airflow - 4 years
- Amazon Web Services (AWS) - 4 years
- Databricks - 3 years
Preferred Environment
Python, Databricks, Apache Airflow, Data Engineering, Data Architecture, SQL, Linux
The most amazing solution I've built is a data platform for a fintech company on Databricks and AWS, where I managed infrastructure, data integration, CI/CD, and an architecture designed for scalability.
Work Experience
Data Engineer
ePassi
- Built and optimized a scalable data platform on Databricks running on AWS, ensuring high performance and reliability.
- Designed and implemented data integration processes while collaborating with cross-functional teams to ensure efficient ingestion and transformation aligned with business needs.
- Implemented CI/CD pipelines to automate deployments and streamline infrastructure and data workflow management.
- Redesigned and improved the existing data platform, enhancing performance, scalability, and maintainability.
- Proposed and contributed to process improvements, enhancing team workflows and efficiency in an Agile development environment.
Consultant Data Engineer
Eficode
- Provided solutions for multiple clients, addressing analytics, reporting, and DevOps challenges while optimizing workflows, cost, and performance.
- Developed analytics reports for monitoring various systems and tools as part of an internal R&D project, improving system visibility and decision making.
- Built and enhanced asset management systems using Jira Assets, streamlining asset tracking and management for the company and its clients.
Master's Thesis Intern
Truecaller
- Developed a deep learning-based system for KPI forecasting and anomaly detection, improving accuracy in performance monitoring.
- Integrated exogenous variables using data fusion techniques to enhance forecasting precision.
- Implemented and evaluated machine learning models, contributing to data-driven decision making and operational efficiency.
Research Engineer
Amrita Vishwa Vidyapeetham
- Developed a distributed early warning framework for near real-time network traffic analysis and security threat detection.
- Conducted semantic analysis of social media data to assess public opinion on various social causes.
- Implemented big data solutions using Hadoop, Spark, and Elasticsearch to process and analyze large-scale datasets efficiently.
Experience
Data Platform for a Fintech Company
Education
Master's Degree in Information and Communications Technology (ICT) Innovation
KTH Royal Institute of Technology - Stockholm, Sweden
Master's Degree in Computer Science and Engineering
Polytechnic University of Milan - Milan, Italy
Bachelor's Degree in Computer Science and Engineering
Amrita University - Kerala, India
Certifications
The Complete dbt (Data Build Tool) Bootcamp: Zero to Hero
Udemy
Databricks Certified Data Analyst Associate
Databricks
Databricks Certified Data Engineer Associate
Databricks
Data Engineering with AWS
Udacity
Machine Learning Specialization
Coursera
Deep Learning Specialization
Coursera
Skills
Libraries/APIs
Pandas, PySpark, TensorFlow
Tools
Jira, Confluence, Apache Airflow, AWS IAM, Terraform, Bitbucket, GitHub, AWS Glue, Amazon Athena, Rundeck, Jenkins, SonarQube, Ansible, Microsoft Power BI, GitLab
Languages
Python, SQL, YAML, Bash Script, Groovy, Java
Storage
Data Pipelines, Databases, Amazon S3 (AWS S3), MySQL, JSON, Data Lakes, Relational Databases, PostgreSQL, HBase, Elasticsearch
Frameworks
Apache Spark, Delta Live Tables (DLT), Hadoop, Jinja
Paradigms
Agile Software Development, Automation, DevOps, ETL
Platforms
Databricks, Linux, Amazon Web Services (AWS), Docker, Apache Kafka
Other
Data Engineering, Data Architecture, Dashboards, Documentation, Data Modeling, Data Transformation, Machine Learning, Deep Learning, Distributed Systems, Computer Science, Data Warehousing, Analytics, APIs, CI/CD Pipelines, Mentorship, Infrastructure as Code (IaC), Data Governance, Data Strategy, Star Schema, Azure Databricks, Change Data Capture, Data Science, Artificial Intelligence (AI), Recommendation Systems, Marketing Strategy, Data Mining, Operating Systems, Computer Networking, Compilers, Computer Architecture, Web Development, Algorithms, Amazon Redshift, Neural Networks, ITSM, Information & Communications Technology (ICT), Engineering, Data Build Tool (dbt), Dagster, Fivetran