Vaidotas Kanopa
Verified Expert in Engineering
Data Analyst and Developer
Vaidotas is a data engineer and full-stack developer who builds high-impact solutions that produce measurable results. Examples include an AI-powered, automated customer support system and an end-to-end BI structure for a data-oriented company. He has expertise in statistics and analytics, strong business acumen, and he enjoys solving complex problems. In addition to a master's degree in financial mathematics and statistics, Vaidotas earned first place in the National Mathematical Olympiad.
Preferred Environment
Python, Amazon Web Services (AWS), Microsoft Power BI, Talend ETL, SQL Server Analysis Services (SSAS), SQL Server Integration Services (SSIS)
The most amazing...
...career event was starting my own company, which gave me a vastly different perspective on owning and operating a business.
Work Experience
Data Analyst | Data Engineer
Logdirect
- Co-created an AI-powered customer support system that automatically handled 1,000 to 5,000 requests per day. The production system was implemented using Salesforce Einstein.
- Built an MVP long short-term memory (LSTM) model for customer message classification with Python, Keras, and scikit-learn. This proof of concept led to the decision to launch the customer support automation project into production.
- Created a pipeline to extract and transform over one million email messages into a clean, ML-ready dataset using the Python Pandas library and text manipulation techniques.
- Developed and maintained Microsoft SQL Server Analysis Services (MS SSAS) multidimensional cubes that the majority of the company used as the source for business insights and day-to-day analytics.
- Participated in developing and maintaining an Amazon Redshift data warehouse that was used as the primary data source for the whole company.
- Re-developed ETL processes to provide data for day-to-day use and monitoring. This reduced data loading errors by 70% and helped spot and correct countless data inconsistencies.
- Created Power BI reports providing business insights and monitoring, such as payment reports used by the payments team to spot problems in payment flows, GDPR tracking reports, ETL process monitoring, and an automated customer support report.
Full-stack Developer
Self-employed
- Developed a custom, end-to-end software solution for detecting visual anomalies, such as residues in wine, shattered glass, and missing labels. The solution had human-level performance and was capable of processing up to 20 evaluations per second.
- Developed OpenCV-based image processing pipelines to extract images from industrial cameras and prepare them for analysis.
- Integrated deep learning Keras models to work alongside hard-coded computer vision techniques. This led to the ability to use deep learning models on specific parts of images and, in turn, increase their effectiveness.
- Built a user interface using the Kivy framework that allowed non-technical personnel to use the software effectively.
- Collaborated with assembly line managers to analyze problematic specifications and create clear requirements for the ML visual inspection solution.
Founder
MB Morsas
- Created and optimized Amazon store listings that took the best-selling product position in a major Amazon category.
- Developed multiple online stores using WooCommerce and Shopify, each servicing a few thousand customers.
- Built a semiautomatic customer service system that helped increase agent efficiency by 60%.
- Searched for and negotiated with overseas suppliers and ensured high-quality products and on-time delivery of shipments.
- Developed a structured process for media buying on the Facebook Ads platform, which resulted in quick product testing. The data was gathered and combined using Python and Excel.
Data Analyst
Self-employed
- Developed Python Pandas-based ETL processes to produce a BI-ready data source by gathering and combining data from the ERP system (MS SQL database) and custom-made reports.
- Created QlikView reports used by the whole sales team and management to track business performance and make well-informed decisions.
- Assisted in analyzing the data in order to optimize manufacturing processes. Analyses ranged from spotting inconsistencies in product recipes to evaluating the performance of sales promotions.
- Developed Tableau reports to help answer business questions and track KPIs.
Director of Commerce
UAB Daivida
- Oversaw the largest client category, retail chains, which accounted for 40% of the company's revenue.
- Negotiated with clients and acquired new ones, notably the largest EU retail chain, which resulted in a 25% YoY revenue increase.
- Initiated a data-driven approach and developed QlikView reports that helped accurately track the sales process.
- Enhanced Excel reports by using VBA to automatically extract and process data from the ERP system (an MS SQL database). This helped save an average of four hours of employee work time per day and reduce the probability of human data-entry errors.
Purchasing Manager
UAB Daivida
- Ensured a smooth and continuous supply of raw materials for the manufacturing process.
- Searched for and negotiated with suppliers, resulting in 10 to 15% cost reductions on major raw materials used.
- Developed an Excel VBA-based semiautomatic inventory tracking and forecasting system that streamlined inventory evaluation and increased accountability. This resulted in a more efficient purchasing process and less time spent on routine tasks.
- Created ETL scripts to extract inventory/purchasing-related information from the ERP system (MS SQL database) and developed QlikView reports that provided a clear picture of the whole purchasing process, e.g., material pricing, demand, and errors.
Senior Specialist
VĮ "Žemės Ūkio Informacijos ir Kaimo Verslo Centras"
- Researched statistical modeling methods, such as linear regression and best linear unbiased prediction (BLUP), to determine their best application for selective animal breeding. Model building was done using R.
- Co-created Python-based ETLs to extract necessary data from Oracle databases.
- Provided required governmental analytical reports on the agriculture sector and ensured data quality and accuracy.
Experience
Customer and Prospect Email Request AI-based Autoresponder
The problem:
Due to the nature of the business, many prospect email requests went unanswered, which meant lost opportunities, and handling customer requests was very costly.
Project details:
• Aggregated and transformed Salesforce requests into ML-ready datasets
• Provided the POC version using Python and TensorFlow/Keras
• Trained and tuned models on Salesforce Einstein
• Helped develop an adjustable Salesforce interface for the autoresponder
• Implemented reports to track the performance of the system
• Helped steer the project to achieve the most business value
Result:
The system handled 1,000 to 5,000 prospect and customer requests daily, helping the business improve profitability and increase customer satisfaction.
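As an illustration of the first project step, turning raw request emails into ML-ready text might look like the minimal sketch below. It uses only the Python standard library. The function name `clean_email`, the signature markers, and the cleaning rules are assumptions made for illustration, not the production pipeline.

```python
import re

# Hypothetical markers that often begin a signature block;
# real data would need its own list.
SIGNATURE_MARKERS = ("best regards", "kind regards", "sent from my")


def clean_email(body: str) -> str:
    """Return lowercased, whitespace-normalized text without quoted replies or signature."""
    kept = []
    for line in body.splitlines():
        stripped = line.strip()
        lowered = stripped.lower()
        # Skip lines quoted from an earlier message in the thread.
        if stripped.startswith(">"):
            continue
        # Stop at the first line that looks like the start of a signature.
        if lowered == "--" or lowered.startswith(SIGNATURE_MARKERS):
            break
        kept.append(lowered)
    text = " ".join(kept)
    # Replace URLs and email addresses with placeholder tokens.
    text = re.sub(r"https?://\S+", " <url> ", text)
    text = re.sub(r"\S+@\S+", " <email> ", text)
    # Collapse runs of whitespace into single spaces.
    return re.sub(r"\s+", " ", text).strip()
```

Replacing URLs and addresses with placeholder tokens is one possible design choice: it keeps the signal that a link or contact was present while hiding the specific value from the classifier.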
BI Infrastructure Migration
The problem:
Due to increasing data needs, local data centers became too costly and hard to scale.
Project details:
• Helped design and choose the correct infrastructure for BI needs (EC2 and RDS servers)
• Helped design and implement migration steps ensuring continuous, uninterrupted work
• Designed database schemas and implemented BI data migration to Redshift, RDS, and EC2 Microsoft SQL servers
• Created new adapted Talend ETL flows
Result:
All BI infrastructure was migrated to the AWS Cloud, reducing costs and improving speed, with virtually no downtime during migration.
Data Democratization
The problem:
Different departments in the company were using their own data and analytics solutions, causing data inconsistencies, a lack of data sharing, and inefficient analytics.
Project details:
• Aggregated dispersed data sources into a central data warehouse (AWS Redshift and Spectrum)
• Helped develop a structured Power BI dashboarding system
• Converted existing business analytics solutions to Power BI and optimized them
• Co-developed new Power BI reports to answer business needs
Result:
Most of the company's data has been stored in the central data warehouse as the source of truth; data analytics needs have been served by systemized Power BI reports.
Custom Visual Defect Detection App
The problem:
In many wine manufacturing plants, defect detection is done manually, which is costly and introduces human error.
Project details:
• Based the app on Python and the OpenCV library
• Integrated with industrial-grade cameras
• Achieved real-time speed at 20 evaluations per second
• Developed to be adjustable to new products while evaluating defects against reference images
• Utilized a Kivy-based GUI
Result:
The solution had human-level performance on most wine bottle types and could process up to 20 evaluations per second.
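The idea of evaluating defects against reference images can be sketched as follows. This is a simplified illustration in plain Python rather than the OpenCV-based implementation described above. The names `mean_abs_diff` and `is_defective`, the nested-list image representation, and the threshold value are all assumptions.

```python
from typing import List

Image = List[List[int]]  # grayscale pixels (0-255), one inner list per row


def mean_abs_diff(image: Image, reference: Image) -> float:
    """Average per-pixel absolute difference between two same-sized images."""
    same_rows = len(image) == len(reference)
    if not same_rows or any(len(a) != len(b) for a, b in zip(image, reference)):
        raise ValueError("image and reference must have the same dimensions")
    count = sum(len(row) for row in reference)
    if count == 0:
        raise ValueError("images must not be empty")
    total = sum(
        abs(p - q)
        for row, ref_row in zip(image, reference)
        for p, q in zip(row, ref_row)
    )
    return total / count


def is_defective(image: Image, reference: Image, threshold: float = 10.0) -> bool:
    """Flag an image whose deviation from the reference exceeds a hypothetical threshold."""
    return mean_abs_diff(image, reference) > threshold
```

For example, with a uniform 2x2 reference of value 100, an image that differs by a pixel or two in brightness stays under the threshold, while one with half its pixels dark is flagged. A real inspection system would also need image alignment and lighting normalization before such a comparison is meaningful.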
Classified Ads Website for Local Businesses
The problem:
Available options to sell or buy local businesses were limited and lacked functionality.
Project details:
• Created a fully serverless implementation using AWS services (Lambda, Amazon API Gateway, Amazon Simple Storage Service (S3), DynamoDB) and Cloudflare as a content delivery network (CDN)
• Enabled automatic scaling
• Developed a reactive single-page application (SPA) front-end using Vue.js
• Constructed user authentication using AWS Cognito
Result:
A fully functioning, scalable, serverless classified ads website for selling and buying local businesses.
eCommerce Store for a Meat Products Manufacturer
The problem:
During COVID-19, in the local city of the manufacturer, people were struggling to order food and basic products safely due to restrictions on movement.
Project details:
• Created a fully functioning WooCommerce online store
• Developed flexible and adjustable delivery time constraints
• Built custom pricing settings on internal product parameters
• Handled sales report generation
Result:
The company had a fully functioning eCommerce store in under a week from the idea's inception.
Automatic Product Research App
The problem:
It was challenging to manually find which products were popular and had little competition on the Amazon marketplace.
Project details:
• Created a Python-based app using the Selenium framework for web scraping
• Used the results from Google Ads and other third-party providers to find trends
• Used the results from Amazon to find competition for products
Result:
The app results led to investing in products that became high sellers across all EU Amazon marketplaces.
Education
Master's Degree in Financial Mathematics and Statistics
University of Warwick - United Kingdom
Certifications
AWS Certified Data Analytics Specialty
AWS
AWS Certified Cloud Practitioner
Amazon Web Services Training and Certification
Machine Learning Scientist with Python – Career Track
DataCamp Inc.
Python Programmer Track – Career Track
DataCamp Inc.
Data Analyst with SQL Server – Career Track
DataCamp Inc.
Deep Learning
deeplearning.ai | via Coursera
Big Data
University of California San Diego | via Coursera
Machine Learning
University of Washington | via Coursera
Skills
Libraries/APIs
Pandas, Scikit-learn, OpenCV, NumPy, Keras, TensorFlow, Kivy, Vue
Tools
Microsoft Power BI, Talend ETL, Microsoft Excel, Git, Tableau, PyCharm, Jupyter, Salesforce Einstein, Amazon Cognito, AWS Glue, Amazon Elastic MapReduce (EMR), Amazon Athena, Amazon Redshift Spectrum, Amazon QuickSight
Languages
Python, SQL, HTML, Excel VBA, R, C++, PHP
Paradigms
Data Science, Business Intelligence (BI), ETL, Dimensional Modeling, Database Design
Platforms
Jupyter Notebook, QlikView, WooCommerce, Amazon Web Services (AWS), Windows, Talend, Shopify, Oracle, Salesforce, AWS Lambda, Apache Kafka
Storage
SQL Server Analysis Services (SSAS), SQL Server Integration Services (SSIS), Redshift, Relational Databases, Databases, Data Pipelines, SQL Server Reporting Services (SSRS), Amazon S3 (AWS S3), JSON, PostgreSQL, Microsoft SQL Server, Oracle SQL, Amazon DynamoDB, MySQL
Frameworks
Selenium
Other
Machine Learning, Data Engineering, Data Analysis, Data Modeling, Data Cleansing, Data, Data Visualization, Deep Learning, Convolutional Neural Networks (CNN), Facebook Ads, Ad Optimization, Landing Page Optimization, Natural Language Processing (NLP), Computer Vision, Image Processing, Data Analytics, Data Warehousing, Amazon RDS, Generative Pre-trained Transformers (GPT), Google Ads, Microsoft Data Transformation Services (now SSIS), Statistics, Statistical Modeling, Cloud, API Integration, APIs, Cloudflare, Amazon API Gateway, Data Warehouse Design, Web Scraping, Amazon Kinesis, AWS Database Migration Service (DMS), AWS DataSync