Josef Toman
Verified Expert in Engineering
Software Developer
Dobřichovice, Central Bohemian Region, Czech Republic
Toptal member since March 31, 2021
Josef is a broadly skilled developer with over a decade of professional experience, strong theoretical background, the capacity to quickly learn anything given enough documentation, and a passion for programming since childhood. His recent endeavors mainly included creating automated data pipelines and tools leveraging the acquired data. Josef cares deeply for a client's actual needs and has an eye for security, long-term stability, and maintainability.
Portfolio
Experience
- Database Design - 12 years
- PostgreSQL - 12 years
- Automation - 11 years
- JavaScript - 10 years
- Web Scraping - 10 years
- Elasticsearch - 6 years
- Amazon Web Services (AWS) - 5 years
- Python - 5 years
Availability
Preferred Environment
Linux, Vim Text Editor, Slack, GitLab, Trello, Amazon Web Services (AWS), PostgreSQL
The most amazing...
...thing I've created is the automated monitoring of real estate offers. It helped me find a home for my family, and I turned it into a profit-generating service.
Work Experience
Developer
Freelance Clients
- Developed a successful prototype of an event-driven simulation to model drug prices in the European Union. The prototype was integrated with Google Sheets to fulfill initial needs. Later, it was used as a basis for further in-house development.
- Assisted a team of analysts in creating a set of complex research reports by developing a framework for coding simple modules ("chapters") and providing ongoing support whenever necessary.
- Developed a front-end application for scheduling minute-by-minute factory production using streamlined drag-and-drop actions. Achieved real-time optimized reactions to dragging in an environment with various logical constraints.
- Developed a contact scraping tool. The input was a set of companies and their websites. The scraper should hop through the website and collect management contact information.
Data Engineer
Cogvio
- Developed a set of AWS Lambda functions containing web scrapers, ETL jobs, and reporting tools. These AWS Lambda functions were interconnected in a fully automated system that could deliver a report to a customer shortly after new data became available.
- Designed an advanced full-text search model for Elasticsearch based on analyzing thousands of search queries. The improved full-text featured specialized domain-specific capabilities, greatly enhancing the user's experience.
- Designed and maintained a PostgreSQL database instance with multiple schemas, a hierarchical system of roles, and granular privileges for all users, including admins, developers, analysts, and automated processes.
- Developed an automated service for scraping tables from PDF files using Amazon Textract. The service was configurable with Google Sheet data and included a Slack integration.
Lead Back-end Developer
Yeseter
- Led the redesign of a complex Elasticsearch data model with millions of documents resulting in a 10x-100x speed increase while using 30%-50% of original resources.
- Developed a web scraping methodology and guidelines for junior developers.
- Created a system autonomously discovering Facebook profiles/pages within selected countries, evaluating their influence, and efficiently reading published posts and comments.
- Increased the number of daily acquired documents (articles, blog posts, comments, chat messages, and so on) from tens of thousands to more than a million.
Technical Specialist
Institute of Formal and Applied Linguistics, Charles University
- Improved the visualization of a single dependency parse tree and created a new visualization showing related and interlinked parse trees alongside each other with a configurable layout.
- Built two types of in-browser visualization of language corpora using pure HTML5: one automatically aligned audio recordings with multiple manual transcriptions and the second interactively linked related parse trees with tree nodes and source text.
- Created automated processes to release several language corpora: data acquisition from multiple Git repositories, templates for technical writers, a static website build localized to two languages, and data transformation.
Full-stack Developer
Winternet
- Developed a new control panel for a web hosting provider, including domain registration and renewal, mail server management, database provisioning, and invoicing.
- Built a pluggable submodule of a generalized in-house framework that provided a hierarchical catalog.
- Carried out a security audit of an application for an important client, identifying many security issues.
Technical Specialist
Institute of Formal and Applied Linguistics, Charles University
- Managed the data of a language corpus during its development.
- Developed tools to simplify and quicken the annotation process using automation where possible.
- Built a tool to measure the amount of work of diverse team members in a given month by computing differences in dependency parse trees committed into the project's Git repository.
- Developed an extension for the annotation tool, providing the annotator with dozens of structural checks to improve the quality and consistency of the submitted data.
Experience
Framework for Automated Reporting
I designed and implemented a framework that provides core features and allows the team to write auto-discoverable "plug-ins" (chapters, slides, QA checks). Since then, I have been providing support when necessary.
Core Features:
• CLI scripts
• Extensive YAML configuration
• ETL
• Output to DOCX, HTML, and XLSX files
• Methods for simple creation of charts and tables carefully tailored to fit the data and required use-cases
Key Characteristics:
• Stringent security and privacy measures concerning the data
• Attention to detail—every single data field must be loaded and interpreted correctly or fixed upon discovery of an error
• Very opinionated design requirements
The customer and the nature of the data are not disclosable.
Factory Production Scheduler
Distinctive Features:
• 100% operable with mouse only (project requirement)
• Complex drag-and-drop actions
• A configurable mocked API to enable a full-featured experience before the actual API is implemented (random data are generated on-the-fly)
Other Contributions Besides App Development:
• Thorough initial analysis
• UX contributions
• Design based on the company's visual identity standards and consultation with a pro designer
• API design and detailed specifications
EU Drug Prices Simulation
The simulation was developed using Python and leveraged the Google Sheets API to input and output data. This allowed for seamless integration with existing spreadsheets and easy collaboration with stakeholders.
My client successfully used the prototype to fulfill the needs of consulting tasks for initial clients, on time and to satisfaction. Later, the prototype was used as a basis for further in-house development.
Information System for a Tire Service Company
Project Deliverables:
1. A public presentation of the company and its branches and a reservation system for customers
2. An application for the technical staff designed for touch screens (tablets) providing:
• Calendar with reservations
• A step-by-step guide navigating the staff through each service case, including SMS notifications and invoicing
Education
Master's Degree in Computer Science
Charles University - Prague, Czech Republic
Bachelor's Degree in Computer Science
Charles University - Prague, Czech Republic
Skills
Libraries/APIs
REST APIs, Vue, Pandas, Matplotlib, Vuex, jQuery
Tools
Slack, GitLab, Git, Vim Text Editor, Trello, NGINX, Stylus, Sentry, Google Docs, Terraform
Languages
Python, SQL, Perl, PHP, JavaScript, HTML, CSS, Bash, C
Paradigms
Database Design, ETL, Automation, Object-oriented Programming (OOP), Event-driven Programming
Platforms
AWS Lambda, Linux, Amazon Web Services (AWS), Docker
Storage
PostgreSQL, Elasticsearch, MySQL, JSON, CouchDB, SQLite, MongoDB, Databases, Amazon S3 (AWS S3), Amazon DynamoDB
Other
Web Scraping, APIs, Scraping, Data Extraction, Scripting, Data Scraping, Data Engineering, Data Modeling, Programming, Algorithms, Data Structures, Discrete Mathematics, Calculus, Probability Theory, Linear Algebra, Natural Language Processing (NLP), ELT, Generative Pre-trained Transformers (GPT), Statistics, Data Science, Amazon RDS
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring