
Vít Baisa
Verified Expert in Engineering
Software Developer
Brno, South Moravian Region, Czech Republic
Toptal member since July 6, 2020
Vít has been making a living as a software engineer for more than eight years. Vit is also a natural language processing expert, and the software he's developed has been presented at international conferences and used by thousands of users all over the world. Vit has worked in academia, the private sector, and also as a freelancer in multiple roles, including researcher, software engineer, developer, and lecturer.
Portfolio
Experience
- Python - 12 years
- Artificial Intelligence (AI) - 10 years
- Data Scraping - 10 years
- Computational Linguistics - 10 years
- Generative Pre-trained Transformers (GPT) - 10 years
- Linguistics - 10 years
- Natural Language Processing (NLP) - 10 years
- Back-end - 8 years
Availability
Preferred Environment
Google Workspace, Slack, Command-line Interface (CLI), Vim Text Editor, Git, Linux
The most amazing...
...software I've developed is Sketch Engine, a state-of-the-art tool for lexicographers, linguists, teachers, translators, and language enthusiasts in general.
Work Experience
Senior Software Engineer
Freelance
- Developed a tool for an advanced English language analysis and visualization used by English language learners.
- Developed a system for continuous web scraping and analysis of millions of data points about overindebtedness affecting millions of people.
- Implemented a web app for annotating lexical resources (English, Dutch, and Italian).
Data Engineer
Merlon
- Developed and deployed a customized web scraper providing a new type of input into the company's data pipeline.
- Integrated a data provider API into company's data pipeline processing millions of documents per day.
- Implemented improvements of the company's internal annotation tool, significantly improving and speeding up the annotation process and further data processing.
Senior Full-stack Developer
Lexical Computing
- Developed a complex web app (front- and back-end) for advanced language analysis used by thousands of users worldwide.
- Developed a web app for terminology extraction with the state-of-the-art extraction quality.
- Scraped, cleaned, processed, and published several large datasets containing billions of words in tens of world languages.
Researcher
Masaryk University
- Developed a system for a continuous tracking and analyzing selected propaganda websites.
- Published more than 50 academic papers with almost 1,000 citations.
- Devised and supervised more than 20 bachelor and master theses.
- Gave more than ten in-person tutorials, workshops, seminars for beginner programmers, linguists and other language specialists.
Experience
Online Tool for Language Analysis
https://www.sketchengine.eu/I worked both on the back end (Python) and front end (JavaScript, Riot.js, jQuery, and Materialize). I added several new features into the tool and maintained its codebase.
Terminology Extraction Tool
https://terms.sketchengine.eu/I implemented the front end and the back end together with the full pipeline: from uploading a document (Word, PDF, and more), to processing it on the server, extracting plain text, automatically annotating with linguistic information, extracting the terminology, to providing the output to the user. The tool can also extract bilingual terminology.
A Map of Bankruptcy in the Czech Republic
http://www.mapabankrotu.cz/SkELL: A Language Learning Web App
I implemented both the front end and the back end which works with a billion-word dataset for English. Later, I added also other languages: Russian, Estonian, French, Italian, and German.
Text Profiler Web App
http://www.versatile.pub/versatext-info.htmlEducation
Ph.D. in Computer Science
Masaryk University - Brno, Czech Republic
Skills
Libraries/APIs
D3.js, jQuery, REST APIs
Tools
LaTeX, Git, Vim Text Editor, Asana, Slack, Shell, Google Workspace
Languages
Python, Python 3, JavaScript, Bash
Platforms
Web, Linux
Frameworks
Django
Other
Natural Language Processing (NLP), APIs, Data Scraping, Computational Linguistics, Linguistics, Generative Pre-trained Transformers (GPT), Machine Learning, Artificial Intelligence (AI), Tutoring, Data Visualization, Computer Science, Back-end, Web Standards, Algorithms, Big Data, Full-stack, Command-line Interface (CLI), ClickUp, Data Science, Front-end
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring