Vít Baisa, Developer in Brno, South Moravian Region, Czech Republic
Vít is available for hire
Hire Vít

Vít Baisa

Verified Expert  in Engineering

Software Developer

Location
Brno, South Moravian Region, Czech Republic
Toptal Member Since
July 6, 2020

Vít has been making a living as a software engineer for more than eight years. Vit is also a natural language processing expert, and the software he's developed has been presented at international conferences and used by thousands of users all over the world. Vit has worked in academia, the private sector, and also as a freelancer in multiple roles, including researcher, software engineer, developer, and lecturer.

Availability

Part-time

Preferred Environment

G Suite, Slack, Command-line Interface (CLI), Vim Text Editor, Git, Linux

The most amazing...

...software I've developed is Sketch Engine, a state-of-the-art tool for lexicographers, linguists, teachers, translators, and language enthusiasts in general.

Work Experience

Senior Software Engineer

2018 - PRESENT
Freelance
  • Developed a tool for an advanced English language analysis and visualization used by English language learners.
  • Developed a system for continuous web scraping and analysis of millions of data points about overindebtedness affecting millions of people.
  • Implemented a web app for annotating lexical resources (English, Dutch, and Italian).
Technologies: Shell, JavaScript, Python

Data Engineer

2020 - 2021
Merlon
  • Developed and deployed a customized web scraper providing a new type of input into the company's data pipeline.
  • Integrated a data provider API into company's data pipeline processing millions of documents per day.
  • Implemented improvements of the company's internal annotation tool, significantly improving and speeding up the annotation process and further data processing.
Technologies: Back-end, Python 3, Big Data

Senior Full-stack Developer

2012 - 2020
Lexical Computing
  • Developed a complex web app (front- and back-end) for advanced language analysis used by thousands of users worldwide.
  • Developed a web app for terminology extraction with the state-of-the-art extraction quality.
  • Scraped, cleaned, processed, and published several large datasets containing billions of words in tens of world languages.
Technologies: Shell, JavaScript, Python

Researcher

2009 - 2020
Masaryk University
  • Developed a system for a continuous tracking and analyzing selected propaganda websites.
  • Published more than 50 academic papers with almost 1,000 citations.
  • Devised and supervised more than 20 bachelor and master theses.
  • Gave more than ten in-person tutorials, workshops, seminars for beginner programmers, linguists and other language specialists.
Technologies: Shell, JavaScript, Python

Online Tool for Language Analysis

https://www.sketchengine.eu/
A web tool for building, managing, and analyzing large text datasets.
I worked both on the back end (Python) and front end (JavaScript, Riot.js, jQuery, and Materialize). I added several new features into the tool and maintained its codebase.

Terminology Extraction Tool

https://terms.sketchengine.eu/
A web-based tool for extracting keywords and terminology from documents uploaded by users.
I implemented the front end and the back end together with the full pipeline: from uploading a document (Word, PDF, and more), to processing it on the server, extracting plain text, automatically annotating with linguistic information, extracting the terminology, to providing the output to the user. The tool can also extract bilingual terminology.

A Map of Bankruptcy in the Czech Republic

http://www.mapabankrotu.cz/
I scraped, processed, cleaned, and analyzed data for the map which helps with discovering and understanding a large social problem in the Czech Republic. I implemented a system that continuously provides up-to-date data for overindebtedness specialists.

SkELL: A Language Learning Web App

http://skell.sketchengine.co.uk
A web-based tool for learning words and language patterns from a large monolingual dataset.
I implemented both the front end and the back end which works with a billion-word dataset for English. Later, I added also other languages: Russian, Estonian, French, Italian, and German.

Text Profiler Web App

http://www.versatile.pub/versatext-info.html
An online tool for analyzing English texts: from tokenization, part-of-speech tagging, lemmatization to text statistics, keyword extraction, language profiling and data visualization. Written in JavaScript (Riot.js).
2009 - 2016

Ph.D. in Computer Science

Masaryk University - Brno, Czech Republic

Libraries/APIs

D3.js, jQuery, REST APIs

Tools

LaTeX, Git, Vim Text Editor, Asana, Slack, Shell, G Suite

Languages

Python, Python 3, JavaScript, Bash

Platforms

Web, Linux

Frameworks

Django

Paradigms

Data Science

Other

Natural Language Processing (NLP), APIs, Data Scraping, Computational Linguistics, Linguistics, GPT, Generative Pre-trained Transformers (GPT), Machine Learning, Artificial Intelligence (AI), Tutoring, Data Visualization, Computer Science, Back-end, Web Standards, Algorithms, Big Data, Full-stack, Command-line Interface (CLI), ClickUp, Front-end

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring