
Alexandra Soroka
Verified Expert in Engineering
Machine Learning Developer
Saint Petersburg, Russia
Toptal member since May 22, 2019
Alexandra's always wanted to work with mathematics and language, so she sought out an education that combined programming and linguistics—and has been quite successful. She has about five years of NLP experience at Yandex, a major Russian search engine, and two years at smaller companies plus freelancing. Most of her projects involve entity recognition, but she’s also ranked search results, generated query expansions, and done text summarization.
Portfolio
Experience
- Linux - 7 years
- Machine Learning - 7 years
- Natural Language Processing (NLP) - 7 years
- Generative Pre-trained Transformers (GPT) - 7 years
- C++ - 5 years
- Natural Language Toolkit (NLTK) - 4 years
- Python 3 - 4 years
- Keras - 2 years
Availability
Preferred Environment
Git, Linux
The most amazing...
...thing I’ve built was a named-entity-recognition component for financial text (English/Dutch); clarified the requirements, built the datasets and model.
Work Experience
Senior Software Engineer
Huawei
- Helped design a web search from scratch—designing and ranking quality evaluation systems, dataset building processes, and so on.
Data Scientist
Itexus
- Created a working named-entity-recognition component for English and Dutch from scratch.
- Clarified the requirements.
- Built the dataset.
- Developed the neural network and wrapped it into a library which then applied it to various financial texts.
Chief Data Scientist
Econophysica
- Worked on a short-term (two months) project where I extracted oil field attributes from geological reports.
Researcher | Software Developer
Yandex
- Enhanced and developed a named-entity-recognition system for search queries in the linguistics department.
- Optimized the search result ranking in the ranking, relevance, and linguistics department.
- Maintained a query expansions generation system.
- Took part in structuring information for a knowledge graph in the web ontologies department.
Software Developer Intern
Yandex
- Enhanced the performance of a named-entity-recognition system for search queries.
Experience
A NER Component for Financial Text
A Text Summarization Project
Education
Specialist's Degree (Equivalent to a Master's Degree) in Computer Science
Russian State University for the Humanities - Moscow, Russia
Skills
Libraries/APIs
Keras, Natural Language Toolkit (NLTK), Beautiful Soup
Tools
Git, Gensim
Paradigms
MapReduce
Platforms
Linux
Languages
C++, Python 3, Python
Other
Natural Language Processing (NLP), Generative Pre-trained Transformers (GPT), Machine Learning, Random Forests, Web Search, Deep Learning
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring