
Lasse Hyyrynen
Verified Expert in Engineering
Data Scientist and Developer
Lasse is a data scientist with a background in mathematics. He excels in Natural Language Processing (NLP), data modeling, machine learning, Python, and artificial intelligence projects. He works on real-world problems that explore the extensive possibilities of ML and AI. Lasse has developed ML models that detect cybersecurity threats, created processing pipelines for machine learning models, and improved speech recognition accuracy by enhancing the pronunciation model for foreign words.
Portfolio
Experience
Availability
Preferred Environment
Linux, PyCharm, Slack
The most amazing...
...product I've created is the "JF" JSON and YAML query tool for performing complex transformations on datasets.
Work Experience
Data Scientist
F-Secure
- Developed machine learning models to detect cyber security threats.
- Built tools to integrate machine learning to various services.
- Developed the company MLOps practices to provide top quality AI services.
Text Data Scientist
Utopia Analytics
- Developed classifiers to various discussion forums and market places to automate human moderator work.
- Researched deep learning models to enhance classification results.
- Monitored model quality and built automation to retrain the model using the latest data.
Software Architect
Lingsoft
- Developed a scalable processing pipeline for machine learning models.
- Improved speech recognition accuracy by enhancing the pronunciation model for foreign words.
- Developed and enhanced multiple algorithms for specific NLP tasks.
- Managed multiple projects to meet customer requirements.
Experience
JF Dataset Filtering Tool
https://github.com/alhoo/jfSkills
Languages
Python 3, Python, SQL, C++
Libraries/APIs
Scikit-learn, Pandas, PyTorch, PySpark, SciPy, NumPy, Matplotlib, Asyncio, TensorFlow, Spark ML, Luigi
Paradigms
Data Science
Platforms
Linux, Docker, AWS Lambda, Apache Kafka
Other
Machine Learning, Deep Learning, Speech to Text, Bokeh, SIP Protocol, Indexing, Natural Language Processing (NLP), Visualization Tools, Clustering, Data Visualization, Data Engineering, Decision Tree Classification, GPT, Generative Pre-trained Transformers (GPT), Data Analysis, Decision Tree Regression, Predictive Modeling, Time Series Analysis
Frameworks
Spark, Apache Spark
Tools
PyCharm, Jupyter, GitLab CI/CD, Slack, Kaldi, RabbitMQ, Amazon Elastic MapReduce (EMR), AWS Batch
Storage
MongoDB, PostgreSQL, Azkaban
Education
Master's Degree in Mathematics and Computer Science
Aalto University - Espoo, Finland