
Jędrzej Wydra
Verified Expert in Engineering
Data Scientist and Developer
Skórka, Poland
Toptal member since October 24, 2025
Jędrzej bridges law, data science, and AI, specializing in evidence evaluation, machine learning, and statistical modeling. His work focuses on interpretability and numerical precision, combining analytical rigor with practical efficiency. Guided by the belief that simplicity is strength, Jędrzej designs solutions where every parameter has a reason—and every model, a story. Challenges drive him, and he notes that perseverance is built through ritual, not a single act.
Portfolio
Experience
- Law - 10 years
- Statistics - 8 years
- R - 8 years
- Forensic Science - 8 years
- Data Analysis - 8 years
- Data Science - 5 years
- Research - 5 years
- Python - 4 years
Preferred Environment
MacOS, R, Python, Research, Statistics, Machine Learning, Law, AI Ethics, Data Analysis, Forensic Science
The most amazing...
...project I’ve done is a model used to reconstruct temperature conditions at a crime scene in a real murder investigation.
Work Experience
PhD Researcher
Adam Mickiewicz University
- Created an EM algorithm to fit finite Weibull mixture models for estimating the time of death from insect data. The method captures population heterogeneity and yields accurate 95% coverage in a reproducible Python-based interval estimation workflow.
- Developed FDA workflows to reconstruct crime scene temperatures for time-of-death estimation. Used Fourier and concurrent models to cut measurement time from days to hours while maintaining high reconstruction accuracy.
- Built an R-based Bayesian workflow for handwriting evidence using Jaccard similarity and likelihood ratios. Combined beta fitting, MDE estimation, and heatmap visualization to enhance transparency and statistical validity in forensic analysis.
- Implemented independence tests for functional data using DIME/tDIME with basis expansion and GPU-based simulations. Benchmarked against HSIC and distance covariance, achieving strong power and type I error control.
Experience
Commentary on the AI Act in Polish
The work, created through collaboration between lawyers, data scientists, and AI engineers, offers nearly 100 practical examples of applying legal provisions in real-world scenarios. These examples address risk classification, cybersecurity, ethical concerns, system labeling, certification, and incident response. This interdisciplinary approach bridges legal, ethical, and technical perspectives, offering practical guidance for lawyers, public officials, IT specialists, and business leaders implementing AI solutions in compliance with EU law.
Interval Estimation of Thermal Summation Parameters in Forensically Important Insects
We developed an alternative statistical method that simultaneously estimates interval values for developmental parameters and identifies component populations within datasets. The approach fits finite mixtures of Weibull distributions to development time data using the Expectation–Maximization (EM) algorithm, allowing the capture of individual-level variation.
Using developmental datasets of Creophilus maxillosus and Necrodes littoralis (Staphylinidae), our model achieved 95% coverage near the nominal level, while classical methods captured only 59% and 75% of cases. This improvement demonstrates that mixture-based interval estimation better reflects biological heterogeneity and enhances the reliability of postmortem interval calculations. Beyond forensic entomology, the method offers a generalizable framework for thermal ecology and applied biological modeling.
Reliability of Fingerprint Experts in Extracting & Evaluating Minutiae in Tests of Fingerprint
https://doi.org/10.1016/j.jflm.2025.102943Thirty Polish experts identified and rated seven minutiae in two fingerprint traces of differing quality on a five-point scale. Inter-rater agreement was measured with Krippendorff’s alpha, and correlations were used to explore the link between selection frequency and perceived value.
Results showed low agreement in both selection and evaluation (α = 0.39 for high-quality and 0.42 for low-quality traces) and a strong negative correlation between selection frequency and perceived uniqueness (r = -0.84 and -0.89). These findings highlight significant variability in expert judgments and suggest that widely selected features are viewed as less discriminative.
The study emphasizes the need for systematic verification. Multi-expert reviews could improve reliability and reduce the risk of forensic identification errors.
Distance of Mean Embedding for Testing Independence of Functional Data
https://doi.org/10.1016/j.sigpro.2025.109959Our methods are compared with tests based on distance covariance and the Hilbert–Schmidt independence criterion. To evaluate their effectiveness, we presented simulation studies and two real data examples using air pollution and chemometric data sets. The new testing procedures demonstrated favorable finite-sample properties, effectively controlling the type I error rate and exhibiting competitive power, making them viable alternatives to covariance-based tests.
Forensically Useful Mid-term and Short-term Temperature Reconstruction for Quasi-indoor Death Scenes
https://doi.org/10.1016/j.scijus.2024.12.004This study introduces two functional data analysis (FDA) models that reconstruct ambient temperature using only temperature readings while shortening the required measurement period. The concurrent regression model supports mid-term reconstruction, while a Fourier-based model enables accurate short-term estimation.
Both models were tested on data from six quasi-indoor environments, including heated and unheated buildings and underground conditions. The concurrent model achieved near-perfect accuracy after six days, and the Fourier model matched its performance after only four to five hours.
These FDA-based methods, compatible with existing forensic procedures, show that reliable PMI temperature correction can be achieved with minimal measurement time.
Competition, Cooperation, & Parental Effects in Larval Aggregations Formed on Carrion
We hypothesized that exploitative competition dominates larval behavior, while cooperative interactions arise from collective exodigestion. Using experimental colonies with manipulated parental effects (present/absent) and varying larval densities (0.02-1.9 larvae/g), we measured fitness components, development time, and thermogenesis.
Results showed a strong negative group-size effect on larval fitness in colonies with parental effects and in high-density groups without them. However, development time shortened and heat production increased with density, indicating partial benefits of aggregation. Positive fitness effects emerged only at low densities without parental influence.
These findings suggest that larval societies of N. littoralis are primarily shaped by exploitation competition, moderated by both density and parental effects.
Improving Accuracy of Age Estimates for Insect Evidence
To address this limitation, we developed a statistical formula that calibrates k for specific insect evidence without requiring explicit “k versus size” models. The approach integrates the general thermal summation constant (with standard error), published species size ranges, and measurements of individual insects collected at the death scene.
The method was validated using datasets of Creophilus maxillosus and Necrodes littoralis (Coleoptera). For atypically sized insects, the formula reduced mean errors by approximately 25 ADD in C. maxillosus and 40 ADD in N. littoralis.
This calibration approach enables more accurate PMI estimation, particularly for outlier specimens, and can be applied in forensic casework where species-specific size–development data are limited. It provides a practical, reproducible way to account for biological variation without requiring additional experimental datasets.
Likelihood Ratio to Evaluate Handwriting Evidence Using Similarity Index
The method focuses on the similarity between handwriting samples, quantified using the Jaccard index from the results of a typical forensic handwriting comparison. The numerator of the likelihood ratio is the probability of a given class of similarity, assuming that a given person wrote the questioned sample. The denominator is the probability of the same similarity class, assuming that a randomly selected person wrote the questioned sample. The similarity distribution to quantify the numerator is derived from comparisons across reference handwritings. To calculate the denominator, we proposed developing similarity distributions relevant to particular forensic scenarios. In the proof-of-concept study, we developed the distribution for the simulation scenario.
The Optimal Post-eclosion Interval While Estimating the Post-mortem Interval
This study simulated PMI estimation for empty puparia of Protophormia terraenovae and Stearibia nigriceps under varying PEIs (0-90 days) and temperature conditions.
PEI did not affect PMI for specimens collected in winter or early spring (December-April). From late spring to autumn (May-November), PMI increased with PEI, most strongly in summer and only slightly in autumn.
The shortest PMI always corresponded to a PEI of 0, confirming that the minimum PMI is obtained with this assumption. For spring samples, oviposition likely occurred the previous year, while in summer, this was indicated only for longer PEIs.
These results refine PMI estimation from empty puparia and identify the conditions under which PEI can be ignored or not.
Education
Master's Degree in Theoretical Mathematics
Adam Mickiewicz University in Poznań - Poznań, Poland
Bachelor's Degree in Mathematics and Data Analysis
Adam Mickiewicz University in Poznań - Poznań, Poland
Master's Degree in Law
Adam Mickiewicz University in Poznań - Poznań, Poland
Certifications
Agile Project Management
Google | via Coursera
Azure Fundamentals (AZ-900)
Microsoft
Data Scientist
DataCamp
Data Literacy
DataCamp
AI Fundamentals
DataCamp
Introduction to Machine Learning
Coursera (Duke University)
Bayesian Statistics
Coursera (University of California, Stanta Cruz)
Data Analyst Professional
DataCamp
R Programming
Coursera (John Hopkins University)
The Data Scientist's Toolbox
Coursera (John Hopkins University)
Skills
Languages
R, Python
Paradigms
Agile Project Management
Platforms
MacOS
Other
Research, Statistics, Machine Learning, Law, Data Analysis, Forensic Science, Mathematical Modeling, Methodology, Analytics, Data, Data Visualization, Data Science, AI Ethics, Legal Research, Critical Thinking, Bayesian Statistics, AI Law, Data Engineering, Simulations, Artificial Intelligence (AI), Generative Artificial Intelligence (GenAI), Cloud Computing
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring