Hershel Safer, Developer in Rehovot, Israel
Hershel is available for hire
Hire Hershel

Hershel Safer

Verified Expert  in Engineering

Software Developer

Location
Rehovot, Israel
Toptal Member Since
January 20, 2021

Hershel has extensive experience developing quantitative models in various industries, including finance, biopharma, and communications. His focus is the extraction of data-based insights to inform decision making. For the past decade, Hershel has worked primarily with machine learning; he also has expertise in optimization (linear, nonlinear, combinatorial, and graph). Hershel has a Ph.D. from MIT in operations research and an undergraduate degree from Yale in applied mathematics and economics.

Portfolio

Safer Analytics Ltd.
Feature Engineering, Anomaly Detection, Data Science, SQL Functions...
WorldQuant
Data Science, Machine Learning, Algorithms, Nonlinear Optimization...
Tel Aviv University
Graph Theory, Genomics, Linux, Combinatorial Optimization, Computer Science...

Experience

Availability

Part-time

Preferred Environment

Jupyter Notebook, Scikit-learn, C++, R, Python, MacOS, Linux

The most amazing...

...tool I've developed was a trading model that combined machine learning and non-linear optimization, and automatically adapted to changing market conditions.

Work Experience

Founder | Machine Learning Consultant

2015 - PRESENT
Safer Analytics Ltd.
  • Developed machine-learning models of credit risk and other issues that arise in business financing (merchant cash advances and invoice factoring).
  • Built models for daily and high-frequency trading of equity and non-equity instruments, as well as tools for automatically creating models and ensuring that they work robustly.
  • Presented a 2-day course in London for credit-risk professionals about using machine learning to increase their effectiveness.
  • Mentored aspiring data scientists to enhance their ability to create tools for quantitative analysis.
  • Consulted on techniques and tools of mathematical optimization to use for solving problems of product allocation and biomedical research.
Technologies: Feature Engineering, Anomaly Detection, Data Science, SQL Functions, Google Cloud Platform (GCP), SQL, Google BigQuery, BigQuery, Algorithms, Credit Risk, Python, Statistics, Machine Learning, Artificial Intelligence (AI), Ggplot2, Matplotlib, NumPy, Pandas, Jupyter, Jupyter Notebook

Senior Quantitative Researcher

2012 - 2014
WorldQuant
  • Developed and implemented computational models for statistical arbitrage on a variety of worldwide financial markets. Used a variety of techniques, including tools from machine learning and nonlinear optimization.
  • Created a computerized system to automatically generate new trading models.
  • Initiated an online system in the local office to facilitate information sharing among researchers.
Technologies: Data Science, Machine Learning, Algorithms, Nonlinear Optimization, Computer Science, Linux, Python, C++, Algorithmic Trading

Senior Research Fellow in Computer Science and Bioinformatics

2010 - 2012
Tel Aviv University
  • Participated in research on computational methods for discovering cellular components that are involved in causing diseases. Used techniques from machine learning, combinatorial optimization, and graph theory.
  • Wrote proposals for new research grants and progress reports for existing grants.
  • Managed laboratory infrastructure that included various kinds of computing resources.
Technologies: Graph Theory, Genomics, Linux, Combinatorial Optimization, Computer Science, Algorithms, Perl, Data Science, R, Machine Learning, Computational Biology, Bioinformatics

Head of Informatics

2009 - 2009
Dynamix Pharmaceuticals
  • Participated in the design, implementation, and population of chemical-compound database for a new drug-discovery company.
  • Managed the specifications and purchases of computers and storage.
  • Oversaw the purchases and installations of software for computational chemistry.
Technologies: Drug Development, Algorithms, Linux, IT, Chemistry

Head of the Bioinformatics Core Facility

2003 - 2009
Weizmann Institute of Science
  • Managed the core facility for data analysis in life-sciences research. Provided consulting, training, and computing resources for scientists at the Weizmann Institute and throughout Israel.
  • Expanded the unit’s scope to serve a wider range of laboratories and added support for new research areas: next-generation DNA sequencing and protein structure.
  • Managed the purchases and installations of computers and software for next-generation sequencing.
  • Chaired the Fifth European Conference on Computational Biology (ECCB), the second-largest annual international conference in the field. The conference was a great success—400 delegates from 35 countries, and a surplus from our $290,000 budget.
Technologies: Linux, Genomics, Algorithms, IT, Perl, Computational Biology, Bioinformatics

Head of Informatics

2002 - 2003
Zetiq Technologies
  • Managed and performed cheminformatics analyses for drug discovery.
  • Designed and implemented software and database to gather, qualify, and store results from high-throughput screening assays.
  • Developed analyses to identify interesting compounds for drug discovery.
Technologies: Drug Development, SQL Functions, IT, Chemistry, SQL, Microsoft Access

Senior Scientist

1999 - 2001
Compugen
  • Developed and implemented algorithms for designing genomics microarrays to profile the expression of splice variants.
  • Led a group that developed and implemented algorithms for identifying proteins via mass spectrometry.
  • Served as a project manager for the company’s largest customer.
  • Wrote scientific articles and coordinated weekly seminar series.
Technologies: Drug Development, Algorithms, Perl, Genomics, Computational Biology, Bioinformatics

Associate Director of Bioinformatics

1993 - 1998
Genome Therapeutics Corp.
  • Developed a type of software to analyze genomic data for commercial clients and the Human Genome Project.
  • Wrote the computational portions of two successful NIH grant applications.
  • Helped build and led a computing group of ~30 people; managed the hiring and resolved personnel issues.
Technologies: Drug Development, DNA Sequencing, Perl, Genomics, Computational Biology, Bioinformatics

Weight of Evidence Target Encoding

https://github.com/hsafer/weight_of_evidence
Weight of evidence (WOE) is used to assess the relative risk of the values taken by a categorical feature. For example, suppose that the goal is to predict the probability of default on loan. One feature might be homeownership, with values "own outright," "own with a mortgage," "rent," "live with parents," and "other."

With unordered, non-numeric values, a feature like this cannot be used directly with most machine-learning methods. WOE addresses this challenge by converting each value, say "own with a mortgage," to the relative risk of an applicant in the group with that value paying off the loan. Specifically, WOE is the log odds of the categorical value being associated with a negative target value.

When the WOE of a categorical value is positive, the probability of the loan being paid in full is above average among all applicants, and vice versa when WOE is negative.

WOE has advantages over the one-hot encoding that is commonly used for categorical features. WOE avoids the proliferation of features when one-hot encoding is used with categorical features that take many values. WOE also does not lose information the way that one-hot encoding does when tree-based algorithms select a subset of features at each node.

Paradigms

Data Science, Linear Programming, Anomaly Detection

Other

Machine Learning, Artificial Intelligence (AI), Operations Research, Combinatorial Optimization, Graph Theory, Statistics, Network Optimization, Algorithms, Computer Science, Statistical Analysis, Statistical Modeling, Nonlinear Optimization, Economics, Credit Risk, Algorithmic Trading, Computational Biology, Chemistry, IT, Genomics, DNA Sequencing, Feature Engineering, Google BigQuery, Drug Development, OR-Tools, Fintech

Languages

Python, R, C++, Perl, SQL

Libraries/APIs

Scikit-learn, Matplotlib, NumPy, Pandas, Ggplot2

Tools

Jupyter, Microsoft Access, BigQuery

Platforms

Linux, MacOS, Jupyter Notebook, Google Cloud Platform (GCP)

Storage

SQL Functions

Industry Expertise

Bioinformatics

1984 - 1992

Ph.D. in Operations Research

MIT | Massachusetts Institute of Technology - Cambridge, MA, United States

1981 - 1984

Master's Degree in Operations Research

Columbia University - New York, NY, United States

1977 - 1981

Bachelor's Degree in Applied Mathematics and Economics (Double Major)

Yale University - New Haven, CT, United States

OCTOBER 2020 - PRESENT

NVIDIA DLI Certificate – Applications of AI for Anomaly Detection

NVIDIA

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring