James Arnemann
Verified Expert in Engineering
Statistics Developer
San Francisco, CA, United States
Toptal member since August 2, 2019
James is an experienced data scientist and machine learning engineer with several years of industry experience and publications in leading journals. He's held positions researching and deploying machine learning and deep learning models at UC Berkeley, Intel, National Laboratories, and others.
Portfolio
Experience
Availability
Preferred Environment
Python
The most amazing...
...thing I've implemented was a deep learning algorithm looking at simulated dark matter distributions to predict cosmological parameters that govern our universe.
Work Experience
Machine Learning Engineer
Twosense
- Created models for behavioral biometrics and continuous multi-factor authentication in the field of cybersecurity.
- Designed and implemented a typing signature machine learning model used in production.
- Built deep learning models using accelerometer, GPS, and other mobile data to create multiple behavioral biometrics for a Department of Defense contract.
Program Director of Research Science
New York-Presbyterian Hospital
- Built predictive models using historical data to predict the number of patients in the emergency departments at the different NYP hospitals.
- Cleaned and parsed millions of electronic health records and determined hospital-acquired VTE (Venous thromboembolism) rates and metrics of how it's addressed by different hospitals and departments.
- Developed analytics for oncology rates of the different departments and different cancer types throughout NYPs ambulatory care network.
- Taught Python programming and data analysis courses to over 50 NYP employees.
Deep Learning Research Scientist
National Energy Research Scientific Computing Center (NERSC)
- Implemented deep learning architectures on cosmology simulations to understand and predict the parameters that govern the evolution of the universe.
- Collaborated with a diverse team from Lawrence Berkeley National Lab, Intel, and Cray, to run these models at state-of-the-art performance on the world's eighth-largest supercomputer.
- Published in SC18 (The International Conference for High-Performance Computing, Networking, Storage, and Analysis).
Graduate Student Researcher
UC Berkeley
- Led multiple computational projects and developed novel algorithms in machine learning.
- Developed a novel exploration algorithm using Bayesian non-parametric statistical analysis and information theory (accepted to NIPS 2014).
- Trained an autoencoder neural network to learn temporal dynamics of cellular automata evolution.
- Classified hand-written digits with an unsupervised neural network algorithm using only local learning rules.
- Mentored research assistants to take on original research projects.
Data Science Intern (Artificial Intelligence Group)
Intel
- Implemented Neural Style Transfer with VGG-19 (Convolutional Neural Network).
- Reconstructed audio spectrograms from hidden layer activations of Deep Speech 2 (many-layered Bidirectional Recurrent Neural Network model for speech to text).
- Developed a novel approach for style transfer applied to audio signals.
Experience
Behavioral Biometric Phone App for Continuous MFA
CosmoFlow: Using Deep Learning to Learn the Universe at Scale
https://sc18.supercomputing.org/proceedings/tech_paper/tech_paper_pages/pap429.htmlYou've Got Meal
Education
Ph.D. Degree in Physics
University of California Berkeley - Berkeley, CA
Master's Degree in Physics
University of California Berkeley - Berkeley, CA
Bachelor's Degree in Mathematics
University of Illinois Urbana - Champaign - Urbana, IL
Skills
Libraries/APIs
Scikit-learn, Pandas, NumPy, Matplotlib, TensorFlow, SciPy
Tools
MATLAB, Git, Jira, PyCharm
Languages
Python, SQL, C++
Platforms
Jupyter Notebook, Amazon Web Services (AWS), Linux, Unix
Paradigms
Agile Software Development, Agile
Other
Machine Learning, Data Science, Principal Component Analysis (PCA), Data, Linear Regression, Logistic Regression, Data Analytics, Data Analysis, Modeling, Data Modeling, Data Scraping, Web Scraping, Random Forests, Deep Learning, K-means Clustering, Statistics, Data Cleaning, Data Visualization, Naive Bayes, Convolutional Neural Networks (CNNs), Bayesian Statistics, Statistical Modeling, Predictive Analytics, Big Data, Support Vector Machines (SVM), Agile Data Science, Gradient Boosting, Artificial Intelligence (AI), Programming, Dashboards
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring