AI/Machine Learning Developer
2023 - PRESENTParagon Component Systems LLC- Optimized the truss webs using Bayesian optimization, having an unknown number of webs, and optimized the cost maintaining that the structure passed strength and structure requirements.
- Worked with different truss structures and found an optimum number of webs to lower cost.
- Designed a system to find a matching of previously designed truss structure with the structure that passed strength requirements.
Technologies: C#, Machine Learning, Artificial Intelligence (AI), Deep LearningMachine Learning Expert for Data Science POC
2021 - 2022Breadcrumb Data Limited- Developed time series anomaly detection using autoencoders for bearing and mills.
- Used a Controller Area Network (CAN bus) to build a vibration anomaly detection. The system uses the accelerometer acceleration data to estimate velocity and displacement.
- Estimated power spectrum to estimate the harmonics of vibration and use that to estimate the health of the equipment.
- Used TensorFlow to build this application and deployed it to Pi Zero and Pi 4.
Technologies: TensorFlow Deep Learning Library (TFLearn), Data Science, Machine Learning, Feature Engineering, Variational Autoencoders, TensorFlow, IOTA, Raspberry Pi, Autoencoders, CAN Bus, Data Analysis, Forecasting, Data Visualization, API Integration, Simulations, Inference, Fine-tuning, Recommendation Systems, Communication, Open Source, Pandas, Bash, Linux, CUDA, GPU Computing, NumPy, AI Design, Time Series, Neural Networks, Artificial Neural Networks (ANN), Hardware, Real-time Data, Scikit-optimize, MathematicsData Scientist and AI Consultant
2021 - 2022S Wave International Corp- Developed deep learning models to auto-tag music records using generated spectrogram images created from music records.
- Tracked and fixed bugs using Jira as a reporting tool.
- Deployed the production model in AWS Fargate with an elastic application load balancer to auto-scale the auto-tagging of music records depending on demand.
- Estimated the beat per minute using signal processing and deep learning. The project used the ViT model to classify spectrograms.
- Used PyTorch Lightning and DeepSpeed to train models using a large number of tracks.
Technologies: Data Science, Machine Learning, Data Engineering, AIOps, Amazon Web Services (AWS), Models, Data, Projects, Hang Tags, Transformer Models, Fast.ai, Scikit-learn, Time Series Analysis, Docker, Audio, Signal Processing, Data Visualization, API Integration, Inference, Fine-tuning, DeepSpeed, PyTorch Lightning, Communication, Open Source, Pandas, Bash, Linux, CUDA, GPU Computing, Project Management, NumPy, AI Design, Time Series, Amazon DynamoDB, Neural Networks, Artificial Neural Networks (ANN), Text Analytics, Scikit-optimize, MathematicsGIS Platform Architect/Engineer
2021 - 2021Birch Infrastructure, PBLLC- Replicated data from Velocity Suite to BigQuery, performed many ETL operations, and generated materialized views using DBT.
- Converted geospatial data from shapefiles into BigQuery spatial.
- Generated a materialized view of LMP prices with weather information, updated hourly.
- Created Prefect flows to create jobs for data scrapping and downloading from various sources into BigQuery.
- Created APIs to perform data downloads from various sources like FRED, LMP prices, and other spatial data sources.
Technologies: Python, Data Build Tool (dbt), Prefect, ArcGIS, GPS, Docker, Google Kubernetes Engine (GKE), Data Analysis, Statistics, Data Analytics, Data Visualization, API Integration, Analytics, Datasets, Inference, Fine-tuning, Google BigQuery, Communication, Open Source, Data Mining, ETL, Pandas, Bash, Data Reporting, Leadership, Architecture, NumPy, Time Series, Data Processing Automation, Text Analytics, Web Scraping, Mathematics, Mathematical Analysis, BigQueryDeveloper
2020 - 2021Toptal Client- Created an API for a chatbot to train medical students with virtual patients for the Meksi project in Sydney.
- Used pre-trained BERT models with examples sent from the client.
- Created an API to infer the intent from user inquiries.
- Built a chatbot using the intent classifier, Named Entity Recognition (NER).
- Merged the Bert Response with an IBM Watson assistant response based on confidence to identify the intent. The results were 97% accurate across 512 intents.
Technologies: Language Models, BERT, Python 3, PyTorch, Fast.ai, Scikit-learn, Natural Language Processing (NLP), GPT-2, GPT-J, Docker, Data Analysis, Statistics, Data Visualization, Causal Inference, Text Generation, Inference, Fine-tuning, Communication, Open Source, ETL, Machine Learning Operations (MLOps), Pandas, Bash, Linux, CUDA, GPU Computing, NumPy, AI Design, Neural Networks, Artificial Neural Networks (ANN), Chatbots, Sentiment Analysis, Chatbot Conversation DesignMachine Learning Consultant
2019 - 2021Oyu Tolgoi LLC- Converted seismic traces into the spectrogram to be used with image detection, a Resnet50 model.
- Built a multi-input model with the height of the event as an embedding input and spectrogram as the second input. The model and approach will be published in the scientific journal Bulletin of the Seismological Society of America.
- Deployed the model successfully. It is planned to go into production in 2020.
Technologies: Computer Vision, Deep Learning, Dlib, OpenCV, Qt, C++, PyTorch, TensorFlow, Keras, Python, Time Series Analysis, Google Kubernetes Engine (GKE), Audio, Data Visualization, Inference, Fine-tuning, Communication, Open Source, Azure Databricks, Pandas, Bash, Linux, CUDA, GPU Computing, NumPy, AI Design, Neural Networks, Artificial Neural Networks (ANN), Mathematics, Mathematical AnalysisMachine Learning Consultant
2015 - 2021Global Unmanned System- Developed algorithms to estimate above-ground biomass using point cloud data from drone images.
- Created object detection software for sealion detection in drone images.
- Classified images for Sandalwood detection in drone images.
- Developed various image analysis software for drone images using OpenCV.
- Created various GIS applications for satellite images and point cloud data.
- Developed a program to help ship berthing at Fremantle Port using slam algorithm and graph network.
- Gathered requirements and met with mine managers and refineries to learn their problems and find possible projects.
Technologies: Amazon Web Services (AWS), ArcPy, Point Clouds, Amazon SageMaker, 3D Image Processing, Image Processing, Predictive Analytics, Computer Vision, ArcGIS Runtime SDK for .NET, PyTorch, Data Science, GIS, MySQL, Agile Software Development, Deep Learning, Git, XGBoost, Keras, OpenCV, R, Python, Data Analysis, Object Detection, Video Processing, Statistics, Data Visualization, Inference, Fine-tuning, Algorithms, Open Source, Pandas, Bash, Data Reporting, Leadership, Real-time Vision Systems, Architecture, NumPy, AI Design, Neural Networks, Artificial Neural Networks (ANN), Real-time Data, Data Processing Automation, Mathematical AnalysisSenior Data Scientist
2018 - 2020Freelance Work- Developed refineries predictive maintenance using machine learning in Databricks, Azure Machine Learning service, and Azure Machine Learning Studio.
- Built time series prediction using Keras and PyTorch for anomaly detection.
- Built time series prediction with LSTM/CNN using multivariate one-minute sensors data.
- Developed a PowerBI dashboard for mining with Fleet Management System.
- Built a sound and vibration equipment health using a convolution neural network.
- Managed external contractors to evaluate cloud technology and perform a proof-of-concept solution to common anomaly detection in time series data and apply it to all pumps in the refinery.
Technologies: Transformer Models, BERT, Language Models, Reinforcement Learning, Uniformance Process History Database (PHD), OSI Model, Data Engineering, ArcPy, Point Clouds, RStan, Image Processing, Predictive Analytics, Kubernetes, Microsoft Power BI, ArcGIS Runtime SDK for .NET, PyTorch, Data Science, GIS, SQL, Agile Software Development, Deep Learning, Artificial Intelligence (AI), Statistical Learning, Databricks, Azure ML Studio, Azure Machine Learning, C++, R, Python, Scikit-learn, Time Series Analysis, Signal Processing, Forecasting, Statistics, Data Analytics, Data Visualization, Physics Simulations, Inference, Fine-tuning, Open Source, ETL, Azure Databricks, Machine Learning Operations (MLOps), XGBoost, Pandas, Bash, Data Reporting, CUDA, NumPy, Time Series, Neural Networks, Artificial Neural Networks (ANN), Data Processing Automation, Mathematics, Mathematical AnalysisSenior Data Scientist
2019 - 2019Western Power- Built energy demand time series prediction with multivariate half-hourly input data using LSTM/CNN neural networks.
- Created a production solution to use forecasted weather data to forecast demand deployed to AWS Fargate.
- Project-managed through Jira tickets and code stored in Bitbucket.
Technologies: ArcGIS Runtime SDK for .NET, Data Science, GIS, Agile Software Development, Artificial Intelligence (AI), TensorFlow, Keras, Python, ArcGIS, Time Series Analysis, Data Analysis, Sensor Fusion, Forecasting, Statistics, Data Visualization, Analytics, Physics Simulations, Inference, Fine-tuning, JavaScript, Open Source, Plotly, ETL, Pandas, Bash, Data Reporting, Linux, Leadership, CUDA, Time Series, Neural Networks, Artificial Neural Networks (ANN), Text Analytics, MathematicsSenior Spatial Engineer
2016 - 2018BHP- Helped a big mining company to take advantage of its spatial data.
- Created driver behavior analysis software for a mining operation.
- Worked with natural language processing with Keras.
- Developed various GIS projects using ArcPy and C# arcobjects.
- Created predictive models using machine learning and Apache Spark (data bricks).
- Completed a time series data analysis using Kalman filters for vehicle tracking.
- Mounted edge devices on diggers in an underground mine (Olympic Dam Mine) to classify the underground signs and determine if the bucket was full or empty.
- Built a data pipeline using Java to take data from the data logger through Kafka into the Hadoop cluster.
- Gathered requirements and met with mine managers and refineries to learn about their problems and find possible projects.
- Developed an R Shiny dashboard for mining, a hotspot of high-rack events.
Technologies: Amazon Web Services (AWS), Cloudera, Hortonworks Data Platform (HDP), OSI Model, Data Engineering, Point Clouds, RStan, ArcGIS GeoEvent Server, 3D Image Processing, Redshift, Spotfire, Microsoft Power BI, Hadoop, ArcGIS Runtime SDK for .NET, GIS, SQL, Agile Software Development, Statistical Learning, RStudio Shiny, Kibana, Elasticsearch, Oracle, Microsoft SQL Server, Git, Apache Kafka, C#, ArcPy, Keras, Python, Esri, Kalman Filtering, ArcGIS Server, ArcGIS, GPS, Docker, Data Analysis, Sensor Fusion, Video Processing, Data Analytics, Data Visualization, Analytics, JavaScript, Plotly, ETL, Spark, Big Data, XGBoost, Pandas, Bash, Data Reporting, Linux, Architecture, Neural Networks, Hardware, Data Processing Automation, Text Analytics, Mathematics, Data ModelingSenior Algorithm Engineer
2014 - 2016Fugro- Developed image analysis software for underwater object detection.
- Processed point cloud data using C++/Python for both underwater objects and above ground.
- Created image classification for a remote sensing LiDAR point cloud using Python running in AWS and Fugro Roames for Ergon.
- Created C++ numerical algorithms for echo sounder calculation using Armadillo C++, OpenBLAS, and algorithms including Kalman filter, LAZ smoothing, and classification.
- Created various Julia and R regression machine learning applications.
Technologies: Amazon Web Services (AWS), Data Engineering, Point Clouds, ArcGIS Runtime SDK for .NET, GIS, SQL, Artificial Intelligence (AI), Statistical Learning, MATLAB, Julia, Esri, Oracle, Microsoft SQL Server, Git, Python, C++, Kalman Filtering, ArcGIS, GPS, Sensor Fusion, Object Detection, Audio, Data Analytics, Data Visualization, Algorithms, JavaScript, XGBoost, Pandas, Bash, Architecture, Mathematics, Data ModelingSenior GIS Developer
2010 - 2014Department of Mines and Petroleum- Developed various GIS software to help surveyors in their work.
- Wrote classification and regression software for a GIS application.
- Developed GeoMap.WA, which is used to display the department's GIS products.
- Developed projects to do point cloud data analysis using ArcGIS software.
- Wrote various SQL server scripts to optimize retrieval of data.
Technologies: ArcPy, .NET, GIS, SQL, Microsoft SQL Server, Microsoft Team Foundation Server, TeamCity, Git, R, Python, GPS, Esri, C#, 3D Image Processing, ArcGIS, ArcSDE, ArcGIS Server, Data Visualization, JavaScript, Sentiment Analysis, Data Processing Automation, Mathematical Analysis, Data ModelingSoftware Engineer
2008 - 2010Western Power- Supported GIS software to show Western Power Assets in Western Australia.
- Created predictive models for wooden pole maintenance and inspection using R.
- Developed and helped in the establishment of the wooden pole serviceability index.
- Wrote classification software for pole top fire prediction and pole serviceability index.
- Wrote Oracle scripts to download data for Oracle reports and optimize database queries.
Technologies: SQL, Java, SourceTree, Oracle, C++, C#, R, Time Series Analysis, Kalman Filtering, Signal Processing, Forecasting, Data Visualization, JavaScriptSenior Software Engineer
2006 - 2008Comsec- Served as a senior software engineer and worked with various kinds of share trading software.
- Developed and designed NAB Equity Lending's online margin lending software.
- Supported various kinds of online trading software and managed funds.
- Created a SQL Server and Oracle database that shared procedures and database optimization.
- Participated in the design of NAB Equity Lending's migration to the Commsec Apollo project.
- Provided bug fixes and problem-solving for issues with trading software.
Technologies: T-SQL, SQL, COM+, ASP, SourceTree, Java, .NET, C++, Quantitative FinanceSenior Software Engineer
2000 - 2006ERG- Wrote Various C++ and Java applications for smart rider ticketing.
- Wrote transaction processing software in C++ to run under Solaris Unix and Windows operating systems.
- Supported existing software and bug fixes, code debugging, code executing speed (profiling), and peer review.
- Generated Oracle reports for Ventura bus travel times, loading, and peak time analysis.
- Optimized various PL and SQL queries for reports using Toad and Oracle query analysis.
Technologies: T-SQL, SQL, PostgreSQL, IBM Rational Rose, Case, C#, Oracle, C++, BashPostdoctoral Research Fellow
1999 - 2000Columbia University, New York- Conducted research on the Columbia Linear Machine (CLM) as a postdoctoral research fellow at Columbia University.
- Supervised PhD and honor students and assisted with lectures and labs.
- Worked with LabVIEW on their National Instrument (NI) products control experiment.
- Looked after the lab supplies and placed orders to maintain operations.
- Wrote signal processing software (filtering) for probe measurements to remove noise.
- Wrote scientific papers and published the results of experiments.
Technologies: NAG Numerical Library, MATLAB, IMSL Numerical Libraries, Fortran, C++, LabVIEW, Time Series Analysis, Signal Processing, Physics Simulations, SimulationsPostdoctoral Research Fellow
1996 - 1999Flinders University of South Australia- Supervised PhD students and honour students during their studies.
- Wrote numerical analysis software for signal processing.
- Helped with lectures and lab tutorials and demonstrations.
- Maintained the lab by ordering supplies and repairs.
- Wrote scientific papers and published results of experiments.
Technologies: NAG Numerical Library, MATLAB, Fortran, LabWindows/CVI, Windows, C++, Signal Processing, Simulations