Pradeep Nalabalapu, Developer in Austin, TX, United States

Pradeep Nalabalapu

Verified Expert in Engineering

Software Developer

Location
Austin, TX, United States
Toptal Member Since
July 22, 2019

Pradeep is a software engineer with experience in data engineering and machine learning. He has recently worked on machine learning and ETL using Python and PySpark on platforms like Cloudera, Databricks, Azure, and AWS. Pradeep also has experience programming in C, C++, Java, Scala, and JavaScript and has several years of semiconductor industry experience.

Portfolio

DeepIQ
Cloudera, Scikit-learn, TensorFlow, PySpark
MapR Technologies
Java, Docker, TensorFlow, Spark, Python
Self-employed
PostgreSQL, Python, C++

Availability

Full-time

Preferred Environment

C++, C, Python, MacOS, Linux

The most amazing...

...thing I've done was to train a neural network to detect a trigger word in audio samples.

Work Experience

Lead Data Scientist

2019 - PRESENT
DeepIQ
  • Developed the back end of the in-house ETL software, using PySpark.
  • Constructed ML models for predicting oil resource density based on location.
  • Created an ETL solution that runs on AWS Glue for a retail client to process transaction data; also imported custom PySpark scripts into Glue.
Technologies: Cloudera, Scikit-learn, TensorFlow, PySpark

Senior Machine Learning Engineer

2018 - PRESENT
MapR Technologies
  • Worked on an ETL solution for a retail client (Chico's).
  • Developed with Java and used MapR Streams and MapR-DB.
  • Built a machine learning deployment demo; worked on both the React front end and Scala/Spark web-server back end.
  • Constructed a generic item-similarity-based recommendation engine; used the Spark engine with Scala.
  • Created a proof of concept (POC) for scaling a distributed app using Docker and Kubernetes.
Technologies: Java, Docker, TensorFlow, Spark, Python

Software Developer

2017 - 2017
Self-employed
  • Developed C++ software to capture data from a bedside patient monitor (Philips MP70).
Technologies: PostgreSQL, Python, C++

Consultant

2017 - 2017
Clarity Insights
  • Primarily worked on ETL for a new data warehousing solution being developed on AWS.
  • Developed PySpark programs to clean up and extract columns from existing in-house data sources.
  • Assisted the data architect in refining the schema and defining transformations from existing data columns.
Technologies: Redshift, Spark, Python

Data Scientist

2014 - 2016
Clemetric
  • Extracted aggregates and statistics from large volumes of insurance claims data (Python and Apache Spark).
  • Developed machine learning models for health insurance claims data (Python Scikit-learn, NumPy, and Pandas).
  • Performed ETL on health insurance claims data, using Python and PostgreSQL.
  • Developed SQL stored procedures.
  • Built the back end for a web app that served a data stream.
  • Developed an API using Node.js and Python.
  • Created a simple front end and charts to display patient vitals (AngularJS, Highcharts, and D3.js).
  • Developed C/C++ implementations of medical data analysis algorithms (MATLAB).
Technologies: AngularJS, Node.js, PostgreSQL, C++, C, Spark, Pandas, NumPy, Scikit-learn, Python

Staff Verification Engineer

2012 - 2014
Qualcomm, Inc.
  • Worked primarily as a hardware verification engineer, developing test benches and test suites to verify hardware blocks in a video decoder.
  • Built an interface between the C++-based system model and the OVM-based test bench.
Technologies: C++, Verilog

Member of Technical Staff

2004 - 2012
Ambarella Corporation
  • Worked on various stages of hardware verification at Ambarella.
  • Verified the memory subsystem.
  • Developed the DRAM controller programming portion of the BIOS and performed DRAM controller bring-up in the lab.
  • Built C++ system models that were part of the software model of the chip.
Technologies: ASM, C++, SystemVerilog

My GitHub Page

https://github.com/pnalaba
I have worked on various proof-of-concept projects and have saved the code in my GitHub repositories.

Languages

Python, C++, Java, Verilog, SystemVerilog, C, Java 8, JavaScript

Libraries/APIs

PySpark, Spark ML, NumPy, Pandas, Scikit-learn, TensorFlow, PyTorch, Keras, Node.js, React

Other

Data Engineering, Machine Learning, Software Development, Back-end Development, Web Development

Frameworks

AngularJS, ASM, Spark

Tools

LaTeX, Cloudera, Amazon Elastic MapReduce (EMR)

Platforms

MacOS, Docker, Linux, MapR, Google Cloud Platform (GCP), Amazon Web Services (AWS)

Storage

Redshift, PostgreSQL, Amazon S3 (AWS S3), MapR-DB

Education

2000 - 2003

Master of Science (MSc) Degree in Computer Engineering

Clemson University - Clemson, SC, USA

1996 - 2000

Bachelor of Technology Degree in Electrical Engineering

Indian Institute of Technology Madras - Chennai, India

Certifications

MAY 2019 - PRESENT

End-to-end Machine Learning with TensorFlow on GCP

Coursera

MAY 2019 - PRESENT

Production Machine Learning Systems

Coursera

AUGUST 2018 - PRESENT

Machine Learning with TensorFlow on the Google Cloud Platform Specialization

Coursera

APRIL 2018 - PRESENT

Sequence Models

Coursera

APRIL 2018 - PRESENT

Convolutional Neural Networks

Coursera

APRIL 2018 - PRESENT

Deep Learning Specialization

Coursera
