Dmitri Ivanovich Arkhipov
Python Developer
Dmitri has a PhD degree in computer science from UC Irvine, and he's been involved in tech either as a student, freelancer, intern, or employee for over 15 years. Dmitri works primarily in Unix/Linux ecosystems—within which he has developed programs in Python, Java, Scala, C, C++, Perl, JavaScript, and several other languages. His most recent experience has been with Python and JavaScript, but he's willing to adapt.
Portfolio
Experience
Python - 12 years
Java - 10 years
Ubuntu Linux - 8 years
Eclipse IDE - 6 years
Vim Text Editor - 5 years
Pandas - 4 years
Git - 3 years
JavaScript - 2 years
Preferred Environment
Ubuntu Linux, Apache Maven, Git, Eclipse, Vim Text Editor, CentOS, Red Hat Linux
The most amazing...
...project I've worked on was my dissertation on optimally choosing ad request orders to maximize expected return given a time threshold (bounded sub-optimal).
Work Experience
Full-stack Software Engineer | Solutions Engineer Executive Advisor
Anthem
- Developed, tested, and integrated back-end API components for TeleHealth OS, an Anthem project.
- Developed code to connect inbound and outbound phone calls to online video chat rooms.
- Expanded the back-end data model, extended back-end functionality, and applied DB migrations.
- Integrated ICS calendar file event tracking interoperable with all major mail/calendar software.
- Implemented real-time event notifications via WebSockets and server-sent events (HTTP/2).
Senior Optimization Scientist
Ticketmaster, Inc.
- Developed and deployed machine learning models for the PriceMaster dynamic pricing optimization engine. The engine returns on-demand dynamic ticket price recommendations for live event ticket pricing.
- Contributed to a B2B application used by event and venue managers.
- Built and used an ETL data pipeline, along with model-building, result-evaluation, and storage pipelines.
Research Engineer
ChromaCode, Inc.
- Designed, developed, tested, validated, and deployed classification and regression algorithms for qPCR and ddPCR medical diagnostic tests.
- Developed, packaged, versioned, continuously integrated, and released internal- and external-facing research and publication tools.
- Wrote code from scratch and transformed it into cohesive, verifiably tested, and carefully documented modules, packaged and stored in a private repository.
- Guided their deployment and release to tested Docker container constellations executed in AWS ECS.
- Debugged and expanded features of existing modules, taking each change through the same phases described above for new modules.
- Developed the front end in React and Node.js to display the machine learning data to the user.
- Tested the features in Cucumber and implemented unit tests in Mocha.
- Provisioned (using Ansible) a malware scanning-and-alerting system for file uploads to Amazon S3; file upload events trigger a malware AWS Lambda execution and an AWS SNS notification in the case of virus detection.
Data Engineer
Formation, Inc.
- Developed periodic database unloads and ETL transforms for a client's data science ingestion. The software launched multiterabyte data dumps daily to unload data from Amazon Redshift to S3.
- Implemented PySpark jobs and ran them on AWS EMR with Hive and HDFS (running on AWS S3) to perform complex ETL queries on data in Redshift.
- Packaged ETL code into Docker containers stored in AWS ECR, executed in ECS, and triggered by cron events. Docker containers were orchestrated with Docker Compose.
- Enabled continuous integration (CI) support with CircleCI.
- Ensured that ETL deliveries consistently completed within 1.5 hours; previously, they took too long due to forced unloads.
Postdoctoral Researcher
Donald Bren School of Information and Computer Sciences | University of California, Irvine
- Contributed to ongoing research on combinatorial optimization in online advertising.
- Created novel concurrency and interprocess synchronization methods for parallel and distributed computing.
- Developed batch-size selections for stochastic gradient descent.
- Designed spot market clearing mechanisms for transportation spot markets.
- Built an agent-based framework for general distributed computation.
Graduate Student Researcher | Teaching Assistant
Donald Bren School of Information and Computer Sciences | University of California, Irvine
- Worked on several optimization and parallel execution-themed papers.
- Participated in several smart city projects with researchers in China and the UK.
- Developed a mathematical model and analysis of sequential ad polling. Implemented an algorithm using the model that guaranteed bounded sub-optimality (with respect to the expected value given a time budget).
- Assisted in introductory computer programming and discrete mathematics courses. In particular, I have worked on introductory and intermediate Java, C, and Python courses.
- Performed significant research on thread and process synchronization and low-level inter-thread message passing constructs.
- Wrote conference papers, journal papers, and a dissertation with research findings.
Advanced Engineering Intern
CalAmp Wireless, Inc.
- Designed, planned, implemented, and tested a new release of the company’s (M2M/MRM) cloud infrastructure back end.
- Ensured that the release was fault-tolerant, recoverable, and scalable through integration with Amazon Web Services (AWS), and that message-triggered execution was dynamically reconfigurable.
- Implemented and A/B tested solutions based on AWS Kinesis and AWS Simple Queue Service (SQS) to persist and load-balance incoming traffic.
- Worked on a dynamic message router implemented for direct integration into a Spring Integration environment.
- Developed the router component, a meta-construction that allows each Spring Integration component to determine the next procedural step to execute on the current step's output and then route the output to it.
Software Engineering Intern
Adaptive Medias, Inc.
- Developed a heuristic optimization solution for the ad-ordering problem written in Java. The solution was deployed into production for determining advertising waterfalls for users.
- Wrote a RESTful web application in Python Flask to accept web-domain URLs and classify (using MALLET) the semantic content into IAB tier 1 categories. This was deployed into production as a web service and was the central goal of a sprint.
- Worked in Scala with the Cloudera package of Hadoop to write Spark code to perform log aggregation. Terabytes of event logs relating to customer interactions were condensed into lifecycle objects and written to disk.
- Ensured that each of these three projects was released into production as a web microservice, and that each was developed as a Git branch and successfully incorporated into the development branch and subsequently the operating branch.
- Worked with a team to develop and compare alternative mathematical models as part of the classification work.
Web Application Developer
Intellisurvey, Inc.
- Designed and implemented new features and enhancements to the Intellisurvey software (in Perl and mod_perl).
- Found and resolved software defects, handling debugging and feature creation through an in-house ticketing system.
- Assisted in the development of the Intellisurvey infrastructure.
- Created tools to aid in development, testing, and systems administration.
- Provided technical support to internal software users and clients who use licensed Intellisurvey software tools.
- Added UI features in JavaScript to components of the web system's front end.
Research Assistant
University of California, Irvine
- Installed, configured, and populated a PostgreSQL database with heterogeneous data sources; also developed a JSP front end.
- Linked data sources on the basis of common fields and meta information.
- Optimized MATLAB algorithms for dynamic network flow optimization.
- Wrote C functions called from the MEX interface of MATLAB.
- Wrote a highly optimized implementation of Dijkstra's shortest-path algorithm in C and bound it in MATLAB; it was over 100x faster than the MATLAB equivalent.
Experience
Spring Dynamic Message Processor
https://github.com/darkhipo/SpringDynamicMessageProcessor
Spring Integration is a framework for developing message-passing systems. One component not part of the framework is a dynamic router, in which the next hop is determined as a function of the current step; this project implements that component.
Evaluating the Value of Social Media Sentiment
https://github.com/darkhipo/unclean_sentiments
I made sure that the data was conveniently accessible for evaluation. The data is cleaned, ingested into SQLite using Pandas, and made available via a CLI.
Buffered Sort in Kinesis
https://github.com/darkhipo/kinesis-two-phase-sort
Amazon Kinesis Data Streams is used to create data-processing applications, known as Kinesis Data Streams applications. A typical Kinesis Data Streams application reads data from a data stream as data records.
Python Flask Document Categorizer Microservice
I developed and deployed a RESTful microservice using Flask to train and update classifiers that learn from web data and classify any document into a normalized vector representing the theme breakdown of the document into IAB categories. This service used Flask as the REST service layer, the multiprocessing module for process control, MALLET software (Mallet.cs.umass.edu/) for classification, Beautiful Soup for data preprocessing, and Nutch/Solr for crawling and indexing web data.
Ad Ordering with Python
I've developed a suite of algorithms for solving this problem under different assumptions in my work. A version of this work deployed in production uses data aggregation over terabytes of log data to build distributions representing how much time a solicitation will take. The deployment used Flask as a RESTful microservice layer, and Python and SciPy for the main algorithms. More advanced research solutions make use of real-time dynamic programming to achieve solutions with bounded suboptimality.
Remote Object SDN with Python
http://ieeexplore.ieee.org/abstract/document/7218384
To our knowledge, we are the first group to build an SDN over the remote object paradigm. With this approach, the client requests a connection rather than an address from the gateway. The link returned is a remote object shared by both the client and the network: the client uses it for reads and writes, and the network administration reconfigures it as network conditions change.

Maximum Flow and the Linear Assignment Problem
Skills
Languages
Python, Bash, Python 2, Java, Bash Script, Java 7, Python 3, PHP, Pascal, Scheme, CSS, HTML, JavaScript, Perl, C#, C++, C, Scala, Java 8, Assembler x86, VHDL, Tcl, Tcl/Tk
Frameworks
Spring Integration, Spring, Spring Boot, Spark, Scrapy, Hadoop, Jakarta Server Pages (JSP), Apache Spark, Spring JDBC, JUnit, Flask, Django
Libraries/APIs
MALLET, PySpark, Node.js, React, Flask-RESTful, Mod_perl, Apache Lucene, Facebook Open Graph API, JDBC, OpenGL, Pandas, Scikit-learn, Matplotlib
Tools
Apache Maven, GitLab CI/CD, Amazon SageMaker, StatsModels, Domo, MATLAB, Apache Solr, LaTeX, Apache Tomcat, Vim Text Editor, Eclipse IDE, Amazon Simple Queue Service (SQS), Cloudera, ModelSim, PyDev, Scala IDE, Git, PyPI, Docker Compose, Ansible, NGINX
Paradigms
Distributed Computing, Distributed Programming, Parallel Computing, Constraint Programming, Compiler Design, Software-defined Networking (SDN), Linear Programming, Dynamic Programming, Concurrent Programming, Event-driven Programming, Functional Programming, REST
Platforms
Eclipse, Amazon EC2, Nexus, Amazon Web Services (AWS), AWS Lambda, Hortonworks Data Platform (HDP), Red Hat Linux, Ubuntu, Linux, CentOS, Unix, Android, Ubuntu Linux, Docker, Kubernetes
Storage
Databases, Cassandra, MongoDB, Amazon S3 (AWS S3), Spring Data JPA, PostgreSQL, MySQL, Apache Hive
Other
RESTful Web Services, EMR, ECS, Conda, Data Structures, Algorithms, Distributed Systems, Machine Learning, Evolutionary Algorithms, Genetic Algorithms, Operating Systems, Interpreter Design, Computer Graphics, Computer Science, Mixed-integer Linear Programming, Convex Optimization, Optimization, Combinatorics, Mathematical Programming, Transportation & Shipping, Networks, Machine-to-Machine (M2M), Apache Commons, Metaheuristics, Artificial Intelligence (AI), Binary Search Trees, Decision Trees, Mathematical Modeling, Apache Cassandra, Research, Amazon Kinesis, Eclipse CDT, TkInter, PIP, Combinatorial Optimization, Clustering
Education
PhD Degree in Computer Science (Computational Models for Scheduling in Online Advertising)
University of California, Irvine - Irvine, CA, USA
Master's Degree in Computer Science
University of California, Irvine - Irvine, CA, USA
Bachelor's Degree in Information and Computer Science: Specializations in Computer Systems, Distributed Systems (Minor: Mathematics)
University of California, Irvine - Irvine, CA, USA