Arman A Anwar, Developer in Fairfax, VA, United States
Arman is available for hire
Hire Arman

Arman A Anwar

Verified Expert  in Engineering

Bio

Arman is an AI systems expert with a proven track record of building enterprise-grade intelligent solutions that drive business outcomes. Organizations like DARPA and Samsung have trusted him to solve complex problems and deliver real-world impact. Arman is passionate about next-gen AI, especially systems that demand multi-modal reasoning, precise inference, and knowledge-based thinking.

Portfolio

Skyfire AI
Python 3, OpenAI, Chatbots, ChromaDB, Scikit-learn, SPARQL, PostgreSQL...
Wind River Systems
Python 3, ChromaDB, Milvus, PostgreSQL, Docker, Llama 2, OpenAI, Streamlit...
George Mason University
Natural Language Processing (NLP), PyTorch, Natural Language Toolkit (NLTK)...

Experience

  • Python 3 - 14 years
  • Artificial Intelligence (AI) - 14 years
  • Machine Learning - 14 years
  • Natural Language Processing (NLP) - 11 years
  • Scikit-learn - 9 years
  • Large Language Models (LLMs) - 4 years
  • Synthetic Data Generation - 4 years
  • Retrieval-augmented Generation (RAG) - 2 years

Availability

Full-time

Preferred Environment

Python 3, LangChain, LangGraph, OpenAI, PyTorch, Artificial Intelligence (AI), Transformers, Retrieval-augmented Generation (RAG), Natural Language Processing (NLP), AI Chatbots, AI Art Generator

The most amazing...

...multi-agent system I've built leveraged OpenAI, ChromaDB, and LangChain to automate drone flight planning with geographic reasoning and mission optimization.

Work Experience

AI Engineer (Contract)

2024 - PRESENT
Skyfire AI
  • Architected a hybrid multi-agent system for unmanned aerial vehicle (UAV) flight planning via natural language input. Used OpenAI and LangChain for intent parsing to auto-generate mission profiles like surveys, structure scans, and deliveries.
  • Implemented a slot-filling tool to extract mission parameters from user input, ensuring structured, complete, and accurate data capture for autonomous UAV task generation and flight planning.
  • Integrated retrieval-augmented generation (RAG) to retrieve historical UAV missions, enabling context-aware planning and reuse for similar future operations, improving mission accuracy and reducing planning time.
  • Prototyped a constraint-solving tool-calling agent using SciPy to optimize UAV mission allocation, boosting resource efficiency and enabling parallel execution.
  • Introduced LLM hallucination mitigation by simulating and validating generated flight plans with bounds checking, ensuring safety and reliability in autonomous UAV operations.
Technologies: Python 3, OpenAI, Chatbots, ChromaDB, Scikit-learn, SPARQL, PostgreSQL, LangChain, LangGraph, Scikit-optimize, Pandas, NumPy, Transformers, Retrieval-augmented Generation (RAG), Natural Language Processing (NLP), Data Synthesis, Hugging Face Transformers, PyTorch, AI Chatbots, Amazon EC2, AWS Lambda, Amazon Elastic Container Service (ECS), Docker, Amazon S3 (AWS S3), Docker Compose, OpenAI API, AI Art Generator, SQLite, Text to Speech (TTS), SQL, Whisper, Fine-tuning, Unsupervised Learning, Reinforcement Learning from Human Feedback (RLHF), Supervised Learning, Speech to Text, Deep Learning, Minimum Viable Product (MVP), Full-stack, Amazon Web Services (AWS), Prompt Engineering, ChatGPT API

Lead Data Scientist

2021 - 2023
Wind River Systems
  • Built a tech-docs chatbot using LangChain, Milvus, Llama 2, and OpenAI with a Streamlit UI, enabling conversational search over complex enterprise documentation with insight extraction.
  • Created a RAG-based chatbot using LangChain, ChromaDB, and Jira data dumps with a Streamlit UI, enabling natural language queries for situational awareness and detecting story standard violations to support Agile workflow analysis and improvement.
  • Developed a RAG-based chatbot using LangChain and ChromaDB to help developers convert product specs into epics and stories set by the Work Breakdown Structure (WBS) guidelines.
  • Led a team of six colleagues to deliver an AI edge software development kit, prototyping multi-modal data support (image, video, telemetry) and enabling cloud-connected model updates and rapid edge AI development.
  • Designed concept demos revealing LLMs' limitations in planning, occlusion reasoning, and basic cognitive tasks, highlighting gaps in spatial awareness and causal inference for embodied AI applications.
Technologies: Python 3, ChromaDB, Milvus, PostgreSQL, Docker, Llama 2, OpenAI, Streamlit, Apache Airflow, SpaCy, Hugging Face Transformers, Datasets, Pandas, NumPy, JQL, Gensim, Scikit-learn, Artificial Intelligence (AI), Clustering, Anomaly Detection, LangChain, PyTorch, AI Chatbots, Natural Language Processing (NLP), Retrieval-augmented Generation (RAG), Transformers, Amazon EC2, Amazon S3 (AWS S3), AWS IoT, MQTT, Docker Compose, OpenAI API, AI Art Generator, SQLite, SQL, Statistical Modeling, Predictive Analytics, Fine-tuning, Unsupervised Learning, Reinforcement Learning from Human Feedback (RLHF), Supervised Learning, Deep Learning, Minimum Viable Product (MVP), Full-stack, Amazon Web Services (AWS), Azure, Prompt Engineering, ChatGPT API

Affiliate Faculty in the Information Sciences and Technology Department

2021 - 2022
George Mason University
  • Enhanced researcher NLP pipelines for the Social Media Mining for Health workshop, improving performance, scalability, and modularity using real-world engineering practices.
  • Collaborated with George Mason University’s NLP research group to develop a custom language annotation platform for generating high-quality training data for machine learning models.
  • Contributed to a commonsense geographic QA system using classical NLP and BERT-based models for spatial and comparative reasoning.
  • Built components for query expansion, sub-query decomposition, retrieval, answer ranking, and explanation generation.
Technologies: Natural Language Processing (NLP), PyTorch, Natural Language Toolkit (NLTK), SpaCy, Pandas, BERT, Python 3, NumPy, Scikit-learn, Stanford CoreNLP, Sentiment Analysis, Named-entity Recognition (NER), Link Analysis, Clustering, Artificial Intelligence (AI), DBpedia, Wikipedia API, AI Chatbots, Stanford NLP, Docker, Docker Compose, SQLite, SQL, Statistical Modeling, Fine-tuning, Unsupervised Learning, Reinforcement Learning from Human Feedback (RLHF), Supervised Learning, Deep Learning, Full-stack, Amazon Web Services (AWS)

Director of Data Science Innovation (Contract)

2018 - 2021
Grant Thornton
  • Led development of GTConfirm, enabling top private equity clients to submit financial data for compliance. Delivered the beta in 90 days through the internal incubator, accelerating the go-to-market for regulatory workflows.
  • Leveraged principal component analysis (PCA) and clustering to uncover investment archetypes, enabling expert-driven anomaly detection in cold-start scenarios with no historical baseline.
  • Built a CFO's cockpit in Plotly to surface real-time insights and anomaly statuses, adapting dynamically to auto/manual classifications and overrides.
Technologies: Information Extraction, Named-entity Recognition (NER), Apache Airflow, Data Science, Plotly.js, Data Visualization, Anomaly Detection, Clustering, Python 3, Scikit-learn, Pandas, NumPy, Natural Language Toolkit (NLTK), SpaCy, Natural Language Processing (NLP), SQLite, SQL, Data Engineering, Statistical Modeling, Forecasting, Predictive Analytics, Time Series Analysis, Unsupervised Learning, Supervised Learning, Minimum Viable Product (MVP), Full-stack, Leadership, Amazon Web Services (AWS), Azure, Fintech

Lead Data Scientist (Contract)

2016 - 2018
Xululabs
  • Led a data science team to launch a quantitative market tech demo in 120 days, with iterative releases every 90 days.
  • Co-created an Agile audience modeling platform for over nine million US profiles across 65,000+ features, partnering closely with the launch customer.
  • Architected lookalike and purchase propensity models using diverse data sources and a stack including Python, R, Spark, Apache Airflow, and AWS.
  • Cut model generation time from over four hours to under five minutes through performance tuning and optimization.
Technologies: Python 3, R, Spark, Spark ML, Apache Airflow, Amazon EC2, Amazon S3 (AWS S3), SQLite, SQL, Data Engineering, Statistical Modeling, Forecasting, Predictive Analytics, Time Series Analysis, Unsupervised Learning, Supervised Learning, Deep Learning, Minimum Viable Product (MVP), Full-stack, Amazon Web Services (AWS), Fintech

Vice President – Data Science

2015 - 2016
Time
  • Architected ASPEN, a Python/R/Spark-based act-alike modeling platform, replacing a legacy SAS system and boosting digital ad revenue by 4% across five major media brands, including Time, Fortune, and People.
  • Led development of MoM, a R/Spark/GraphX-based tool that helped Time editors identify emerging topics before they trended, leveraging NLP-driven tweet clustering, co-occurrence graphs, and inter-topic PageRank to infer narrative momentum.
  • Collaborated with Time's investment and M&A groups, delivering due diligence on artificial intelligence and machine learning investment and acquisition targets.
Technologies: R, Spark, Spark ML, Natural Language Toolkit (NLTK), Gensim, gensim, Python 3, MySQL, NetworkX, GraphX, MLlib, Amazon EC2, Amazon S3 (AWS S3), Docker, Natural Language Processing (NLP), SQL, Data Engineering, Statistical Modeling, Forecasting, Predictive Analytics, Time Series Analysis, Unsupervised Learning, Supervised Learning, Deep Learning, Minimum Viable Product (MVP), Full-stack, Leadership, Amazon Web Services (AWS)

Director of Strategic Analytics

2014 - 2015
Samsung
  • Led the development of marketing data science capabilities on an AWS-based data lake, delivering insights on LTV, segmentation, act-alikes, churn, and basket analysis using Python, R/Spark, Apache Hive, and Amazon S3 (AWS S3).
  • Rescued a stalled music recommender MVP by designing a cold-start strategy, leveraging NLP-based feature engineering on song metadata and reviews. Delivered the MVP in under 180 days by substituting listening behavior with content-derived features.
  • Architected a Spark-based text processing pipeline to extract sentiment, tags, and themes from music reviews. Combined these features with artist similarity and metadata to build a robust feature matrix powering cold-start recommendations.
Technologies: R, Spark ML, Natural Language Toolkit (NLTK), Gensim, Python 3, Apache Hive, Amazon S3 (AWS S3), EMR, Anomaly Detection, MLlib, Natural Language Processing (NLP), SQL, Data Engineering, Statistical Modeling, Forecasting, Predictive Analytics, Time Series Analysis, Unsupervised Learning, Supervised Learning, Minimum Viable Product (MVP), Full-stack, Leadership, Amazon Web Services (AWS)

Senior Manager – Business Analytics

2013 - 2014
AIG
  • Engineered a unified dataset by integrating survey and HR data using Python, enabling trust and collaboration analysis across 40,000+ staff by cleaning, joining, and enriching records with role, geography, and organizational metadata.
  • Applied SNA using Python, NetworkX, and Gephi to map trust and collaboration across 40,000+ employees at AIG, revealing key cross-team relationships and organizational gaps through interactive network visualizations.
  • Built graph-based tools for organizational design, enabling interactive visualization and analysis of reporting structures, spans of control, cost, and headcount using large-scale graph reasoning and manipulation.
  • Applied LDA topic modeling using NLTK and Gensim to classify free-text responses from an employee engagement survey into structured categories, saving substantial time and cost previously spent on manual coding.
Technologies: Python, NetworkX, R, Natural Language Toolkit (NLTK), Gephi, Natural Language Processing (NLP), SQL, Data Engineering, Statistical Modeling, Forecasting, Predictive Analytics, Time Series Analysis, Unsupervised Learning, Supervised Learning, Amazon Web Services (AWS)

Senior Data Scientist

2011 - 2013
DeepMile LLC
  • Developed advanced social media analytics using NLP techniques, including named entity recognition (NER), sentiment analysis, topic modeling, and text classification, enabling the extraction of structured insights from unstructured social data.
  • Implemented influencer detection, co-term graph analysis, psychographic profiling, and message clustering using unsupervised learning methods. These capabilities were adopted by leading digital agencies including Edelman and Publicis.
  • Engineered a high-throughput Twitter Decahose pipeline with AWS S3-based storage for scalable ingestion and processing of large-scale social media data. Integrated streaming NLP enrichment to support real-time analysis of unstructured text.
  • Developed dynamic visual analytics for Twitter data using D3.js and Gephi's web plugin, enabling interactive exploration of co-hashtag networks, word clouds, and longitudinal sentiment trends.
  • Created a composite signal product leveraging the world’s largest DNS query dataset, designed to support high-value use cases in both the investment and cybersecurity industries.
Technologies: R, Python, Natural Language Toolkit (NLTK), Pandas, Apache Hive, Hadoop, Mahout, Apache Pig, NetworkX, Apache OpenNLP, Amazon EC2, Natural Language Processing (NLP), SQLite, SQL, Data Engineering, Statistical Modeling, Forecasting, Predictive Analytics, Time Series Analysis, Unsupervised Learning, Supervised Learning, Minimum Viable Product (MVP), Full-stack, Amazon Web Services (AWS)

Experience

Voice to Flight Proof of Concept

This is a proof-of-concept system that generates unmanned aerial vehicles (UAV) flight plans from natural language input. It requires advanced reasoning to produce legal, safe, and Part 107-compliant missions. Unlike typical chat-based automation, this system handles complex mission planning with regulatory, geographic, and operational constraints.

I architected a hybrid multi-agent system using OpenAI and LangChain to extract structured mission parameters (e.g., location, altitude, and type) via intent parsing and slot-filling. I also integrated OpenStreetMap (OSM) queries to inform planning decisions with real-world terrain, no-fly zones, and structural data. I used retrieval-augmented generation (RAG) to incorporate relevant prior missions into flight plan generation, improving planning efficiency and contextual relevance. In addition, I prototyped a constraint-solving agent using SciPy to optimize UAV allocation and enable parallel mission execution, implementing validation routines to simulate and check generated flight plans for LLM hallucination mitigation and operational safety.

PTSD and TBI Detection from Social Media

A DARPA-funded project to explore early detection of PTSD and traumatic brain injury (TBI) in veterans through analysis of social media content. The system was designed to learn directly from training inputs provided by medical professionals, enabling adaptive screening and potential early intervention strategies.

I developed a machine learning pipeline capable of extracting and analyzing linguistic patterns indicative of PTSD and TBI from unstructured social media text. The pipeline incorporates expert-labeled training data from clinicians to guide supervised model development and evaluation. I addressed domain-specific challenges such as limited labeled data, informal language, and privacy-preserving data handling. Springer recognized the project outcomes in "Sentiment Analysis for PTSD Signals," highlighting the system's contribution to computational mental health research.

Education

2021 - 2025

Progress Toward a Master's Degree in Cyber Physical Systems

Georgia Institute of Technology - Atlanta, GA, USA

2015 - 2018

Graduate Certificate in Data Mining and Applications

Stanford University - Palo Alto, CA, USA

2006 - 2008

Master's Degree in Computer Science and Artificial Intelligence

George Mason University - Fairfax, VA, USA

Skills

Libraries/APIs

NetworkX, PyTorch, Pandas, NumPy, SpaCy, Hugging Face Transformers, Scikit-learn, Scikit-optimize, Plotly.js, Natural Language Toolkit (NLTK), GraphX, SciPy, Wikipedia API, Stanford NLP, OpenAI API, Spark ML, MLlib, Mahout

Tools

Apache Airflow, Gensim, Named-entity Recognition (NER), Apache OpenNLP, Stanford CoreNLP, Amazon Elastic Container Service (ECS), Docker Compose, Google OR-Tools, MQTT, Whisper

Languages

R, Python 3, Python, SQL, JQL, SPARQL

Paradigms

Anomaly Detection, Synthetic Data Generation

Platforms

Docker, Gephi, Amazon EC2, AWS Lambda, AWS IoT, Amazon Web Services (AWS), Apache Pig, Azure

Storage

PostgreSQL, MySQL, Amazon S3 (AWS S3), Elasticsearch, SQLite, Apache Hive

Frameworks

LangGraph, Streamlit, Spark, Hadoop, Django

Industry Expertise

AI Art Generator

Other

Machine Learning, Artificial Intelligence (AI), Natural Language Processing (NLP), Reinforcement Learning, Statistical Machine Learning, Random Forests, Regression, Large Language Models (LLMs), Clustering, Predictive Modeling, Cyber-Physical Systems, Natural Language Understanding (NLU), Graphical Models, LangChain, ChromaDB, OpenAI, Transformers, Datasets, Chatbots, Information Extraction, Data Science, Data Visualization, gensim, EMR, Neural Networks, ECS, Retrieval-augmented Generation (RAG), Text Classification, BERT, Sentiment Analysis, Link Analysis, DBpedia, Data Synthesis, AI Chatbots, Data Engineering, Statistical Modeling, Forecasting, Predictive Analytics, Time Series Analysis, Fine-tuning, Unsupervised Learning, Supervised Learning, Deep Learning, Minimum Viable Product (MVP), Full-stack, Leadership, Architecture, Vector Databases, Fintech, Prompt Engineering, ChatGPT API, Bayesian Methods, Support Vector Machines (SVM), Optimization, Simulations, PID Controllers, Milvus, Llama 2, Hadoop Nutch, Text to Speech (TTS), Reinforcement Learning from Human Feedback (RLHF), Speech to Text, Hypothesis Testing, Data Mining, Applications

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring