
Arman A Anwar
Verified Expert in Engineering
Artificial Intelligence (AI) Developer
Fairfax, VA, United States
Toptal member since April 16, 2025
Arman is an AI systems expert with a proven track record of building enterprise-grade intelligent solutions that drive business outcomes. Organizations like DARPA and Samsung have trusted him to solve complex problems and deliver real-world impact. Arman is passionate about next-gen AI, especially systems that demand multi-modal reasoning, precise inference, and knowledge-based thinking.
Portfolio
Experience
- Python 3 - 14 years
- Artificial Intelligence (AI) - 14 years
- Machine Learning - 14 years
- Natural Language Processing (NLP) - 11 years
- Scikit-learn - 9 years
- Large Language Models (LLMs) - 4 years
- Synthetic Data Generation - 4 years
- Retrieval-augmented Generation (RAG) - 2 years
Availability
Preferred Environment
Python 3, LangChain, LangGraph, OpenAI, PyTorch, Artificial Intelligence (AI), Transformers, Retrieval-augmented Generation (RAG), Natural Language Processing (NLP), AI Chatbots, AI Art Generator
The most amazing...
...multi-agent system I've built leveraged OpenAI, ChromaDB, and LangChain to automate drone flight planning with geographic reasoning and mission optimization.
Work Experience
AI Engineer (Contract)
Skyfire AI
- Architected a hybrid multi-agent system for unmanned aerial vehicle (UAV) flight planning via natural language input. Used OpenAI and LangChain for intent parsing to auto-generate mission profiles like surveys, structure scans, and deliveries.
- Implemented a slot-filling tool to extract mission parameters from user input, ensuring structured, complete, and accurate data capture for autonomous UAV task generation and flight planning.
- Integrated retrieval-augmented generation (RAG) to retrieve historical UAV missions, enabling context-aware planning and reuse for similar future operations, improving mission accuracy and reducing planning time.
- Prototyped a constraint-solving tool-calling agent using SciPy to optimize UAV mission allocation, boosting resource efficiency and enabling parallel execution.
- Introduced LLM hallucination mitigation by simulating and validating generated flight plans with bounds checking, ensuring safety and reliability in autonomous UAV operations.
Lead Data Scientist
Wind River Systems
- Built a tech-docs chatbot using LangChain, Milvus, Llama 2, and OpenAI with a Streamlit UI, enabling conversational search over complex enterprise documentation with insight extraction.
- Created a RAG-based chatbot using LangChain, ChromaDB, and Jira data dumps with a Streamlit UI, enabling natural language queries for situational awareness and detecting story standard violations to support Agile workflow analysis and improvement.
- Developed a RAG-based chatbot using LangChain and ChromaDB to help developers convert product specs into epics and stories set by the Work Breakdown Structure (WBS) guidelines.
- Led a team of six colleagues to deliver an AI edge software development kit, prototyping multi-modal data support (image, video, telemetry) and enabling cloud-connected model updates and rapid edge AI development.
- Designed concept demos revealing LLMs' limitations in planning, occlusion reasoning, and basic cognitive tasks, highlighting gaps in spatial awareness and causal inference for embodied AI applications.
Affiliate Faculty in the Information Sciences and Technology Department
George Mason University
- Enhanced researcher NLP pipelines for the Social Media Mining for Health workshop, improving performance, scalability, and modularity using real-world engineering practices.
- Collaborated with George Mason University’s NLP research group to develop a custom language annotation platform for generating high-quality training data for machine learning models.
- Contributed to a commonsense geographic QA system using classical NLP and BERT-based models for spatial and comparative reasoning.
- Built components for query expansion, sub-query decomposition, retrieval, answer ranking, and explanation generation.
Director of Data Science Innovation (Contract)
Grant Thornton
- Led development of GTConfirm, enabling top private equity clients to submit financial data for compliance. Delivered the beta in 90 days through the internal incubator, accelerating the go-to-market for regulatory workflows.
- Leveraged principal component analysis (PCA) and clustering to uncover investment archetypes, enabling expert-driven anomaly detection in cold-start scenarios with no historical baseline.
- Built a CFO's cockpit in Plotly to surface real-time insights and anomaly statuses, adapting dynamically to auto/manual classifications and overrides.
Lead Data Scientist (Contract)
Xululabs
- Led a data science team to launch a quantitative market tech demo in 120 days, with iterative releases every 90 days.
- Co-created an Agile audience modeling platform for over nine million US profiles across 65,000+ features, partnering closely with the launch customer.
- Architected lookalike and purchase propensity models using diverse data sources and a stack including Python, R, Spark, Apache Airflow, and AWS.
- Cut model generation time from over four hours to under five minutes through performance tuning and optimization.
Vice President – Data Science
Time
- Architected ASPEN, a Python/R/Spark-based act-alike modeling platform, replacing a legacy SAS system and boosting digital ad revenue by 4% across five major media brands, including Time, Fortune, and People.
- Led development of MoM, a R/Spark/GraphX-based tool that helped Time editors identify emerging topics before they trended, leveraging NLP-driven tweet clustering, co-occurrence graphs, and inter-topic PageRank to infer narrative momentum.
- Collaborated with Time's investment and M&A groups, delivering due diligence on artificial intelligence and machine learning investment and acquisition targets.
Director of Strategic Analytics
Samsung
- Led the development of marketing data science capabilities on an AWS-based data lake, delivering insights on LTV, segmentation, act-alikes, churn, and basket analysis using Python, R/Spark, Apache Hive, and Amazon S3 (AWS S3).
- Rescued a stalled music recommender MVP by designing a cold-start strategy, leveraging NLP-based feature engineering on song metadata and reviews. Delivered the MVP in under 180 days by substituting listening behavior with content-derived features.
- Architected a Spark-based text processing pipeline to extract sentiment, tags, and themes from music reviews. Combined these features with artist similarity and metadata to build a robust feature matrix powering cold-start recommendations.
Senior Manager – Business Analytics
AIG
- Engineered a unified dataset by integrating survey and HR data using Python, enabling trust and collaboration analysis across 40,000+ staff by cleaning, joining, and enriching records with role, geography, and organizational metadata.
- Applied SNA using Python, NetworkX, and Gephi to map trust and collaboration across 40,000+ employees at AIG, revealing key cross-team relationships and organizational gaps through interactive network visualizations.
- Built graph-based tools for organizational design, enabling interactive visualization and analysis of reporting structures, spans of control, cost, and headcount using large-scale graph reasoning and manipulation.
- Applied LDA topic modeling using NLTK and Gensim to classify free-text responses from an employee engagement survey into structured categories, saving substantial time and cost previously spent on manual coding.
Senior Data Scientist
DeepMile LLC
- Developed advanced social media analytics using NLP techniques, including named entity recognition (NER), sentiment analysis, topic modeling, and text classification, enabling the extraction of structured insights from unstructured social data.
- Implemented influencer detection, co-term graph analysis, psychographic profiling, and message clustering using unsupervised learning methods. These capabilities were adopted by leading digital agencies including Edelman and Publicis.
- Engineered a high-throughput Twitter Decahose pipeline with AWS S3-based storage for scalable ingestion and processing of large-scale social media data. Integrated streaming NLP enrichment to support real-time analysis of unstructured text.
- Developed dynamic visual analytics for Twitter data using D3.js and Gephi's web plugin, enabling interactive exploration of co-hashtag networks, word clouds, and longitudinal sentiment trends.
- Created a composite signal product leveraging the world’s largest DNS query dataset, designed to support high-value use cases in both the investment and cybersecurity industries.
Experience
Voice to Flight Proof of Concept
I architected a hybrid multi-agent system using OpenAI and LangChain to extract structured mission parameters (e.g., location, altitude, and type) via intent parsing and slot-filling. I also integrated OpenStreetMap (OSM) queries to inform planning decisions with real-world terrain, no-fly zones, and structural data. I used retrieval-augmented generation (RAG) to incorporate relevant prior missions into flight plan generation, improving planning efficiency and contextual relevance. In addition, I prototyped a constraint-solving agent using SciPy to optimize UAV allocation and enable parallel mission execution, implementing validation routines to simulate and check generated flight plans for LLM hallucination mitigation and operational safety.
PTSD and TBI Detection from Social Media
I developed a machine learning pipeline capable of extracting and analyzing linguistic patterns indicative of PTSD and TBI from unstructured social media text. The pipeline incorporates expert-labeled training data from clinicians to guide supervised model development and evaluation. I addressed domain-specific challenges such as limited labeled data, informal language, and privacy-preserving data handling. Springer recognized the project outcomes in "Sentiment Analysis for PTSD Signals," highlighting the system's contribution to computational mental health research.
Education
Progress Toward a Master's Degree in Cyber Physical Systems
Georgia Institute of Technology - Atlanta, GA, USA
Graduate Certificate in Data Mining and Applications
Stanford University - Palo Alto, CA, USA
Master's Degree in Computer Science and Artificial Intelligence
George Mason University - Fairfax, VA, USA
Skills
Libraries/APIs
NetworkX, PyTorch, Pandas, NumPy, SpaCy, Hugging Face Transformers, Scikit-learn, Scikit-optimize, Plotly.js, Natural Language Toolkit (NLTK), GraphX, SciPy, Wikipedia API, Stanford NLP, OpenAI API, Spark ML, MLlib, Mahout
Tools
Apache Airflow, Gensim, Named-entity Recognition (NER), Apache OpenNLP, Stanford CoreNLP, Amazon Elastic Container Service (ECS), Docker Compose, Google OR-Tools, MQTT, Whisper
Languages
R, Python 3, Python, SQL, JQL, SPARQL
Paradigms
Anomaly Detection, Synthetic Data Generation
Platforms
Docker, Gephi, Amazon EC2, AWS Lambda, AWS IoT, Amazon Web Services (AWS), Apache Pig, Azure
Storage
PostgreSQL, MySQL, Amazon S3 (AWS S3), Elasticsearch, SQLite, Apache Hive
Frameworks
LangGraph, Streamlit, Spark, Hadoop, Django
Industry Expertise
AI Art Generator
Other
Machine Learning, Artificial Intelligence (AI), Natural Language Processing (NLP), Reinforcement Learning, Statistical Machine Learning, Random Forests, Regression, Large Language Models (LLMs), Clustering, Predictive Modeling, Cyber-Physical Systems, Natural Language Understanding (NLU), Graphical Models, LangChain, ChromaDB, OpenAI, Transformers, Datasets, Chatbots, Information Extraction, Data Science, Data Visualization, gensim, EMR, Neural Networks, ECS, Retrieval-augmented Generation (RAG), Text Classification, BERT, Sentiment Analysis, Link Analysis, DBpedia, Data Synthesis, AI Chatbots, Data Engineering, Statistical Modeling, Forecasting, Predictive Analytics, Time Series Analysis, Fine-tuning, Unsupervised Learning, Supervised Learning, Deep Learning, Minimum Viable Product (MVP), Full-stack, Leadership, Architecture, Vector Databases, Fintech, Prompt Engineering, ChatGPT API, Bayesian Methods, Support Vector Machines (SVM), Optimization, Simulations, PID Controllers, Milvus, Llama 2, Hadoop Nutch, Text to Speech (TTS), Reinforcement Learning from Human Feedback (RLHF), Speech to Text, Hypothesis Testing, Data Mining, Applications
How to Work with Toptal
Toptal matches you directly with global industry experts from our network in hours—not weeks or months.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring