Keyvis Damptey, Machine Learning Developer in Atlanta, GA, United States
Keyvis Damptey

Machine Learning Developer in Atlanta, GA, United States

Member since July 3, 2019
Keyvis uses statistics and mathematics to discover valuable information by identifying, testing, and verifying relationships between all the factors influencing your business. This process unveils the nuances to the "lay of the land" when it comes to the costs, operations, or customer sentiment that your organization must work with. From there, you both can discover the impact of actions and design strategies around your organization's goals.
Keyvis is now available for hire

Portfolio

Experience

  • Machine Learning, 4 years
  • Statistics, 4 years
  • Statistical Methods, 4 years
  • Statistical Modeling, 4 years
  • Text Analytics, 4 years
  • Natural Language Processing (NLP), 4 years
  • Data Science, 4 years
  • Python, 4 years

Location

Atlanta, GA, United States

Availability

Part-time

Preferred Environment

Linux, Python, R, Scala

The most amazing...

...AI I've made automatically discovered interrelated activities from justifications for financial advances. It then predicted the legal risk of those activities.

Employment

  • Data Scientist

    2019 - PRESENT
    Tactical Foresight Consulting, LLC
    • Used Python and R for data collection and statistical modeling, leveraging unsupervised models when labeled data was scarce.
    • Determined and designed technological capabilities, showcasing proof-of-concept (POC) of said capabilities to the client.
    • Created D3.js and Tableau visualizations for clients which reported needs.
    • Built a program to parse court documents to count reference to legislative statues and detect novel combinations of laws.
    • Used Bayesian Networks to visualize the influencers of a ballot measure pass rate.
    • Used NLP to create a graph of activities from scraped data from news articles.
    Technologies: Python, R, JavaScript, D3.js, Neo4j, Hadoop, Spark
  • Data Scientist (Consultant)

    2018 - 2018
    MatchPoint
    • Suggested, created, and tested a framework of unsupervised methods to detect suggested suppliers.
    • Presented results in a clear manner and developed flowcharts of how the system works.
    • Used natural language processing dependency trees to create categorizes as a training set.
    • Extracted useful search features from the text, created classifications for matching and search problems, and worked on experiments which resulted in successful unsupervised matching algorithm with approximately 96% accuracy.
    • Developed metaheuristics for creating and sourcing training datasets.
    Technologies: Python, SQL, Regex
  • Data Scientist

    2017 - 2018
    Systematrix Solutions
    • Used Spark MLlib via PySpark for outlier detection on GraphX RDDs.
    • Presented and coded new algorithms for graph analytics using GraphX and Scala.
    • Used PySpark for fraud analytics on banking records via RDD transformations, filters, and joins.
    • Created, modified, and benchmarked machine-learning algorithms for statistical inference on network properties and money laundering prediction in a Docker container.
    • Routinely provided qualitative insights into upcoming roadblocks to meeting projects and customers needs before it was a noticeable problem.
    • Took the initiative to develop and present data privacy policies, standards, processes, and local and international legal requirements.
    • Translated the fraud investigators' goals to extract essential subgraphs via graph-properties filters and transversals that delivered explicitly fraudulent connections in addition to causing a reduction processing time for analytics.
    • Prescribed a strategic approach to handle changing algorithmic regulations, burst-out-fraud, and take-over-fraud.
    Technologies: Python, SQL, Scala, JavaScript, D3.js, Neo4j, Hadoop, Spark
  • Operational Intelligence Analyst

    2015 - 2017
    Stanford University
    • Used mathematical techniques and fitted statistical models to analyze data related to business problems and visualized the results in Tableau dashboards and Neo4j.
    • Visualized and Identified contextual data that was needed, patterns, summary statistics and trends using (but not limited to): graph analytics, non-parametric ensemble models, Bayesian inference, and natural language processing (NLP).
    • Adjusted the code for multicore parallel processing on computer clusters and used MapReduce functions to aggregate data for customer profile to supplement Neo4j database.
    • Used Cypher (Neo4j QL) to add features such as fund amount to graph database of transactions.
    • Automated a system to categorize any text using an unsupervised model that eliminated the need for manually finding cluster centers or reducing the time to find density parameters.
    • Leveraged glove vectors (or Word2Vec) to classify an activity's risk which was extracted from text using NLP and then modeled their impact as a network/graph.
    • Constructed statistical frameworks and code by utilizing new machine learning programs; I then presented them at conferences and expos.
    • Met with clients and listened to their needs in order to design solutions to those needs.
    • Transferred, aggregated, and updated data on approvers of advances, credit cards, purchase orders, payments, and other financial and banking transactions in NoSQL database (MongoDB) using JavaScript, and Python.
    • Visualized the above-mentioned data in a Tableau dashboard.
    • Collaborated on multiple high-priority projects and made key contributions to the team’s long term strategy meetings.
    • Solved problems with a user-friendly explanation of the methodology and with minimal oversight.
    Technologies: Python, R, Neo4j, SQL, MongoDB, Tableau

Experience

  • Multiproject Visuals (Other amazing things)

    This site has the visualizations from multiple small projects that showcase the breadth of skills I have to offer.

  • Publicly Available Code (Other amazing things)
    https://github.com/quantkeyvis/PublicFiles

    Here is some of the impromptu code that I have published to showcase my ad-hoc coding style.

Skills

  • Languages

    Python, Regex, SQL, R, JavaScript, Cypher
  • Libraries/APIs

    SciPy, NumPy, Pandas, D3.js, GraphX, Spark ML, NLTK, Sklearn, NetworkX, SpaCy
  • Paradigms

    Data Science, Agile Workflow, Parallel Programming
  • Storage

    Neo4j, MongoDB
  • Other

    Natural Language Processing (NLP), Text Analytics, Text Mining, Unsupervised Learning, Statistical Modeling, Statistical Methods, Statistics, Topic Modeling, Analytics, Machine Learning, Agile Data Science, Graph Theory, Nonparametric Statistics, Nonlinear Optimization, Operations Research, Sentiment Analysis
  • Frameworks

    Django, Spark, Hadoop, Flask
  • Tools

    Stanford CoreNLP, Tableau
  • Platforms

    Linux, Docker

Education

  • Bachelor's degree in Industrial Engineering
    2008 - 2013
    University of Central Florida - Orlando, FL, USA

To view more profiles

Join Toptal
I really like this profile
Share it with others