James Choi, Natural Language Processing (NLP) Developer in Vancouver, BC, Canada
James Choi

Natural Language Processing (NLP) Developer in Vancouver, BC, Canada

Member since October 26, 2018
James enjoys the challenges of extracting value from real-world datasets by wielding the latest machine learning and artificial intelligence tools in the Python stack. He has extensive experience in scientific research in academia and industry and thrives in collaborative teamwork.
James is now available for hire




Vancouver, BC, Canada



Preferred Environment

Amazon Web Services (AWS), MacOS, Linux, Vim Text Editor, Atom, Python, Git

The most amazing...

...thing I've worked on was applying my neuroscience expertise to develop new methods in the characterization of charge-stabilized nanostructures (nanobubbles).


  • Data Scientist

    2017 - PRESENT
    GN Science
    • Created a neural network-based text classifier and text summarizer with transfer learning (using Fast.ai and AWS SageMaker) for a client's proprietary data.
    • Built a data pipeline to scrape (ETL) more than 1 million PDFs from a website to model and rate a financial broker's integrity using the AWS platform (EC2, RDS, DynamoDB).
    • Implemented new clustering algorithms for the identification and characterization of neuronal cell types.
    • Developed and managed digital marketing campaigns (Facebook Ads, Google, AdWords Search/Display/Video), for big-budget (more than $100,000/month) clients.
    • Implemented A/B testing to optimize the customer journey through a conversion funnel.
    Technologies: Amazon SageMaker, Amazon DynamoDB, Amazon EC2, PostgreSQL, MongoDB, Fast.ai, Python
  • Scientist

    2010 - 2013
    Revalesio Corporation
    • Set up electrophysiology laboratory facilities for in-house ion channel screening.
    • Established a semi-automated patch clamp pipeline for membrane-based target screening using proprietary cell lines.
    • Coordinated studies with leading institutions and CRAs to elucidate the biophysical mechanism of action for nanobubble therapeutics.
    • Actively contributed to the physical-chemistry characterization of nanoparticles in solution.
    • Devised new detection methods using nanopores.
    • Successfully detected and characterized cardiomyocyte activity modulation for A/B testing.
    Technologies: MATLAB
  • Postdoctorate Fellow

    2006 - 2007
    Boston Children's Hospital | Harvard Medical School
    • Conducted single-cell-patch-clamp experiments on tissue slices from the mammalian visual thalamus.
    • Devised stereotactic injection techniques for the focalized injection of viral vectors.
    Technologies: MATLAB, Neuroscience


  • Current and Previous Projects in Data Science

    This is a blog that documents all of the projects that I have tackled thus far.


  • Paradigms

    Data Science, CRISP-DM
  • Other

    Software Development, Natural Language Processing (NLP), Artificial Intelligence (AI), Supervised Learning, Machine Learning, Neuroscience, Clustering, Big Data
  • Languages

    Python 3, Python 2, Python, SQL
  • Libraries/APIs

    Pandas, NumPy, Scikit-learn, PySpark, Fast.ai
  • Tools

    GitHub, Git, Atom, Vim Text Editor, MATLAB, Amazon SageMaker, Amazon ECS (Amazon Elastic Container Service), AWS Glue, AWS CLI
  • Platforms

    Amazon EC2, Amazon Web Services (AWS), Linux, MacOS, Jupyter Notebook
  • Storage

    MongoDB, Amazon DynamoDB, Amazon S3 (AWS S3), PostgreSQL
  • Frameworks



  • PhD Degree in Neuroscience
    1998 - 2005
    Brandeis University - Waltham, MA, USA
  • Bachelor of Arts Degree in Biochemistry and Cognitive Science
    1993 - 1998
    Rice University - Houston, TX, USA


  • Data Science
    MARCH 2018 - PRESENT
    International Language Academy of Canada Vancouver‎

To view more profiles

Join Toptal
Share it with others