George McIntire
Verified Expert in Engineering
Data Scientist and Developer
Berkeley, CA, United States
Toptal member since January 31, 2024
George is a results-driven data scientist who brings a diverse skill set and rich experience to the table. His expertise lies in data translation and a versatile toolkit, including Python, SQL, and machine learning. George excels at distilling complex findings into actionable and comprehensible insights, making data accessible and impactful for stakeholders.
Preferred Environment
Jupyter Notebook, Python 3, Google BigQuery, GitHub, Amazon SageMaker, Amazon Web Services (AWS), SQL
The most amazing...
...solution I've created is a text classification and quote extraction pipeline for a nonprofit focused on analyzing how the media quotes men as opposed to women.
Work Experience
AI & Data Science Consultant
Self-employed
- Worked as the lead LLM developer for an app that uses a ChatGPT-powered LLM to generate custom infographics. I created a vector database for retrieval-augmented generation (RAG), and used Amazon DynamoDB to store user conversation history.
- Consulted for a project that explores and tests data science methodologies for use in the legal profession. I fine-tuned and adapted ChatGPT to extract and analyze relevant information from case opinions to aid lawyers.
- Leveraged unsupervised learning and sentence embeddings to analyze the survey results of healthcare workers.
- Created an MVP of an AI recipe recommendation app, using role-and-goal and few-shot prompting with GPT-4 for prompt engineering. Also built a RAG database with Qdrant, populated with recipes and ingredient nutrient data.
Data Scientist / ML Engineer
- Conducted exploratory data analysis using BigQuery and named entity recognition on millions of tweets reported by users for perceived terms of service violations.
- Used NetworkX and Neo4j to detect networks of users who coordinated their reports to maliciously target other users for banning.
- Built an interactive dashboard of the results using Looker Studio, which Twitter's health data science team used to decide how to allocate resources for combating malicious behavior.
Data Visualization Analyst
Callisto Media
- Conducted data research projects using exploratory data analysis, statistical analysis, and machine learning.
- Partnered with the marketing team to build an interactive dashboard using Plotly's Dash tool, allowing them to visualize important KPIs for various campaigns easily.
- Designed a word-similarity mechanism that outputs a score that evaluates how similar two Amazon key phrases are to one another, helping the company evaluate the Amazon book market to decide which types of books to publish.
- Used Word2Vec to automate a process that matches Amazon search terms with their appropriate categories designated by the Callisto taxonomy—a vital project to the company, given its reliance on Amazon search data.
Experience
DataJockey
https://github.com/GeorgeMcIntire/DataJockey
Protect Nil LLM
Gender Representation and Opinion Detection in the Media
https://www.ischool.berkeley.edu/projects/2022/gender-representation-and-opinion-detection-media
My role was to train a subjectivity text classification model and to mine patterns in the quotes extracted from the article dataset.
Education
Master's Degree in Information Systems
UC Berkeley School of Information - Berkeley, CA, USA
Bachelor's Degree in Economics
Occidental College - Los Angeles, CA, USA
Skills
Libraries/APIs
Pandas, Scikit-learn, NumPy, PyTorch, TensorFlow, SpaCy, OpenAI API
Tools
ChatGPT, Plotly, GitHub, Amazon SageMaker, BigQuery
Languages
SQL, Python, Python 3, R
Frameworks
Streamlit
Storage
Databases, PostgreSQL, Amazon DynamoDB, Amazon S3 (AWS S3), Neo4j
Platforms
Jupyter Notebook, Amazon Web Services (AWS)
Other
Machine Learning, Natural Language Processing (NLP), Data Analysis, Web Scraping, Data Visualization, Artificial Intelligence (AI), Data Science, Prompt Engineering, Clustering, Data Cleansing, OpenAI, Data Mining, Data Scraping, Dashboards, Data Analytics, Supervised Learning, ChatGPT Prompts, English, Google BigQuery, Writing & Editing, Social Network Analysis, Causal Inference, Surveying, Large Language Models (LLMs), Predictive Modeling, Deep Learning, Neural Networks, Generative Pre-trained Transformers (GPT), OpenAI GPT-4 API, Chatbots, Dash, LangChain, Pinecone, Regression Modeling, Text Recognition, Labeling, Blogging, Technical Writing, Content Writing, Recurrent Neural Networks (RNNs), Convolutional Neural Networks (CNNs), Transformers, OpenAI GPT-3 API, Retrieval-augmented Generation (RAG), Churn Analysis, Hugging Face, MLflow, Critical Thinking, Research, Information Systems, Economics, Amazon RDS, Legal, Text Classification