Steven Nicolaou, Developer in Boston, MA, United States
Steven is available for hire
Hire Steven

Steven Nicolaou

Verified Expert  in Engineering

Data Scientist and Developer

Location
Boston, MA, United States
Toptal Member Since
June 8, 2019

Steven is a C# and Python developer, data scientist, search engineer, strong communicator, and creative thinker with 15 years of consulting experience in enterprise search and big data. He's focused on original and innovative solutions and has architected large search implementations across industries, including BP, Best Buy, Merck, McGraw Hill, Barnes & Noble, Goldman Sachs, Ernst & Young, the US Department of Justice, Dell, Xerox, and Kraft.

Portfolio

Thermo Fisher Scientific
Python, Scikit-learn, Spark, Solr, Amazon RDS, Apache Airflow, Pandas...
LinearBee
Elasticsearch, Spark, Carrot2, Pandas, Scikit-learn, Python, Python 3...
Microsoft Corporation
Microsoft SQL Server, Windows PowerShell, C#, SharePoint, Search, Enterprise...

Experience

Availability

Part-time

Preferred Environment

PyCharm, Visual Studio, Windows

The most amazing...

...solo project I've built is a rule-based data cleansing and normalization pipeline for a Fortune 500's eCommerce website.

Work Experience

Data Scientist

2018 - PRESENT
Thermo Fisher Scientific
  • Developed an app to track content translations and automatically trigger re-translation for modified content worldwide.
  • Built a general-purpose ensemble NLP classifier with a variety of internal applications.
  • Building an enterprise-wide knowledge graph for eCommerce.
  • Built Lucidworks global synonym consolidation workflow to merge implicit relationships, eliminate duplication and streamline definitions.
Technologies: Python, Scikit-learn, Spark, Solr, Amazon RDS, Apache Airflow, Pandas, SQLAlchemy, Redshift, Amazon Athena, Amazon EC2, Scrapy, NumPy, SciPy, Python 3, Windows, Linux, Data Science, SQL, Agile, Development, Apache Solr, Git, Machine Learning, Algorithms

Data Scientist

2018 - PRESENT
LinearBee
  • Built a data cleansing and normalization pipeline for a Fortune 500 company's entire eCommerce catalog using Pandas and Spark.
  • Built an equivalent product matcher for two global regions using Whoosh.
  • Built customer feedback topic clusters for executive action using Pandas and Carrot2.
  • Built an SEO category landing page tagger using product metadata.
  • Built a product taxonomy classifier for products with missing taxonomy tags using Scikit-learn, Pandas, and Solr.
Technologies: Elasticsearch, Spark, Carrot2, Pandas, Scikit-learn, Python, Python 3, Data Science, SQL, Agile, Development, Apache Solr, Git, Machine Learning, Algorithms

Principal Consultant

2008 - 2015
Microsoft Corporation
  • Architected global enterprise search deployments for major corporations using SharePoint and FAST.
  • Led relevance tuning and conducted customer education workshops.
  • Developed the FAST ImPulse eCommerce search engine.
  • Served as the team lead on many enterprise search deployments.
  • Helped a customer build an innovative chemical search by drawing chemicals.
Technologies: Microsoft SQL Server, Windows PowerShell, C#, SharePoint, Search, Enterprise, Python 3, Azure, SQL, Agile, Development, Algorithms

Architect

2003 - 2008
FAST Search
  • Architected many enterprise search deployments for customers.
  • Assisted with development, product management, analysis, deployment, and support for FAST Impulse eCommerce search engine.
  • Consulted on relevance tuning, scaling, and data pipelines.
Technologies: PHP, C#, Python, SQL, Agile, Development, Algorithms

Cyprus Covid-19 Dashboard

A live, interactive COVID-19 dashboard to help citizens analyze and understand the epidemiological big picture with simple, targeted visualizations and concise explanations. High-resolution graphs and content were available in English and Greek.

Whoosh

https://github.com/whoosh-community/whoosh
Contributor to Whoosh, a fast, featureful full-text indexing and searching library implemented in pure Python. Provided community support on Github Issues and stackoverflow.com. Promoted Whoosh in several projects.

FizzBuzz

https://github.com/stevennic/FizzBuzz-Python
Customers often find it useful to see an implementation of FizzBuzz to assess coding skills.

Languages

Python, Python 3, SQL, C#, PHP

Libraries/APIs

Pandas, Scikit-learn, SQLAlchemy, NumPy, Carrot2, SciPy

Tools

Visual Studio, PyCharm, Solr, Git, Apache Solr, Apache Airflow, Amazon Athena, Plotly

Other

Search, Development, Amazon RDS, Machine Learning, Algorithms, Enterprise, Dashboards

Frameworks

Scrapy, Streamlit, Windows PowerShell, Spark

Paradigms

Data Science, Agile, REST

Platforms

Windows, Amazon EC2, SharePoint, Linux, Azure

Storage

Microsoft SQL Server, Elasticsearch, Redshift

2007 - 2009

Master of Business Administration Degree in Entrepreneurship

Babson College - Wellesley, MA

2001 - 2003

Bachelor of Science Degree in Computer Science

Wentworth Institute of Technology - Boston, MA

1995 - 1998

Diploma in Computer Science

Cyprus University of Technology - Nicosia, Cyprus

APRIL 2015 - PRESENT

Microsoft Specialist: Azure Solutions

Microsoft

JUNE 2011 - PRESENT

Microsoft Certified Technology Specialist: SharePoint 2010, Configuration

Microsoft

MAY 2011 - PRESENT

Microsoft Certified Technology Specialist: SharePoint 2010, Application Development

Microsoft

Collaboration That Works

How to Work with Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

Share your needs

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

Choose your talent

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring