Scroll To View More
Hire the top 3% of freelance developers
Michael Vu

Michael Vu

Rockville, MD, United States
Member since September 14, 2016
Michael is a database expert with 15 years of experience in database and ETL development in SQL servers. He has extensive experience in data warehouse and ETL design and in development—writing complex SQL queries/stored procedures and performance tuning. He also has 2 years of experience working with IBM Netezza. Michael is a true team player and is perceived as very pleasant to work with.
Michael is now available for hire
Experience
  • SQL, 16 years
  • Microsoft SQL Server, 16 years
  • T-SQL, 16 years
  • Data Warehouse, 10 years
  • ETL, 10 years
  • ETL Implementation & Design, 10 years
  • Netezza, 2 years
Rockville, MD, United States
Availability
Part-time
Preferred Environment
Windows, SSMS, BI Studio, TFS
The most amazing...
...thing I've created was a metadata driven system to aggregate data based on predefined algorithms.
Employment
  • Technical Lead
    American College of Cardiology
    2015 - PRESENT
    • Led the design and development of the second generation of the National Cardiovascular Database Registries (NCDR) data warehouse that aggregates patient data from 6,000 US hospitals and practices on thousands of metrics.
    • Architected the data warehouse system blending industry standards with organization specific requirements.
    • Designed and developed ETL applications, including T-SQL stored procedures/functions and SSIS packages to integrate data into the data warehouse and data marts from transaction sources.
    • Designed and developed the aggregation process to aggregate data and calculate algorithmic metrics.
    • Facilitated the development process by identifying the risks and issues and helping team members solve problems.
    Technologies: SQL Server
  • Lead DBA Consultant
    Computer Science Corporation
    2013 - 2015
    • Participated in the architecting and development of various projects of the NIH’s enterprise data warehouse, which aggregates 20 billion clinical records from 50+ sources.
    • Spearheaded the Common Data Model concept and built the first ETL automation system that allows transformation rules to be defined in the metadata.
    • Developed the extract process to extract and distribute consolidated and de-identified data to NIH institutes and centers.
    • Ensured system optimal performance by using indexes and query tuning, partitioning, and replication.
    Technologies: SQL Server, Netezza
  • Lead DBA
    American College of Cardiology
    2009 - 2012
    • Worked with the product manager to design the data ware house that match requirements, and ensure system is scalable to accommodate future products.
    • Led the team of developers to build the data warehouse and ETL components to feed data into the system.
    • Applied several techniques to help maintain data flow capacity and complete the job within the required time window (midnight-6AM). The techniques include staging of data, delayed processing and batch processing.
    • Developed a new platform that reduced the time to generate outcomes reports from 3 days down to 4 hours.
    Technologies: SQL Server
  • Database Engineer
    Discovery Logic (now Thomson Reuters)
    2007 - 2008
    • Developed and maintained a database of biomedical research and funding that are used as company's core product. The database received feeds from government owned public data sources such as PTO, NFS, NLM, and others.
    • Supported the analytics work by developing queries for data mining to identify trends and research outcomes.
    • Tuned and optimized queries and databases for improved performance. I was able to tune many stored procedures up to 60% faster, and optimize the update process from 9 hours down to 2 hours.
    • Developed functionality to link (with confidence scores) grants, publications, and patents across sources by analyzing author names, grant numbers, and author collaboration trail.
    Technologies: SQL Server
  • SQL Developer
    Datalab USA
    2004 - 2007
    • Developed SQL programs and SSIS packages to import consumer data into the SQL database. Performed merge and purge.
    • Developed programs to compile data for direct marketing campaigns based on selection criteria.
    • Built an integrated reporting system for marketing executives to measure performance of marketing effort in different metrics.
    • Developed a SOAP-based web service to support lead acquisition systems in standardizing postal addresses. The service is used by Laureate Education and City Cards.
    Technologies: SQL Server
  • Bioinformatic Developer
    Delaware Biotechnology Institute
    2001 - 2004
    • Designed databases to store lab data. Developed the ETL programs to load data into the databases.
    • Developed an algorithm for signature extraction from genomic data. Also developed a parallel version to run on the 128-node supercomputer.
    • Developed the language compiler in LIBAN, a data mining tool for analysis on genomic data.
    Technologies: SQL Server, MySQL, Java
Experience
  • Built a Program to Dynamically Aggregate Data (Other amazing things)

    Aggregation is based on metrics that are defined as metadata which dictates data columns and calculation formula (in the form of SQL snippets). The metrics are updated/added on a regular basis, hence the aggregation process needs to dynamically pick the metric definitions and execute at run-time.

  • Optimized a Calculation Process that Reduced Execution Time from 3 Days to 4 Hours (Other amazing things)

    The process is a complex SQL-based computing process that calculates the ranking and percentile of 2,500 US hospitals on 1000+ metrics, based on billion rows of data. I was able to intervene in a few critical areas to minimize the intra-process data movement and change from loop-based (per client) to set based (process all at once).

    The result is that the process time is reduced from 3 days down to 4 hours.

Skills
  • Languages
    SQL, Transact-SQL, T-SQL
  • Paradigms
    ETL, ETL Implementation & Design
  • Storage
    Microsoft SQL Server, SQL Server Integration Services (SSIS), Netezza, PL/SQL
  • Other
    Data Warehouse, Data Warehouse Design
Education
  • Master's degree (MBA) in Business Administration
    University of Massachusetts - Amherst, MA, USA
    2014 - 2017
  • Master's degree in Computer Science
    University of Delaware - Newark, DE, USA
    2002 - 2004
Hire the top 3% of freelance developers
I really like this profile
Share it with others