Hi, I'm Sang Vo

I'm a San Francisco Bay Area based aspiring data scientist / engineer, statistical thinker, and analytical storyteller.

On my spare time, I like to travel, workout, and try new foods.

Projects

Intermediate-Advance SQL Queries

A collection of SQL queries that I created.
PostgreSQL

GithubGitHub

Video Games Sales Queries

Normalizing the data, creating a database, querying the database, and visualizing game sales.
Python, SQL

GithubGitHub

A/B Test on Web Forms

Analyzing A/B test results on web form reduction using Python.
Python

GithubGitHub

Avocado Sales KPI Dashboard

Dashboard displaying avocado sales in the U.S. in 2016-2017.
Tableau

GithubGitHub

Customer Churn Analysis

Analyzes customer data to identify why customers are churning.
R
Logistic Regression, Decision Trees, Random Forest

GithubGitHub

EDA & Hotel Cancellation Prediction

Explores hotel data from Portugal and identify guest cancellation predictors.
Python
Logistic Regression, K-nearest Neighbors, Random Forest

GithubGitHub

US Census Bureau Web Scraper

A python script that extracts web links from the Population and Housing Unit Estimates web page of the U.S. Census Bureau and outputs those links in a CSV file in an absolute and non-duplicated format.
Python

GithubGitHub

Seattle PD Funding Eligibility

Insights about the logs of emergency 911 calls from the Seattle Police Department and whether the department will be eligible for additional funding if the minimum standard of 2.5 officers onsite per incident is met.
Excel
Tableau, Linear Regression

GithubGitHub

Algorithmic Quantitative Value Investing Strategy

A robust algorithmic quantitative value investing strategy that selects 100 value stocks with the best value metrics using a universe of stocks in S&P 500 and IEX Cloud API.
Excel, Python, API

GithubGitHub

Company Fleet Selection Using Weighted Scoring

Weighted Scoring analysis on selecting one of the four vehicles as part of the company fleet based on requirements.
Excel
Tableau

GithubGitHub

California Population Prediction

Predict the size of its population in the state of California up to the year 2030.
R
Linear Regression

GithubGitHub