A brief introduction to data analysis with R using the fortune 500 dataset.
R Analyses ListA collection of analyses performed in R.
2016 Kaggle Caravan Insurance Challenge (Part 1 of 2). Dealing with unbalanced data.
Attempting to quantify happiness. Building clustering models on the 2016 World happiness report.
What to do when things go too well. Building and comparing XGBoost and Random Forest models on the Agaricus dataset (Mushroom Database).
Plotting a few common statistical functions, namely: PDF, CDF, and iCDF
Analyzing the classic sleep dataset using, two-sample and paired t-tests, and calculating statistical power.
Getting started with modeling. Multiple approaches to Multiple Linear Regression using the classic Boston Housing dataset