Teaching Materials for Machine Learning
Here you will find teaching materials from sections I ran as part of Harvard Kennedy School's Machine Learning and Big Data Analytics class (Fall 2018). The main goal of these sessions was to enable students to use R for machine learning, though the sessions also included brief and high-level conceptual reviews (the PDF documents).
Getting Started with R
The R code is intended to be used in RStudio, which can be downloaded here. You will also need to install R, which you can do here. Please download and install both R and RStudio before attempting to run the code included on this page.
Conceptual Introductions to Topics in Machine Learning
Introduction to Machine Learning and k-Nearest Neighbors (k-NN)
Cross-validation and Linear / Quadratic Discriminant Analysis (LDA and QDA)
Bootstrapping and Penalized Regressions (Lasso and Ridge Regression)
Principal Components Analysis (PCA) and Partial Least Squares (PLS)
Random Forest
R Code for Machine Learning
Introduction to Machine Learning and k-Nearest Neighbors (k-NN)
Cross-validation and Linear / Quadratic Discriminant Analysis (LDA and QDA)
Bootstrapping and Penalized Regressions (Lasso and Ridge Regression)
Principal Components Analysis (PCA) and Partial Least Squares (PLS)
Random Forest, Splines, and Generalized Additive Models (GAMs)