2. Multicollinearity, Precision Adjustment, and DAGs for Good and Bad Controls
In this lab, we explore key concepts in causal inference: multicollinearity, precision adjustment, and the use of Directed Acyclic Graphs (DAGs) to identify good and bad controls. We start with multicollinearity, where highly correlated predictors in a regression model inflate the variances of the estimated coefficients. This section combines theoretical explanation with practical examples to show how multicollinearity can distort regression estimates, and it covers methods to detect and address the problem.
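To make the idea concrete before the lab begins, here is a minimal simulation sketch (in Python with numpy and statsmodels; the variable names and coefficient values are illustrative assumptions, not the lab's own code). It generates two nearly collinear regressors and shows how the fitted standard errors and variance inflation factors (VIFs) react:

```python
# Illustrative simulation (not the lab's code): two nearly collinear regressors.
import numpy as np
import statsmodels.api as sm
from statsmodels.stats.outliers_influence import variance_inflation_factor

rng = np.random.default_rng(0)
n = 500

x1 = rng.normal(size=n)
x2 = 0.95 * x1 + 0.05 * rng.normal(size=n)   # x2 is almost a copy of x1
y = 1.0 + 1.0 * x1 + 1.0 * x2 + rng.normal(size=n)

X = sm.add_constant(np.column_stack([x1, x2]))
fit = sm.OLS(y, X).fit()

# Standard errors on x1 and x2 are inflated because the two regressors
# carry nearly the same information.
print(fit.summary())

# Variance inflation factors: values far above 10 flag severe multicollinearity.
for j, name in [(1, "x1"), (2, "x2")]:
    print(name, variance_inflation_factor(X, j))
```

Dropping one of the collinear regressors, or combining them into a single index, typically brings the standard errors back down; detecting and addressing the problem in this way is what the first part of the lab works through.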
Next, we turn to precision adjustment: adding variables to a regression model to reduce residual variance and sharpen the precision of the estimates. This section introduces the Lasso method for variable selection, which identifies relevant variables to include while screening out irrelevant ones, and we emphasize that controls must be chosen carefully to avoid introducing bias.

Using DAGs, we then show how to represent causal relationships visually and how to distinguish good controls, which reduce bias, from bad controls, which amplify it. Practical exercises and simulations reinforce these concepts and prepare students to apply them to real-world data.
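As a preview of the precision-adjustment idea, the following minimal sketch (an assumed setup with numpy and statsmodels, not the lab's own code) compares the treatment coefficient's standard error with and without a covariate that predicts the outcome:

```python
# Illustrative sketch (assumed setup, not the lab's code): precision adjustment
# with a randomly assigned treatment d and a covariate w that predicts y.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(1)
n = 1000

d = rng.binomial(1, 0.5, size=n)   # randomized treatment
w = rng.normal(size=n)             # pre-treatment covariate, predictive of y
y = 0.5 * d + 2.0 * w + rng.normal(size=n)

unadjusted = sm.OLS(y, sm.add_constant(d)).fit()                          # y ~ d
adjusted = sm.OLS(y, sm.add_constant(np.column_stack([d, w]))).fit()      # y ~ d + w

# Both estimates of the treatment effect are approximately unbiased, but
# adjusting for w absorbs residual variance and shrinks the standard error.
print("no adjustment:", unadjusted.params[1], unadjusted.bse[1])
print("with w       :", adjusted.params[1], adjusted.bse[1])
```

With many candidate covariates, a Lasso step (for example, scikit-learn's LassoCV) can be used to select the predictive ones before refitting, which is the variable-selection strategy this section develops.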
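For the good-versus-bad-controls discussion, the sketch below (again an illustrative assumption, not the lab's code) simulates a DAG in which D causes Y and a third variable C is a collider (D → C ← Y); conditioning on C is the textbook example of a bad control:

```python
# Illustrative sketch (assumed setup, not the lab's code): a "bad control".
# In the DAG D -> Y with D -> C <- Y, the variable C is a collider; adjusting
# for it opens a non-causal path between D and Y and biases the estimate.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(2)
n = 5000

d = rng.normal(size=n)             # treatment
y = 1.0 * d + rng.normal(size=n)   # true effect of d on y is 1
c = d + y + rng.normal(size=n)     # collider: caused by both d and y

good = sm.OLS(y, sm.add_constant(d)).fit()                           # y ~ d
bad = sm.OLS(y, sm.add_constant(np.column_stack([d, c]))).fit()      # y ~ d + c

print("without the bad control:", good.params[1])   # close to the true 1
print("with the bad control   :", bad.params[1])    # pulled away from 1
```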