Causal ML & Meta-Learners Quiz - Stephen Quiz Week 6

Time Remaining:
25:00

Student Information

Question 1: Data Interpretation

Below is a chart showing the pre-treatment GDP trends for California and several potential control states:

Pre-Treatment GDP Growth (%) 2015 2016 2017 2018 2019 Year 0 1 2 3 4 5 GDP Growth (%) California Nevada Oregon Washington Arizona

Based on this pre-treatment data, which state would make the best single control unit for a Synthetic Control Method analysis of California?

Question 2: Multiple Choice

Which treatment effect explains why a tutoring program works better for STEM students than humanities students?

Question 3: Multiple Choice

In the context of SHAP values, what does a negative SHAP value for a feature indicate?

Question 4: Multiple Choice

Using our policy intervention dataset, if California implemented a policy (Treatment = "Yes") and saw GDP increase from 3.2 to 3.8, while Oregon had no policy (Treatment = "No") and saw GDP increase from 3.1 to 3.3, what is the naive estimate of the treatment effect?

Question 5: Multiple Choice

In a Synthetic Control Method, if Nevada contributes 30%, Oregon contributes 40%, and Washington contributes 30% to your synthetic California, what would be the synthetic prediction if these states have unemployment rates of 4.2%, 3.9%, and 4.5% respectively?

Question 6: Data Analysis

The following table shows treatment effects for a job training program across different demographic groups:

Demographic Group Sample Size Treatment Effect (Income Increase %) Standard Error p-value
Women under 30 245 14.2% 2.1% 0.001
Women over 30 278 9.5% 1.8% 0.008
Men under 30 231 6.8% 2.4% 0.015
Men over 30 252 3.2% 1.9% 0.089
Overall Average 1006 8.4% 1.1% 0.003

What does this data tell us about the Conditional Average Treatment Effects (CATEs) of the job training program?

Question 7: Chart Analysis

Consider this chart showing the actual outcome for a treated unit (red line) compared with two different counterfactual estimates (blue and green lines):

Treatment Effect Analysis: Smoking Ban on Respiratory Admissions Pre-Treatment Post-Treatment Policy Implemented 0 10 20 30 40 50 Hospital Admissions Actual Outcome Synthetic Control Counterfactual Difference-in-Differences Counterfactual

Based on the data shown in this chart, which of the following statements is most accurate?

Question 8: Multiple Choice

If a policy intervention lowered unemployment from 5.1% to 4.8% in treated regions, but would have naturally decreased to 5.0% without intervention (according to synthetic control), what is the actual treatment effect?

Question 9: Multiple Choice

In the context of LATE (Local Average Treatment Effect), what do we mean by "compliers"?

Question 10: Chart Comparison

The charts below compare the Synthetic Control Method (SCM) and Difference-in-Differences (DiD) approaches for estimating the effect of a minimum wage increase on employment levels:

Synthetic Control Method Pre-Treatment Post-Treatment 0% 2% 4% 6% 8% Unemployment Rate Effect: +4% Treated State Synthetic Control
Difference-in-Differences Pre-Treatment Post-Treatment 0% 2% 4% 6% 8% Unemployment Rate Effect: +3% Treated State Control State Counterfactual Trend

Based on these charts, which statement is most accurate regarding the advantages of the Synthetic Control Method compared to Difference-in-Differences?

Question 11: Multiple Choice

Using our dataset, if both California (treated) and Washington (untreated) had GDP growth of 0.2% before policy implementation, and after implementation California grew by 0.5% while Washington grew by 0.3%, what is the difference-in-differences estimate?

Question 12: SHAP Analysis

The following SHAP (SHapley Additive exPlanations) plot shows the contribution of various features to house price predictions:

SHAP Values for House Price Prediction Base Value: $250,000 Prediction: $285,000 Base Value Prediction Location (+$15,000) Square Footage (+$10,000) Fireplace (+$5,000) Age of House (-$12,000) No Pool (-$3,000) Increases prediction Decreases prediction

Based on the SHAP values displayed in this visualization, which feature contributes the most to increasing the house price prediction above the base value?

Question 13: External Validity Analysis

Below is data from four different studies of the same educational intervention conducted in different settings:

Study Location Population Sample Size Effect Size
(Std. Deviations)
95% Confidence
Interval
Study A Urban USA Low-income 1,250 0.42 [0.36, 0.48]
Study B Rural USA Mixed income 850 0.29 [0.22, 0.36]
Study C Urban Kenya Low-income 1,100 0.68 [0.61, 0.75]
Study D Urban India Low-income 950 0.71 [0.63, 0.79]

Based on this data, which conclusion about the external validity of this educational intervention is best supported?

Question 14: Synthetic Control Assumptions

Below is a graph showing pre-treatment fit for a Synthetic Control Method analysis of a tax policy implemented in State A:

Pre-Treatment Fit: State A vs. Synthetic State A 2016 2017 2018 2019 2020 Year Treatment 0 5 10 15 20 25 Tax Revenue ($ Millions) State A Synthetic State A

Based on this pre-treatment fit chart, which key assumption of the Synthetic Control Method is most likely violated in this analysis?

Question 15: True/False

The Synthetic Control Method (SCM) requires a perfect pre-treatment fit between the treated unit and its synthetic counterpart.

Question 16: True/False

The Average Treatment Effect (ATE) and the Treatment Effect on the Treated (TT) are always identical.

Question 17: True/False

SHAP values always sum up to the difference between the model's prediction and the average prediction over the entire dataset.

Question 18: Scenario Question

Consider a job training program that shows the following Conditional Average Treatment Effects (CATE) across different demographic groups:

CATE for Job Training Program Under 25 25-40 41-55 Over 55 Age Group 0% 5% 10% 15% 20% Income Increase 12% 15% 8% 2%

Based on this CATE diagram, which of the following decisions would be most appropriate if resources are limited?

Question 19: Scenario Question

A government implemented a tax credit for electric vehicles and wants to assess its impact on EV adoption rates. The CATE analysis shows:

CATE for EV Tax Credit by Income Group Low Income Middle Income High Income Income Group 0% 2% 4% 6% 8% Increase in EV Adoption 0.5% 4% 7%

Based on these CATEs, what can we conclude about the tax credit policy?

Question 21: Multiple Choice

In the context of counterfactual analysis, what does the statement "What would happen to house prices if every home had a fireplace?" represent?

Question 22: Scenario Question

A housing market analysis produced the following counterfactual visualization. The blue histogram shows the current distribution of house prices in a neighborhood, while the orange histogram shows the counterfactual distribution if every house without a fireplace had one installed:

Actual vs. Counterfactual House Price Distribution $200k $250k $300k $350k $400k House Price 0 10 20 30 40 Frequency Current Distribution Counterfactual (All Houses with Fireplaces)

Based on this counterfactual analysis, which of the following conclusions is most accurate?

Quiz Results

Your score: 0/20