Sensitivity Evaluation for Unobserved Confounding | by Ugur Yildirim

Machine Learning

Sensitivity Evaluation for Unobserved Confounding | by Ugur Yildirim | Feb, 2024

hhhhm

2024年2月13日

Sensitivity Evaluation for Unobserved Confounding | by Ugur Yildirim | Feb, 2024

[ad_1]

Find out how to know the unknowable in observational research

Introduction
Drawback Setup
2.1. Causal Graph
2.2. Mannequin With and With out Z
2.3. Energy of Z as a Confounder
Sensitivity Evaluation
3.1. Aim
3.2. Robustness Worth
PySensemakr
Conclusion
Acknowledgements
References

The specter of unobserved confounding (aka omitted variable bias) is a infamous drawback in observational research. In most observational research, except we are able to moderately assume that remedy task is as-if random as in a pure experiment, we are able to by no means be really sure that we managed for all potential confounders in our mannequin. Because of this, our mannequin estimates may be severely biased if we fail to manage for an vital confounder–and we wouldn’t even realize it because the unobserved confounder is, effectively, unobserved!

Given this drawback, you will need to assess how delicate our estimates are to potential sources of unobserved confounding. In different phrases, it’s a useful train to ask ourselves: how a lot unobserved confounding would there need to be for our estimates to drastically change (e.g., remedy impact not statistically important)? Sensitivity evaluation for unobserved confounding is an energetic space of analysis, and there are a number of approaches to tackling this drawback. On this submit, I’ll cowl a easy linear technique [1] primarily based on the idea of partial R² that’s broadly relevant to a big spectrum of instances.

2.1. Causal Graph

Allow us to assume that we now have 4 variables:

Y: consequence
D: remedy
X: noticed confounder(s)
Z: unobserved confounder(s)

This can be a frequent setting in lots of observational research the place the researcher is fascinated by understanding whether or not the remedy of curiosity has an impact on the end result after controlling for potential treatment-outcome confounders.

In our hypothetical setting, the connection between these variables are such that X and Z each have an effect on D and Y, however D has no impact on Y. In different phrases, we’re describing a state of affairs the place the true remedy impact is null. As will turn out to be clear within the subsequent part, the aim of sensitivity evaluation is with the ability to motive about this remedy impact when we now have no entry to Z, as we usually gained’t because it’s unobserved. Determine 1 visualizes our setup.

Determine 1: Drawback Setup

2.2. Mannequin With and With out Z

To exhibit the issue that our unobserved Z could cause, I simulated some knowledge in keeping with the issue setup described above. You’ll be able to confer with this pocket book for the small print of the simulation.

Since Z could be unobserved in actual life, the one mannequin we are able to usually match to knowledge is Y~D+X. Allow us to see what outcomes we get if we run that regression.

Picture by writer, equations primarily based on [1], see pages 49–52

[ad_2]

Find out how to know the unknowable in observational research

2.1. Causal Graph

2.2. Mannequin With and With out Z

2.3. Energy of Z as a Confounder

3.1. Aim

3.2. Robustness Worth