Robust Semiparametric Causal Inference
- 2026-05-11 (Mon.), 10:30 AM
- Auditorium, B1F, Institute of Statistical Science;The tea reception will be held at 10:10.
- Online live streaming through Microsoft Teams will be available.
- Dr. Gaspard Bernard
- Institute of Statistical Science, Academia Sinica
Abstract
The classical causal inference problem in which one aims to estimate an average treatment effect, while allowing the outcome of the treatment to be not necessarily independent of its assignment, is considered. This problem is closely linked to the semiparametric inference problem of estimating the expectation of Y, a Bernoulli random variable, from i.i.d. observations (R, RY, Z’). Here, R is a masking random variable following a Bernoulli distribution representing treatment assignation, and it is assumed that R and Y are independent conditionally on some vector of covariates Z. Root-n consistent and asymptotically semiparametrically efficient estimators have already been proposed in the literature, granted that the underlying model is regular enough. It is also well known that such estimators allow for some flexibility in the modeling, in the sense that a lack of regularity in the conditional expectation of Y given Z can be compensated for by stronger regularity assumptions on the assignment variable R. However, we explore in detail the limitations of this type of estimator from the point of view of robustness, and show that in certain scenarios where the model is slightly misspecified, they tend to perform poorly. In particular, when the assumption that R and Y are independent is not exactly satisfied, these estimators tend to exhibit very poor performance.
Using the theory of rho-estimation, a new robust estimator is therefore proposed. We study its nonasymptotic behavior under contamination and general misspecification, showing that both the estimation error and the approximation error can be controlled. We next turn to the question of the rate-optimality of the proposed estimator when the model is correctly specified. When the underlying model is correctly specified and has low enough regularity, we show that the minimax rate for estimating the functional of interest can be attained. When the underlying model is correctly specified and regular enough, we show that the optimal parametric root-n rate can be attained. Finally, still under correct model specification, we study in what sense a lack of regularity in the conditional expectation of Y can be compensated for by stronger regularity assumptions on R, and vice versa.
Please click here for participating the talk online.

