Abstract:Machine learning classifiers in dynamic environments face concept drift -- changes in the data-generating process that degrade performance. Conventional evaluation via static test sets or noise perturbations fails to preserve causal dependencies in tabular data, often producing causally invalid assessments. Post-hoc tools like SHAP and LIME offer correlational insights that may not reflect the causal mechanisms driving model failure.
We propose a framework that complements existing drift detection by leveraging Structural Causal Models as "Digital Twins" of data-generating processes, enabling precise causal interventions while preserving structural dependencies. Our technique, Causal Parametric Drift Simulation, stress-tests classifiers to identify vulnerabilities before deployment. Experiments on the Open Sourcing Mental Illness (OSMH) dataset demonstrate that this approach exposes latent vulnerabilities invisible to standard statistical monitors.
| Comments: | 34 pages, 13 figures, 14 tables |
| Subjects: | Machine Learning (cs.LG); Artificial Intelligence (cs.AI) |
| MSC classes: | 62H22, 62D20, 68T05 |
| ACM classes: | I.2.6; I.2.0; G.3 |
| Cite as: | arXiv:2605.09663 [cs.LG] |
| (or arXiv:2605.09663v1 [cs.LG] for this version) | |
| https://doi.org/10.48550/arXiv.2605.09663 arXiv-issued DOI via DataCite (pending registration) |
Submission history
From: Julien Lafrance [view email]
[v1]
Sun, 10 May 2026 17:28:49 UTC (3,160 KB)
