Causal Parametric Drift Simulation: A Digital Twin Framework for Classifier Robustness Evaluation

View PDF HTML (experimental)

Abstract:Machine learning classifiers in dynamic environments face concept drift -- changes in the data-generating process that degrade performance. Conventional evaluation via static test sets or noise perturbations fails to preserve causal dependencies in tabular data, often producing causally invalid assessments. Post-hoc tools like SHAP and LIME offer correlational insights that may not reflect the causal mechanisms driving model failure.
We propose a framework that complements existing drift detection by leveraging Structural Causal Models as "Digital Twins" of data-generating processes, enabling precise causal interventions while preserving structural dependencies. Our technique, Causal Parametric Drift Simulation, stress-tests classifiers to identify vulnerabilities before deployment. Experiments on the Open Sourcing Mental Illness (OSMH) dataset demonstrate that this approach exposes latent vulnerabilities invisible to standard statistical monitors.

Comments:	34 pages, 13 figures, 14 tables
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
MSC classes:	62H22, 62D20, 68T05
ACM classes:	I.2.6; I.2.0; G.3
Cite as:	arXiv:2605.09663 [cs.LG]
	(or arXiv:2605.09663v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2605.09663 arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Julien Lafrance [view email]
[v1] Sun, 10 May 2026 17:28:49 UTC (3,160 KB)