Extracting Rare Dependence Patterns via Adaptive Sample Reweighting
Abstract
Discovering dependence patterns between variables from observational data is a fundamental issue in data analysis. However, existing testing methods often fail to detect subtle yet critical patterns that occur within small regions of the data distribution–patterns we term rare dependence. These rare dependencies obscure the true underlying dependence structure in variables, particularly in causal discovery tasks. To address this issue, we propose a novel testing method that combines kernel-based (conditional) independence testing with adaptive sample importance reweighting. By learning and assigning higher importance weights to data points exhibiting significant dependence, our method amplifies the patterns and can detect them successfully. Theoretically, we analyze the asymptotic distributions of the statistics in this method and show the uniform bound of the learning scheme. Furthermore, we integrate our tests into the PC algorithm, a constraint-based approach for causal discovery, equipping it to uncover causal relationships even in the presence of rare dependence. Empirical evaluation of synthetic and real-world datasets comprehensively demonstrates the efficacy of our method.
Cite
Text
Li et al. "Extracting Rare Dependence Patterns via Adaptive Sample Reweighting." Proceedings of the 42nd International Conference on Machine Learning, 2025.Markdown
[Li et al. "Extracting Rare Dependence Patterns via Adaptive Sample Reweighting." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/li2025icml-extracting/)BibTeX
@inproceedings{li2025icml-extracting,
title = {{Extracting Rare Dependence Patterns via Adaptive Sample Reweighting}},
author = {Li, Yiqing and Xia, Yewei and Wang, Xiaofei and Chen, Zhengming and Peng, Liuhua and Gong, Mingming and Zhang, Kun},
booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
year = {2025},
pages = {36365-36399},
volume = {267},
url = {https://mlanthology.org/icml/2025/li2025icml-extracting/}
}