Faithful Group Shapley Value

Abstract

Data Shapley is an important tool for data valuation, which quantifies the contribution of individual data points to machine learning models. In practice, group-level data valuation is desirable when data providers contribute data in batch. However, we identify that existing group-level extensions of Data Shapley are vulnerable to \emph{shell company attacks}, where strategic group splitting can unfairly inflate valuations. We propose Faithful Group Shapley Value (FGSV) that uniquely defends against such attacks. Building on original mathematical insights, we develop a provably fast and accurate approximation algorithm for computing FGSV. Empirical experiments demonstrate that our algorithm significantly outperforms state-of-the-art methods in computational efficiency and approximation accuracy, while ensuring faithful group-level valuation.

Cite

Text

Lee et al. "Faithful Group Shapley Value." Advances in Neural Information Processing Systems, 2025.

Markdown

[Lee et al. "Faithful Group Shapley Value." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/lee2025neurips-faithful/)

BibTeX

@inproceedings{lee2025neurips-faithful,
  title     = {{Faithful Group Shapley Value}},
  author    = {Lee, Kiljae and Liu, Ziqi and Tang, Weijing and Zhang, Yuan},
  booktitle = {Advances in Neural Information Processing Systems},
  year      = {2025},
  url       = {https://mlanthology.org/neurips/2025/lee2025neurips-faithful/}
}