Leveraging Hierarchical Feature Sharing for Efficient Dataset Condensation

Abstract

Given a real-world dataset, data condensation (DC) aims to synthesize a small synthetic dataset that captures the knowledge of the original dataset and can be used to train models with comparable accuracy. Recent works propose to enhance DC with data parameterization, which condenses data into very compact parameterized data containers instead of images. The intuition behind data parameterization is to encode features shared across images once, avoiding the storage cost of repeating them. In this paper, we recognize that images share common features in a hierarchical way due to the inherent hierarchical structure of the classification system, which is overlooked by current data parameterization methods. To better align DC with this hierarchical nature and encourage more efficient information sharing inside data containers, we propose a novel data parameterization architecture, Hierarchical Memory Network (HMN). HMN stores condensed data in a three-tier structure, representing the dataset-level, class-level, and instance-level features. Another helpful property of the hierarchical architecture is that HMN naturally ensures good independence among images despite achieving information sharing. This independence enables instance-level pruning of HMN, which removes redundant information and further improves performance. We evaluate HMN on five public datasets and show that our proposed method outperforms all baselines.
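The three-tier idea described in the abstract can be illustrated with a toy container. This is a minimal sketch under stated assumptions, not the authors' implementation: the abstract does not say how the tiers are combined, so plain concatenation is assumed here, no learned decoder is included, and the names `ThreeTierMemory`, `decode`, `prune`, and `num_stored_values` are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

Vector = List[float]


@dataclass
class ThreeTierMemory:
    """Toy container: one dataset-level vector, one vector per class,
    and one vector per instance (all names here are hypothetical)."""

    dataset_memory: Vector
    class_memory: Dict[int, Vector]
    instance_memory: Dict[Tuple[int, int], Vector] = field(default_factory=dict)

    def decode(self, label: int, index: int) -> Vector:
        """Assemble one sample's features by concatenating the three tiers.

        Assumption: concatenation stands in for whatever combination and
        decoding step the actual method uses.
        """
        return (
            self.dataset_memory
            + self.class_memory[label]
            + self.instance_memory[(label, index)]
        )

    def prune(self, label: int, index: int) -> None:
        """Remove one instance; shared tiers and other instances are untouched."""
        del self.instance_memory[(label, index)]

    def num_stored_values(self) -> int:
        """Count every stored scalar across the three tiers."""
        shared = len(self.dataset_memory)
        per_class = sum(len(v) for v in self.class_memory.values())
        per_instance = sum(len(v) for v in self.instance_memory.values())
        return shared + per_class + per_instance


memory = ThreeTierMemory(
    dataset_memory=[0.1, 0.2, 0.3, 0.4],
    class_memory={0: [1.0, 1.1, 1.2], 1: [2.0, 2.1, 2.2]},
    instance_memory={
        (0, 0): [0.5, 0.6],
        (0, 1): [0.7, 0.8],
        (1, 0): [0.9, 1.0],
        (1, 1): [1.1, 1.2],
    },
)

sample = memory.decode(label=1, index=0)

# Storage with sharing versus storing every assembled sample separately.
stored_before = memory.num_stored_values()
flat_cost = len(sample) * len(memory.instance_memory)

# Pruning one instance leaves every other decoded sample unchanged.
kept_before = memory.decode(label=0, index=1)
memory.prune(label=0, index=0)
kept_after = memory.decode(label=0, index=1)
stored_after = memory.num_stored_values()
```

In this toy setting the shared tiers are stored once, so the container holds fewer values than four separately stored feature vectors, and deleting an instance-level entry does not affect any remaining sample. That second property is the kind of independence the abstract credits for making instance-level pruning possible.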

Cite

Text

Zheng et al. "Leveraging Hierarchical Feature Sharing for Efficient Dataset Condensation." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-72691-0_10

Markdown

[Zheng et al. "Leveraging Hierarchical Feature Sharing for Efficient Dataset Condensation." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/zheng2024eccv-leveraging/) doi:10.1007/978-3-031-72691-0_10

BibTeX

@inproceedings{zheng2024eccv-leveraging,
  title     = {{Leveraging Hierarchical Feature Sharing for Efficient Dataset Condensation}},
  author    = {Zheng, Haizhong and Sun, Jiachen and Wu, Shutong and Kailkhura, Bhavya and Mao, Zhuoqing Morley and Xiao, Chaowei and Prakash, Atul},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2024},
  doi       = {10.1007/978-3-031-72691-0_10},
  url       = {https://mlanthology.org/eccv/2024/zheng2024eccv-leveraging/}
}