Differentially Private Video Activity Recognition
Abstract
In recent years, differential privacy has seen significant advancements in image classification; however, its application to video activity recognition remains under-explored. This paper addresses the challenges of applying differential privacy to video activity recognition, which primarily stem from: (1) a discrepancy between the desired privacy level for entire videos and the nature of input data processed by contemporary video architectures, which are typically short, segmented clips; and (2) the complexity and sheer size of video datasets relative to those in image classification, which render traditional differential privacy methods inadequate. To tackle these issues, we propose Multi-Clip DP-SGD, a novel framework for enforcing video-level differential privacy through clip-based classification models. This method samples multiple clips from each video, averages their gradients, and applies gradient clipping in DP-SGD without incurring additional privacy loss. Moreover, we incorporate a parameter-efficient transfer learning strategy to make the model scalable for large-scale video datasets. Through extensive evaluations on the UCF-101 and HMDB-51 datasets, our approach exhibits impressive performance, achieving 81% accuracy with a privacy budget of epsilon=5 on UCF-101, marking a 76% improvement compared to a direct application of DP-SGD. Furthermore, we demonstrate that our transfer learning strategy is versatile and can enhance differentially private image classification across an array of datasets including CheXpert, ImageNet, CIFAR-10, and CIFAR-100.
Cite
Text
Luo et al. "Differentially Private Video Activity Recognition." Winter Conference on Applications of Computer Vision, 2024.Markdown
[Luo et al. "Differentially Private Video Activity Recognition." Winter Conference on Applications of Computer Vision, 2024.](https://mlanthology.org/wacv/2024/luo2024wacv-differentially/)BibTeX
@inproceedings{luo2024wacv-differentially,
title = {{Differentially Private Video Activity Recognition}},
author = {Luo, Zelun and Zou, Yuliang and Yang, Yijin and Durante, Zane and Huang, De-An and Yu, Zhiding and Xiao, Chaowei and Fei-Fei, Li and Anandkumar, Animashree},
booktitle = {Winter Conference on Applications of Computer Vision},
year = {2024},
pages = {6657-6667},
url = {https://mlanthology.org/wacv/2024/luo2024wacv-differentially/}
}