WiFi and Vision Multimodal Learning for Accurate and Robust Device-Free Human Activity Recognition

Abstract

Human activity recognition plays an indispensable role in a myriad of emerging applications in context-aware services. Accurate activity recognition systems usually require the user to carry mobile or wearable devices, which is inconvenient for long term usage. In this paper, we design WiVi, a novel human activity recognition scheme that is able to identify common human activities in an accurate and device-free manner via multimodal machine learning using only commercial WiFi-enabled IoT devices and camera. For sensing using WiFi, a new platform is developed to extract fine-grained WiFi channel information and transform them into WiFi frames. A tailored convolutional neural network model is designed to extract high-level representative features among the WiFi frames in order to provide human activity estimation. We utilized a variant of C3D model for activity sensing using vision. Following this, WiVi performs multimodal fusion at the decision level to combine the strength of WiFi and vision by constructing an ensembled DNN model. Extensive experiments are conducted in an indoor environment, demonstrating that WiVi achieves 97.5% activity recognition accuracy and is robust under unfavorable situations, as each modality provides the complementary sensing when the other faces its limiting conditions.

Cite

Text

Zou et al. "WiFi and Vision Multimodal Learning for Accurate and Robust Device-Free Human Activity Recognition." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2019. doi:10.1109/CVPRW.2019.00056

Markdown

[Zou et al. "WiFi and Vision Multimodal Learning for Accurate and Robust Device-Free Human Activity Recognition." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2019.](https://mlanthology.org/cvprw/2019/zou2019cvprw-wifi/) doi:10.1109/CVPRW.2019.00056

BibTeX

@inproceedings{zou2019cvprw-wifi,
  title     = {{WiFi and Vision Multimodal Learning for Accurate and Robust Device-Free Human Activity Recognition}},
  author    = {Zou, Han and Yang, Jianfei and Das, Hari Prasanna and Liu, Huihan and Zhou, Yuxun and Spanos, Costas J.},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2019},
  pages     = {426-433},
  doi       = {10.1109/CVPRW.2019.00056},
  url       = {https://mlanthology.org/cvprw/2019/zou2019cvprw-wifi/}
}