Marker-Removal Networks to Collect Precise 3D Hand Data for RGB-Based Estimation and Its Application in Piano

Abstract

Hand pose analysis is a key step to understanding dexterous hand performances of many high-level skills, such as playing the piano. Currently, most accurate hand tracking systems are using fabric-/marker-based sensing that potentially disturbs users' performance. On the other hand, markerless computer vision-based methods rely on a precise bare-hand dataset for training, which is difficult to obtain. In this paper, we collect a large-scale high precision 3D hand pose dataset with a small workload using a novel marker-removal network (MR-Net). The proposed MR-Net translates the marked-hand images to realistic bare-hand images, and the corresponding 3D postures are captured by a motion capture system thus few manual annotations are required. A baseline estimation network PiaNet is introduced and we report the accuracy of various metrics together with a blind qualitative test to show the practical effect.

Cite

Text

Wu et al. "Marker-Removal Networks to Collect Precise 3D Hand Data for RGB-Based Estimation and Its Application in Piano." Winter Conference on Applications of Computer Vision, 2023.

Markdown

[Wu et al. "Marker-Removal Networks to Collect Precise 3D Hand Data for RGB-Based Estimation and Its Application in Piano." Winter Conference on Applications of Computer Vision, 2023.](https://mlanthology.org/wacv/2023/wu2023wacv-markerremoval/)

BibTeX

@inproceedings{wu2023wacv-markerremoval,
  title     = {{Marker-Removal Networks to Collect Precise 3D Hand Data for RGB-Based Estimation and Its Application in Piano}},
  author    = {Wu, Erwin and Nishioka, Hayato and Furuya, Shinichi and Koike, Hideki},
  booktitle = {Winter Conference on Applications of Computer Vision},
  year      = {2023},
  pages     = {2977-2986},
  url       = {https://mlanthology.org/wacv/2023/wu2023wacv-markerremoval/}
}