Gesture, Speech, and Gaze Cues for Discourse Segmentation

Abstract

Psycholinguistic evidence has established the complementary nature of the verbal and non-verbal aspects of human expression. We present our findings in the detection of these cites in interaction. We use the psycholinguistic device known as the 'catchment' as the locus of integration of gesture, speech and gaze components. We videotape conversation elicitation experiments in which subjects convey complex spatial plans to an interlocutor using a calibrated three-camera setup. We extract the gestural motion of both hands, gaze direction, and voiced units in the discourse and compare these with transcripts generated by expert microanalysis of the video. Our results show the complementary nature of these communicative modalities. Where there is ambiguity in the structure of one modality (such as in haptologies or owing to noise in the audio signal), other modalities provide evidence for correct segmentation.

Cite

Text

Quek et al. "Gesture, Speech, and Gaze Cues for Discourse Segmentation." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2000. doi:10.1109/CVPR.2000.854800

Markdown

[Quek et al. "Gesture, Speech, and Gaze Cues for Discourse Segmentation." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2000.](https://mlanthology.org/cvpr/2000/quek2000cvpr-gesture/) doi:10.1109/CVPR.2000.854800

BibTeX

@inproceedings{quek2000cvpr-gesture,
  title     = {{Gesture, Speech, and Gaze Cues for Discourse Segmentation}},
  author    = {Quek, Francis K. H. and McNeill, David and Bryll, Robert K. and Kirbas, Cemil and Arslan, Hasan and McCullough, Karl E. and Furuyama, Nobuhiro},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2000},
  pages     = {2247-2254},
  doi       = {10.1109/CVPR.2000.854800},
  url       = {https://mlanthology.org/cvpr/2000/quek2000cvpr-gesture/}
}