A Privacy-Sensitive Approach to Modeling Multi-Person Conversations

Abstract

In this paper we introduce a new dynamic Bayesian network that separates the speakers and their speaking turns in a multi-person conversation. We protect the speakers' privacy by using only features from which intelligible speech cannot be reconstructed. The model we present combines data from multiple audio streams, segments the streams into speech and silence, separates the different speakers, and detects when other nearby individuals who are not wearing microphones are speaking. No pre-trained speaker specific models are used, so the system can be easily applied in new and different environments. We show promising results in two very different datasets that vary in background noise, microphone placement and quality, and conversational dynamics.

Cite

Text

Wyatt et al. "A Privacy-Sensitive Approach to Modeling Multi-Person Conversations." International Joint Conference on Artificial Intelligence, 2007.

Markdown

[Wyatt et al. "A Privacy-Sensitive Approach to Modeling Multi-Person Conversations." International Joint Conference on Artificial Intelligence, 2007.](https://mlanthology.org/ijcai/2007/wyatt2007ijcai-privacy/)

BibTeX

@inproceedings{wyatt2007ijcai-privacy,
  title     = {{A Privacy-Sensitive Approach to Modeling Multi-Person Conversations}},
  author    = {Wyatt, Danny and Choudhury, Tanzeem and Bilmes, Jeff A. and Kautz, Henry A.},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2007},
  pages     = {1769-1775},
  url       = {https://mlanthology.org/ijcai/2007/wyatt2007ijcai-privacy/}
}