Deep Clustering of Text Representations for Supervision-Free Probing of Syntax
Abstract
We explore deep clustering of multilingual text representations for unsupervised model interpretation and induction of syntax. As these representations are high-dimensional, out-of-the-box methods like K-means do not work well. Thus, our approach jointly transforms the representations into a lower-dimensional cluster-friendly space and clusters them. We consider two notions of syntax: Part of Speech Induction (POSI) and Constituency Labelling (CoLab) in this work. Interestingly, we find that Multilingual BERT (mBERT) contains surprising amount of syntactic knowledge of English; possibly even as much as English BERT (E-BERT). Our model can be used as a supervision-free probe which is arguably a less-biased way of probing. We find that unsupervised probes show benefits from higher layers as compared to supervised probes. We further note that our unsupervised probe utilizes E-BERT and mBERT representations differently, especially for POSI. We validate the efficacy of our probe by demonstrating its capabilities as a unsupervised syntax induction technique. Our probe works well for both syntactic formalisms by simply adapting the input representations. We report competitive performance of our probe on 45-tag English POSI, state-of-the-art performance on 12-tag POSI across 10 languages, and competitive results on CoLab. We also perform zero-shot syntax induction on resource impoverished languages and report strong results.
Cite
Text
Gupta et al. "Deep Clustering of Text Representations for Supervision-Free Probing of Syntax." AAAI Conference on Artificial Intelligence, 2022. doi:10.1609/AAAI.V36I10.21317Markdown
[Gupta et al. "Deep Clustering of Text Representations for Supervision-Free Probing of Syntax." AAAI Conference on Artificial Intelligence, 2022.](https://mlanthology.org/aaai/2022/gupta2022aaai-deep/) doi:10.1609/AAAI.V36I10.21317BibTeX
@inproceedings{gupta2022aaai-deep,
title = {{Deep Clustering of Text Representations for Supervision-Free Probing of Syntax}},
author = {Gupta, Vikram and Shi, Haoyue and Gimpel, Kevin and Sachan, Mrinmaya},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2022},
pages = {10720-10728},
doi = {10.1609/AAAI.V36I10.21317},
url = {https://mlanthology.org/aaai/2022/gupta2022aaai-deep/}
}