Hierarchical Macro Discourse Parsing Based on Topic Segmentation

Abstract

Hierarchically constructing micro (i.e., intra-sentence or inter-sentence) discourse structure trees using explicit boundaries (e.g., sentence and paragraph boundaries) has been proved to be an effective strategy. However, it is difficult to apply this strategy to document-level macro (i.e., inter-paragraph) discourse parsing, the more challenging task, due to the lack of explicit boundaries at the higher level. To alleviate this issue, we introduce a topic segmentation mechanism to detect implicit topic boundaries and then help the document-level macro discourse parser to construct better discourse trees hierarchically. In particular, our parser first splits a document into several sections using the topic boundaries that the topic segmentation detects. Then it builds a smaller and more accurate discourse sub-tree in each section and sequentially forms a whole tree for a document. The experimental results on both Chinese MCDTB and English RST-DT show that our proposed method outperforms the state-of-the-art baselines significantly.

Cite

Text

Jiang et al. "Hierarchical Macro Discourse Parsing Based on Topic Segmentation." AAAI Conference on Artificial Intelligence, 2021. doi:10.1609/AAAI.V35I14.17554

Markdown

[Jiang et al. "Hierarchical Macro Discourse Parsing Based on Topic Segmentation." AAAI Conference on Artificial Intelligence, 2021.](https://mlanthology.org/aaai/2021/jiang2021aaai-hierarchical/) doi:10.1609/AAAI.V35I14.17554

BibTeX

@inproceedings{jiang2021aaai-hierarchical,
  title     = {{Hierarchical Macro Discourse Parsing Based on Topic Segmentation}},
  author    = {Jiang, Feng and Fan, Yaxin and Chu, Xiaomin and Li, Peifeng and Zhu, Qiaoming and Kong, Fang},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2021},
  pages     = {13152-13160},
  doi       = {10.1609/AAAI.V35I14.17554},
  url       = {https://mlanthology.org/aaai/2021/jiang2021aaai-hierarchical/}
}