STable: Table Generation Framework for Encoder-Decoder Models

Abstract

The output structure of database-like tables, consisting of values structured in horizontal rows and vertical columns identifiable by name, can cover a wide range of NLP tasks. Following this constatation, we propose a framework for text-to-table neural models applicable to problems such as extraction of line items, joint entity and relation extraction, or knowledge base population. The permutation-based decoder of our proposal is a generalized sequential method that comprehends information from all cells in the table. The training maximizes the expected log-likelihood for a table's content across all random permutations of the factorization order. During the content inference, we exploit the model's ability to generate cells in any order by searching over possible orderings to maximize the model's confidence and avoid substantial error accumulation, which other sequential models are prone to. Experiments demonstrate a high practical value of the framework, which establishes state-of-the-art results on several challenging datasets, outperforming previous solutions by up to 15%.

Cite

Text

Pietruszka et al. "STable: Table Generation Framework for Encoder-Decoder Models." NeurIPS 2022 Workshops: TRL, 2022.

Markdown

[Pietruszka et al. "STable: Table Generation Framework for Encoder-Decoder Models." NeurIPS 2022 Workshops: TRL, 2022.](https://mlanthology.org/neuripsw/2022/pietruszka2022neuripsw-stable/)

BibTeX

@inproceedings{pietruszka2022neuripsw-stable,
  title     = {{STable: Table Generation Framework for Encoder-Decoder Models}},
  author    = {Pietruszka, Michał and Turski, Michał and Borchmann, Łukasz and Dwojak, Tomasz and Pałka, Gabriela and Szyndler, Karolina and Jurkiewicz, Dawid and Garncarek, Łukasz},
  booktitle = {NeurIPS 2022 Workshops: TRL},
  year      = {2022},
  url       = {https://mlanthology.org/neuripsw/2022/pietruszka2022neuripsw-stable/}
}