Information Extraction Based Multiple-Category Document Classification for the Global Legal Information Network

Abstract

This paper describes a prototype application of an information extraction (IE) based document classification system in the international law domain. IE is used to determine whether a set of concepts for a class is present in a document. The syntactic and semantic constraints that must be satisfied to make this determination are derived automatically from a training corpus. A collection of IE systems is arranged in a classification hierarchy, and novel documents are guided down the hierarchy based on the results from the previous level. Experimental results for a research prototype are given on a subset of the Global Legal Information Network domain.
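The hierarchical routing described in the abstract can be illustrated with a minimal sketch. It is not the authors' system: the learned syntactic and semantic constraints are replaced here by hand-written regular expressions, and the category names, patterns, `Node` class, and `classify` function are all hypothetical, chosen only to show how a document might be passed down a hierarchy and end up in more than one category.

```python
from __future__ import annotations

import re
from dataclasses import dataclass, field


@dataclass
class Node:
    """One class in the hierarchy with a stand-in concept detector."""
    name: str
    # Assumption: regex patterns stand in for IE constraints learned from a corpus.
    patterns: list[str] = field(default_factory=list)
    children: list[Node] = field(default_factory=list)

    def accepts(self, text: str) -> bool:
        # The class is "present" only if every required pattern matches.
        return all(re.search(p, text, re.IGNORECASE) for p in self.patterns)


def classify(node: Node, text: str, path: tuple[str, ...] = ()) -> list[tuple[str, ...]]:
    """Return every root-to-node path along which all nodes accept the text."""
    if not node.accepts(text):
        return []
    path = path + (node.name,)
    # Pass the document to each child; several branches may accept it.
    deeper = [p for child in node.children for p in classify(child, text, path)]
    # If no child accepts, the document stops at the current level.
    return deeper or [path]


# Hypothetical two-level hierarchy for illustration only.
root = Node("law", [], [
    Node("trade", [r"\b(tariff|import)\b"], [Node("customs", [r"\bcustoms\b"])]),
    Node("environment", [r"\bpollution\b"]),
])

doc = "Decree setting the tariff on goods cleared through customs and limiting pollution."
for category_path in classify(root, doc):
    print(" > ".join(category_path))
```

Because each level is consulted only when its parent accepted the document, lower-level detectors never run on documents that were already ruled out higher up, and a single document can follow several branches at once.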

Cite

Text

Holowczak and Adam. "Information Extraction Based Multiple-Category Document Classification for the Global Legal Information Network." AAAI Conference on Artificial Intelligence, 1997.

Markdown

[Holowczak and Adam. "Information Extraction Based Multiple-Category Document Classification for the Global Legal Information Network." AAAI Conference on Artificial Intelligence, 1997.](https://mlanthology.org/aaai/1997/holowczak1997aaai-information/)

BibTeX

@inproceedings{holowczak1997aaai-information,
  title     = {{Information Extraction Based Multiple-Category Document Classification for the Global Legal Information Network}},
  author    = {Holowczak, Richard D. and Adam, Nabil R.},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {1997},
  pages     = {1013--1018},
  url       = {https://mlanthology.org/aaai/1997/holowczak1997aaai-information/}
}