A Maximum Entropy Approach to Information Extraction from Semi-Structured and Free Text
Abstract
In this paper, we present a classification-based approach towards single-slot as well as multi-slot information extraction (IE). For single-slot IE, we worked on the domain of Seminar Announcements, where each document contains information on only one seminar. For multi-slot IE, we worked on the domain of Management Succession. For this domain, we restrict ourselves to extracting information sentence by sentence, in the same way as (Soderland 1999). Each sentence can contain information on several management succession events. By using a classification approach based on a maximum entropy framework, our system achieves higher accuracy than the best previously published results in both domains.
Cite
Text
Chieu and Ng. "A Maximum Entropy Approach to Information Extraction from Semi-Structured and Free Text." AAAI Conference on Artificial Intelligence, 2002. doi:10.5555/777092.777213Markdown
[Chieu and Ng. "A Maximum Entropy Approach to Information Extraction from Semi-Structured and Free Text." AAAI Conference on Artificial Intelligence, 2002.](https://mlanthology.org/aaai/2002/chieu2002aaai-maximum/) doi:10.5555/777092.777213BibTeX
@inproceedings{chieu2002aaai-maximum,
title = {{A Maximum Entropy Approach to Information Extraction from Semi-Structured and Free Text}},
author = {Chieu, Hai Leong and Ng, Hwee Tou},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2002},
pages = {786-791},
doi = {10.5555/777092.777213},
url = {https://mlanthology.org/aaai/2002/chieu2002aaai-maximum/}
}