Learning the Common Structure of Data

Abstract

The proliferation of online information sources has accentuated the need for tools that automatically validate and recognize data. We present an efficient algorithm that learns structural information about data from positive examples alone. We describe two Web wrapper maintenance applications that employ this algorithm. The first application detects when a wrapper is not extracting correct data. The second application automatically identifies data on Web pages so that the wrapper may be reinduced when the source format changes.

Cite

Text

Lerman and Minton. "Learning the Common Structure of Data." AAAI Conference on Artificial Intelligence, 2000.

Markdown

[Lerman and Minton. "Learning the Common Structure of Data." AAAI Conference on Artificial Intelligence, 2000.](https://mlanthology.org/aaai/2000/lerman2000aaai-learning/)

BibTeX

@inproceedings{lerman2000aaai-learning,
  title     = {{Learning the Common Structure of Data}},
  author    = {Lerman, Kristina and Minton, Steven},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2000},
  pages     = {609-614},
  url       = {https://mlanthology.org/aaai/2000/lerman2000aaai-learning/}
}