Learning for Efficient Retrieval of Structured Data with Noisy Queries

Abstract

Increasingly large collections of structured data necessitate the development of efficient, noise-tolerant retrieval tools. In this work, we consider this issue and describe an approach to learn a similarity function that is not only accurate, but that also increases the effectiveness of retrieval data structures. We present an algorithm that uses functional gradient boosting to maximize both retrieval accuracy and the retrieval efficiency of vantage point trees. We demonstrate the effectiveness of our approach on two datasets, including a moderately sized real-world dataset of folk music.

Cite

Text

Parker et al. "Learning for Efficient Retrieval of Structured Data with Noisy Queries." International Conference on Machine Learning, 2007. doi:10.1145/1273496.1273588

Markdown

[Parker et al. "Learning for Efficient Retrieval of Structured Data with Noisy Queries." International Conference on Machine Learning, 2007.](https://mlanthology.org/icml/2007/parker2007icml-learning/) doi:10.1145/1273496.1273588

BibTeX

@inproceedings{parker2007icml-learning,
  title     = {{Learning for Efficient Retrieval of Structured Data with Noisy Queries}},
  author    = {Parker, Charles and Fern, Alan and Tadepalli, Prasad},
  booktitle = {International Conference on Machine Learning},
  year      = {2007},
  pages     = {729-736},
  doi       = {10.1145/1273496.1273588},
  url       = {https://mlanthology.org/icml/2007/parker2007icml-learning/}
}