A Theoretical Analysis of Query Selection for Collaborative Filtering

Abstract

We consider the problem of determining which of a set of experts has tastes most similar to a given user by asking the user questions about his likes and dislikes. We describe a simple algorithm for generating queries for a theoretical model of this problem. We show that the algorithm requires at most opt( F )(ln(| F |/opt( F )) + 1) + 1 queries to find the correct expert, where opt( F ) is the optimal worst-case bound on the number of queries for learning arbitrary elements of the set of experts F . The algorithm runs in time polynomial in | F | and | X | (where X is the domain) and we prove that no polynomial-time algorithm can have a significantly better bound on the number of queries unless all problems in NP have n O (log log n ) time algorithms. We also study a more general case where the user ratings come from a finite set Y and there is an integer-valued loss function ℓ on Y that is used to measure the distance between the ratings. Assuming that the loss function is a metric and that there is an expert within a distance η from the user, we give a polynomial-time algorithm that is guaranteed to find such an expert after at most 2opt( F , η) ln \(\tfrac{{\left| F \right|}}{{1 + \deg (F,\eta )}}\) + 2(η + 1)(1 + deg( F , η)) queries, where deg( F , η) is the largest number of experts in F that are within a distance 2η of any f ∈ F .

Cite

Text

Dasgupta et al. "A Theoretical Analysis of Query Selection for Collaborative Filtering." Machine Learning, 2003. doi:10.1023/A:1022961719072

Markdown

[Dasgupta et al. "A Theoretical Analysis of Query Selection for Collaborative Filtering." Machine Learning, 2003.](https://mlanthology.org/mlj/2003/dasgupta2003mlj-theoretical/) doi:10.1023/A:1022961719072

BibTeX

@article{dasgupta2003mlj-theoretical,
  title     = {{A Theoretical Analysis of Query Selection for Collaborative Filtering}},
  author    = {Dasgupta, Sanjoy and Lee, Wee Sun and Long, Philip M.},
  journal   = {Machine Learning},
  year      = {2003},
  pages     = {283-298},
  doi       = {10.1023/A:1022961719072},
  volume    = {51},
  url       = {https://mlanthology.org/mlj/2003/dasgupta2003mlj-theoretical/}
}