Syskill & Webert: Identifying Interesting Web Sites
Abstract
We describe Syskill & Webert, a software agent that learns to rate pages on the World Wide Web (WWW), deciding what pages might interest a user. The user rates explored pages on a three point scale, and Syskill & Webert learns a user profile by analyzing the information on each page. The user profile can be used in two ways. First, it can be used to suggest which links a user would be interested in exploring. Second, it can be used to construct a LYCOS query to find pages that would interest a user. We compare six different algorithms from machine learning and information retrieval on this task. We find that the naive Bayesian classifier offers several advantages over other learning algorithms on this task. Furthermore, we find that an initial portion of a web page is sufficient for making predictions on its interestingness substantially reducing the amount of network transmission required to make predictions. 1 Introduction There is a vast amount of information on the World Wide Web...
Cite
Text
Pazzani et al. "Syskill & Webert: Identifying Interesting Web Sites." AAAI Conference on Artificial Intelligence, 1996.Markdown
[Pazzani et al. "Syskill & Webert: Identifying Interesting Web Sites." AAAI Conference on Artificial Intelligence, 1996.](https://mlanthology.org/aaai/1996/pazzani1996aaai-syskill/)BibTeX
@inproceedings{pazzani1996aaai-syskill,
title = {{Syskill & Webert: Identifying Interesting Web Sites}},
author = {Pazzani, Michael J. and Muramatsu, Jack and Billsus, Daniel},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {1996},
pages = {54-61},
url = {https://mlanthology.org/aaai/1996/pazzani1996aaai-syskill/}
}