Sampling for Approximate Bipartite Network Projection
Abstract
Bipartite graphs manifest as a stream of edges that represent transactions, e.g., purchases by retail customers. Recommender systems employ neighborhood-based measures of node similarity, such as the pairwise number of common neighbors (CN) and related metrics. While the number of node pairs that share neighbors is potentially enormous, only a relatively small proportion of them have many common neighbors. This motivates finding a weighted sampling approach to preferentially sample these node pairs. This paper presents a new sampling algorithm that provides a fixed size unbiased estimate of the similarity matrix resulting from a bipartite edge stream projection. The algorithm has two components. First, it maintains a reservoir of sampled bipartite edges with sampling weights that favor selection of high similarity nodes. Second, arriving edges generate a stream of similarity updates, based on their adjacency with the current sample. These updates are aggregated in a second reservoir sample-based stream aggregator to yield the final unbiased estimate. Experiments on real world graphs show that a 10% sample at each stage yields estimates of high similarity edges with weighted relative errors of about 1%.
Cite
Text
Ahmed et al. "Sampling for Approximate Bipartite Network Projection." International Joint Conference on Artificial Intelligence, 2018. doi:10.24963/IJCAI.2018/456Markdown
[Ahmed et al. "Sampling for Approximate Bipartite Network Projection." International Joint Conference on Artificial Intelligence, 2018.](https://mlanthology.org/ijcai/2018/ahmed2018ijcai-sampling/) doi:10.24963/IJCAI.2018/456BibTeX
@inproceedings{ahmed2018ijcai-sampling,
title = {{Sampling for Approximate Bipartite Network Projection}},
author = {Ahmed, Nesreen K. and Duffield, Nick G. and Xia, Liangzhen},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2018},
pages = {3286-3292},
doi = {10.24963/IJCAI.2018/456},
url = {https://mlanthology.org/ijcai/2018/ahmed2018ijcai-sampling/}
}