Sampling for Approximate Bipartite Network Projection

Abstract

Bipartite graphs manifest as a stream of edges that represent transactions, e.g., purchases by retail customers. Recommender systems employ neighborhood-based measures of node similarity, such as the pairwise number of common neighbors (CN) and related metrics. While the number of node pairs that share neighbors is potentially enormous, only a relatively small proportion of them have many common neighbors. This motivates finding a weighted sampling approach to preferentially sample these node pairs. This paper presents a new sampling algorithm that provides a fixed size unbiased estimate of the similarity matrix resulting from a bipartite edge stream projection. The algorithm has two components. First, it maintains a reservoir of sampled bipartite edges with sampling weights that favor selection of high similarity nodes. Second, arriving edges generate a stream of similarity updates, based on their adjacency with the current sample. These updates are aggregated in a second reservoir sample-based stream aggregator to yield the final unbiased estimate. Experiments on real world graphs show that a 10% sample at each stage yields estimates of high similarity edges with weighted relative errors of about 1%.

Cite

Text

Ahmed et al. "Sampling for Approximate Bipartite Network Projection." International Joint Conference on Artificial Intelligence, 2018. doi:10.24963/IJCAI.2018/456

Markdown

[Ahmed et al. "Sampling for Approximate Bipartite Network Projection." International Joint Conference on Artificial Intelligence, 2018.](https://mlanthology.org/ijcai/2018/ahmed2018ijcai-sampling/) doi:10.24963/IJCAI.2018/456

BibTeX

@inproceedings{ahmed2018ijcai-sampling,
  title     = {{Sampling for Approximate Bipartite Network Projection}},
  author    = {Ahmed, Nesreen K. and Duffield, Nick G. and Xia, Liangzhen},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2018},
  pages     = {3286-3292},
  doi       = {10.24963/IJCAI.2018/456},
  url       = {https://mlanthology.org/ijcai/2018/ahmed2018ijcai-sampling/}
}