LtrGCN: Large-Scale Graph Convolutional Networks-Based Learning to Rank for Web Search

Li, Yuchen; Xiong, Haoyi; Kong, Linghe; Wang, Shuaiqiang; Sun, Zeyi; Chen, Hongyang; Chen, Guihai; Yin, Dawei

doi:10.1007/978-3-031-43427-3_38

LtrGCN: Large-Scale Graph Convolutional Networks-Based Learning to Rank for Web Search

Yuchen Li, Haoyi Xiong, Linghe Kong, Shuaiqiang Wang, Zeyi Sun, Hongyang Chen, Guihai Chen, Dawei Yin

ECML-PKDD 2023 pp. 635-651

doi:10.1007/978-3-031-43427-3_38 /ecmlpkdd/2023/li2023ecmlpkdd-ltrgcn/

Abstract

While traditional Learning to Rank (LTR) models use query-webpage pairs to perform regression tasks to predict the ranking scores, they usually fail to capture the structure of interactions between queries and webpages over an extremely large bipartite graph. In recent years, Graph Convolutional Neural Networks (GCNs) have demonstrated their unique advantages in link prediction over bipartite graphs and have been successfully used for user-item recommendations. However, it is still difficult to scale-up GCNs for web search, due to the (1) extreme sparsity of links in query-webpage bipartite graphs caused by the expense of ranking scores annotation and (2) imbalance between queries (billions) and webpages (trillions) for web-scale search as well as the imbalance in annotations. In this work, we introduce the Q -subgraph and W -subgraph to represent every query and webpage with the structure of interaction preserved, and then propose LtrGCN —an LTR pipeline that samples Q -subgraphs and W -subgraphs from all query-webpage pairs, learns to extract features from Q -subgraphs and W -subgraphs, and predict ranking scores in an end-to-end manner. We carried out extensive experiments to evaluate LtrGCN using two real-world datasets and online experiments based on the A/B test at a large-scale search engine. The offline results show that LtrGCN could achieve $\varDelta $ NDCG $_{5}$ = 2.89%–3.97% compared to baselines. We deploy LtrGCN with realistic traffic at a large-scale search engine, where we can still observe significant improvement. LtrGCN performs consistently in both offline and online experiments.

PDF ECML-PKDD Semantic Scholar

Cite

Text

Li et al. "LtrGCN: Large-Scale Graph Convolutional Networks-Based Learning to Rank for Web Search." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2023. doi:10.1007/978-3-031-43427-3_38

Markdown

[Li et al. "LtrGCN: Large-Scale Graph Convolutional Networks-Based Learning to Rank for Web Search." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2023.](https://mlanthology.org/ecmlpkdd/2023/li2023ecmlpkdd-ltrgcn/) doi:10.1007/978-3-031-43427-3_38

BibTeX

@inproceedings{li2023ecmlpkdd-ltrgcn,
  title     = {{LtrGCN: Large-Scale Graph Convolutional Networks-Based Learning to Rank for Web Search}},
  author    = {Li, Yuchen and Xiong, Haoyi and Kong, Linghe and Wang, Shuaiqiang and Sun, Zeyi and Chen, Hongyang and Chen, Guihai and Yin, Dawei},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2023},
  pages     = {635-651},
  doi       = {10.1007/978-3-031-43427-3_38},
  url       = {https://mlanthology.org/ecmlpkdd/2023/li2023ecmlpkdd-ltrgcn/}
}