FROSH: FasteR Online Sketching Hashing
Abstract
Many hashing methods, especially those that are in the data-dependent category with good learning accuracy, are still inefficient when dealing with three critical problems in modern data analysis. First, data usually come in a streaming fashion, but most of the existing hashing methods are batch-based models. Second, when data become huge, the extensive computational time, large space requirement, and multiple passes to load the data into memory will be prohibitive. Third, data often lack sufficient label information. Although the recently proposed Online Sketching Hashing (OSH) is promising to alleviate all three issues mentioned above, its training procedure still suffers from a high time complexity. In this paper, we propose a FasteR Online Sketching Hashing (FROSH) method to make the training process faster. Compared with OSH, we leverage fast transform to sketch data more compactly. Particularly, we derive independent transformations to guarantee the sketching accuracy, and design a novel implementation to make such transformations applicable to online data sketching without increasing the space cost. We rigorously prove that our method can yield a comparable learning accuracy with a lower time complexity and an equal space cost compared with OSH. Finally, extensive experiments on synthetic and real-world datasets demonstrate the excellent performance of our method.
Cite
Text
Chen et al. "FROSH: FasteR Online Sketching Hashing." Conference on Uncertainty in Artificial Intelligence, 2017.Markdown
[Chen et al. "FROSH: FasteR Online Sketching Hashing." Conference on Uncertainty in Artificial Intelligence, 2017.](https://mlanthology.org/uai/2017/chen2017uai-frosh/)BibTeX
@inproceedings{chen2017uai-frosh,
title = {{FROSH: FasteR Online Sketching Hashing}},
author = {Chen, Xixian and King, Irwin and Lyu, Michael R.},
booktitle = {Conference on Uncertainty in Artificial Intelligence},
year = {2017},
url = {https://mlanthology.org/uai/2017/chen2017uai-frosh/}
}