Scalable Online Index Construction with Multi-Core CPUs

Published in ADC - Australasian Database Conference, 2010

Recommended citation: H. Yamada and M. Toyama. 2010. Scalable Online Index Construction with Multi-core CPUs. In ADC. 29–36. https://dl.acm.org/doi/10.5555/1862242.1862249

Inverted index is a core element of current text retrieval systems. They can be dynamically constructed using online indexing approaches in the environment which even a small delay in timeliness cannot be tolerated, and the index must always be queryable and up to date. Recently, efficient online index construction schemes have been proposed, however, previous works have not focused on scalability with the modern commodity hardware resources such as multi-core CPUs. In this paper, we propose a scalable online index construction method that better utilizes multi-core CPUs. Using experiments on 30 GB of web data, we demonstrate the efficiency of our method in practice, showing that it dramatically reduces online index construction time without sacrificing query performance.

Download paper here

Recommended citation: H. Yamada and M. Toyama. 2010. Scalable Online Index Construction with Multi-core CPUs. In ADC. 29–36.