Abstract—As one of most popular technologies, audio
fingerprinting has recently attracted much attention in music
retrieval systems. In music retrieval methods based on audio
fingerprints, a large database is required in order to compare
the fingerprints extracted from the query. In other words, the
efficient search method has to be developed. In this paper, we
propose a method for index compression using a compressed
suffix array. Taking advantage of the fact that the repetitive
characters occur frequently in higher bits of the sorted audio
fingerprint data, the proposed method compresses the index by
encoding the 8-bit data sequences by Run Length Encoding.
Vertical Code is also used to compress the array, wherein the
positions of the sorted data are stored. Four sets of music
databases are used in experiments to evaluate the effectiveness
of the proposed method. The experimental results show that the
proposed method, compared with the conventional method,
only needs 30% of the space of an audio fingerprints database
for a music database consisting of 8000 songs, and around 80%
of the index space for a database of 1000 songs. Moreover, the
entire space cost is reduced to around 60%, compared with the
method based on the suffix array.
Index Terms—Audio fingerprint, compressed suffix array,
index compression, run length encoding, vertical code.
Qingmei Xiao, Kazuyuki Matsumoto and Kenji Kita are with the
Department of Information Science and Intelligent Systems, the University
of Tokushima, Tokushima, Japan (e-mail: hanmay510122@gmail.com,
matumoto@is.tokushima-u.ac.jp, kita@is.tokushima-u.ac.jp).
Narumi Saito is with OPTPIA Co., Ltd., Tokushima, Japan (e-mail:
saito-narumi@iss.tokushima-u.ac.jp).
Xin Luo is with the School of Computer Science and Technology,
Donghua Universtiy, Shanghai, (e-mail: China rashin.lx@gmail.com).
Yasushi Yokota is with Doi Hospital, Emihe, Japan (e-mail:
yokota@is.tokushima-u.ac.jp).
Cite:Qingmei Xiao, Narumi Saito, Kazuyuki Matsumoto, Xin Luo, Yasushi Yokota, and Kenji Kita, "Index Compression for Audio Fingerprinting Systems Based on Compressed Suffix Array," International Journal of Information and Education Technology vol. 3, no. 4, pp. 455-460, 2013.