
1 Introduction

Content-Based Image Retrieval (CBIR) is concerned with finding, in an image dataset, the images most similar to a query selected or photographed by the user. Recent improvements in feature extraction through Convolutional Neural Networks (CNNs) and in embedding algorithms, such as the several R-MAC strategies [6, 15, 20], have made it possible to obtain excellent results on datasets of hundreds of thousands of images in reasonable time [14]. Recently, the application of the diffusion process to CBIR datasets has allowed boosting retrieval performance [11]: it permits finding more neighbours, i.e. points that are close to the query on the nearest-neighbour manifold but not in the Euclidean representation space (Fig. 1). Diffusion propagates the similarities from a query point on a pairwise affinity matrix to all the dataset elements [26]. To apply this process, it is necessary to create a kNN graph of all image embeddings in the dataset. Generally, the more discriminative the embeddings are, the better the results achievable through diffusion.

Fig. 1. In the first figure (a), two distributions of data are represented: the points of one distribution are coloured blue and those of the other orange. In the second figure (b), the black point indicates the query and the green points represent the results obtained by retrieval based on the Euclidean distance. Some correct results are retrieved (points belonging to the blue distribution), but also some incorrect ones (points belonging to the orange distribution). The third figure (c) represents the ideal ranking for the same query point. This result can be obtained by applying the diffusion process from the query point to the other elements of the dataset. Best viewed in color. (Color figure online)

Diffusion is an iterative process that simulates a random walk on the image similarity graph. It consists of walking on the graph, starting from the query point, with the objective of finding the best paths, i.e. of retrieving the neighbours of the query point. This is possible by exploiting the weights of the edges of the kNN graph, which indicate the similarity between two nodes: the greater the weight, the more similar the two nodes are.
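As a concrete illustration, the following is a minimal sketch of a manifold-ranking-style diffusion iteration on a normalized affinity matrix; it conveys the random-walk idea rather than the exact variant used in [11], and all names (`affinity`, `alpha`, `iters`) are illustrative.

```python
import numpy as np

def diffuse(affinity: np.ndarray, query_idx: int, alpha: float = 0.9, iters: int = 50) -> np.ndarray:
    """Propagate similarity from a query node over a kNN affinity matrix.

    affinity: symmetric non-negative matrix A (zero diagonal), A[i, j] = similarity of nodes i and j.
    Returns a ranking score for every node in the graph.
    """
    # Symmetric normalization S = D^{-1/2} A D^{-1/2}, as in classic manifold ranking.
    d = affinity.sum(axis=1)
    d_inv_sqrt = 1.0 / np.sqrt(np.maximum(d, 1e-12))
    S = affinity * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]

    # One-hot indicator vector for the query point.
    y = np.zeros(affinity.shape[0])
    y[query_idx] = 1.0

    # Iterative random walk: f <- alpha * S f + (1 - alpha) * y,
    # which converges to the solution of (I - alpha * S) f = (1 - alpha) * y.
    f = y.copy()
    for _ in range(iters):
        f = alpha * S.dot(f) + (1.0 - alpha) * y
    return f
```

Ranking the dataset by the resulting scores yields the manifold-aware ordering illustrated in Fig. 1(c).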

Unfortunately, this recent method leads to new challenges: understanding the data distribution in order to correctly set the diffusion parameters, dealing with the size of the kNN graph, which grows quadratically with the dataset size, and reducing the convergence time of the linear system related to the diffusion mechanism.

The kNN graph is needed to apply diffusion, and the number of edges in the graph affects the final retrieval performance. Furthermore, it is impossible to know, before applying diffusion, how many and which edges the graph needs in order to achieve good performance. Therefore, a common strategy in previous works was a brute-force approach, i.e. the graph is created with connections between all possible pairs of nodes. Obviously, increasing the number of edges increases the size of the entire graph: for a dataset composed of N images, the exact or brute-force graph has \(N \cdot N\) edges and the approach has complexity \(O(N^2)\); if \(N = 100K\), the number of edges is 10 billion.

Several methods have been proposed that construct an approximate kNN graph, drastically reducing the computational time [1, 3, 19, 25].

Following the idea that not all the edges are necessary for creating the kNN graph, we propose a fast approach for the creation of an approximate version of the kNN graph, based on LSH projections [9], which keeps only the useful edges in order to reduce computation time and memory requirements. The new graph achieves the same retrieval results as the exact kNN graph after diffusion [26] on several public image datasets, but does so more efficiently.

The main contributions of this paper are:

  • An efficient algorithm based on LSH projections for the creation of an approximate kNN graph that obtains the same performance as brute-force in less time.

  • Several optimizations in the implementation that reduce the computational time and memory use.

  • The use of the multi-probe LSH [13] for improving diffusion performance on the kNN graph.

2 Related Work

Graphs have been used for different tasks in computer vision: applying diffusion for image retrieval [11], unsupervised fine-tuning [10], propagating labels for semi-supervised learning [4], creating manifold embeddings [24], and training classifiers exploiting the cycles found in the graph [12].

In particular, a k-Nearest Neighbor (kNN) graph is an undirected graph \(G(V, E)\), where V represents the set of nodes \(V=\left\{ v_1, v_2, \dots , v_n\right\} \) and E represents the set of edges \(E=\left\{ e_1, e_2, \dots , e_m\right\} \). The nodes represent the images in the dataset and the edges represent the connections between nodes. The weight of each edge expresses how similar the two images are: the higher the weight, the more similar the two images are. The weights of the edges are set using cosine similarities between embeddings.

The problem of kNN graph creation has been addressed in the literature in several ways. The simplest and most naive approach is brute force, which tends to be very slow but usually obtains the best results.
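As a reference point, a minimal brute-force construction might look as follows; the embeddings are L2-normalized so that the dot product equals the cosine similarity, and keeping only the k strongest edges per node is an illustrative choice, not a prescription from the paper.

```python
import numpy as np

def brute_force_knn_graph(embeddings: np.ndarray, k: int = 10) -> dict:
    """Exact kNN graph: keys are undirected edges (i, j) with i < j, values are cosine similarities."""
    # L2-normalize so that the dot product equals the cosine similarity.
    X = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    sims = X @ X.T                          # full N x N similarity matrix, O(N^2)
    np.fill_diagonal(sims, -np.inf)         # exclude self-loops

    edges = {}
    for i in range(X.shape[0]):
        for j in np.argsort(-sims[i])[:k]:  # the k most similar neighbours of node i
            a, b = (i, int(j)) if i < j else (int(j), i)
            edges[(a, b)] = float(sims[i, j])
    return edges
```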

To speed up the process while retaining good retrieval accuracy, approximate kNN graphs have been used. The methods for the construction of an approximate kNN graph can be divided into two strategies: methods following a divide-and-conquer strategy, and methods using local search (e.g., NN-descent [3]). Divide-and-conquer methods generally consist of two stages: subdividing the dataset into parts (divide), and creating a kNN graph for each subset, followed by merging all subgraphs to form the final kNN graph (conquer).

As foreseeable, the performance and the computational time depend on the number of subdivisions. The most famous divide-and-conquer approach is based on Locality Sensitive Hashing (LSH) projections [9] for creating the approximate kNN graph [25]. The authors of [25] used a spectral decomposition of a low-rank graph matrix, which requires much time because it is supervised. Chen et al. [1] follow the same strategy, but apply recursive Lanczos bisection [1]. They proposed two divide steps: the overlap and the glue method. In the former, the current set is divided into two overlapping subsets, while in the latter, the current set is divided into two disjoint subsets and a third set, called the gluing set, is used to merge the two resulting disjoint subsets. Wang et al. [21] implemented an algorithm for the creation of an approximate kNN graph through several iterations of the divide-and-conquer strategy. The peculiarity of the method is that the subsets of dataset elements used during the divide phase are randomly chosen; repeating this process several times theoretically allows the entire dataset to be covered. Our approach differs from these approaches in the way the LSH projections are created. Furthermore, the strategy followed for creating the graph is different and more efficient. Moreover, as alternatives to LSH, several other hashing strategies are reported in the literature [23]; they can be categorized based on the method used for preserving the similarities (pairwise, multiwise or implicit) and on the quantization phase.

Regarding the second strategy (local search), Dong et al. [3] proposed an approach called NN-descent, based on the idea that “a neighbour of my neighbour is my neighbour”. For each image descriptor, a random kNN list is created. The algorithm searches random pairs in the kNN lists, calculates the similarity between the elements and finally updates the kNN lists of these elements. This process continues until the number of updates falls below a threshold. Obviously, by increasing the number of neighbours contained in the kNN list the method tends towards the brute-force approach; therefore, the trade-off between speed and accuracy needs to be evaluated carefully. Park et al. [16], Houle et al. [7] and Debatty et al. [2] proposed variations of NN-descent, adapting the basic approach to their specific application domains. Sieranoja et al. [19] proposed a solution, called Random Pair Division, that exploits both the divide-and-conquer and the NN-descent techniques. The division of the dataset into subsets is executed through the random selection of two dataset descriptors: if the descriptor to be clustered is closer to the first one, it is put in the first set, otherwise in the second one. All image descriptors are clustered this way; if the size of a set is greater than a threshold, the subdivision process continues in the same way for that set. The conquer phase is executed through the application of the brute-force approach. In the end, a one-step neighbour propagation is applied to improve the final performance; it also exploits the principle of NN-descent: similar nodes that are not connected get connected.
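The following is a compact, didactic sketch of the NN-descent idea (random kNN lists refined through neighbours of neighbours); it omits the optimizations of [3], and the function name, default parameters and stopping rule are illustrative.

```python
import numpy as np

def nn_descent(X: np.ndarray, k: int = 10, max_iters: int = 10, min_updates: int = 1):
    """Simplified NN-descent: refine random kNN lists via neighbours of neighbours."""
    X = X / np.linalg.norm(X, axis=1, keepdims=True)   # so dot product = cosine similarity
    n = X.shape[0]
    rng = np.random.default_rng(0)
    # Random initial kNN list for every point: (similarity, neighbour) pairs.
    knn = [[(float(X[i] @ X[j]), int(j))
            for j in rng.choice(n, size=k, replace=False) if j != i]
           for i in range(n)]

    for _ in range(max_iters):
        updates = 0
        for i in range(n):
            current = {j for _, j in knn[i]}
            for j in list(current):
                # "A neighbour of my neighbour is my neighbour": probe j's neighbours.
                for _, c in knn[j]:
                    if c == i or c in current:
                        continue
                    sim = float(X[i] @ X[c])
                    worst = min(knn[i])                # weakest current neighbour of i
                    if sim > worst[0]:
                        knn[i].remove(worst)
                        knn[i].append((sim, c))
                        current.discard(worst[1])
                        current.add(c)
                        updates += 1
        if updates < min_updates:                      # stop when the lists barely change
            break
    return knn
```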

3 Proposed Approach: LSH kNN Graph

The proposed approach uses LSH projections to divide the global descriptors of the dataset into many subsets. We first explain LSH and then detail a fast and efficient solution for kNN graph creation for diffusion in image retrieval.

3.1 Notations and Background of LSH

Locality-Sensitive Hashing (LSH) [9] is one of the first hashing techniques proposed for compression and indexing tasks. After the creation of some projection functions, it allows points close to each other to be projected into the same bucket with high probability. It is defined as follows [22]: a family of hash functions \(\mathcal {H}\) is called \((R,cR,P_{1},P_{2})\)-sensitive if, for any two items \(\mathbf {p}\) and \(\mathbf {q}\), it holds that:

  • if \(\mathrm {dist}(\mathbf {p},\mathbf {q})\le R,\) \(\mathrm {Prob}[h(\mathbf {p})=h(\mathbf {q})]\ge P_{1}\)

  • if \(\mathrm {dist}(\mathbf {p},\mathbf {q})\ge cR,\) \(\mathrm {Prob}[h(\mathbf {p})=h(\mathbf {q})]\le P_{2}\)

with \(c>1\) and \(P_{1}>P_{2}\), where R is a distance threshold and \(h(\cdot )\) is the hash function. In other words, the hash function h must satisfy the property of projecting “similar” items (with a distance lower than the threshold R) into the same bucket with a probability higher than \(P_{1}\), and of having a low probability (lower than \(P_{2}<P_{1}\)) of doing the same for “dissimilar” items (with distance higher than \(cR>R\)).

The hash function used in LSH for Hamming space embedding is a scalar projection:

$$\begin{aligned} h(\mathbf {x})=\mathrm {sign}(\mathbf {x}\cdot \mathbf {v}) \end{aligned}$$

where \(\mathbf {x}\) is the feature vector and \(\mathbf {v}\) is a fixed random vector sampled from a D-dimensional isotropic Gaussian distribution \(\mathcal {N}(0,I)\). The hashing process is repeated L times, with different Gaussian samples, to increase the probability of satisfying the above constraints.
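A minimal sketch of this hashing step, assuming the descriptors are given as rows of a matrix: each of the L tables draws its own Gaussian hyperplanes and packs the resulting sign bits into an integer bucket id. Function and parameter names are illustrative.

```python
import numpy as np

def lsh_hash_codes(X: np.ndarray, n_bits: int, n_tables: int, seed: int = 0) -> np.ndarray:
    """Compute one LSH bucket id per descriptor for each of the n_tables hash tables.

    X: (N, D) matrix of image descriptors.
    Returns an (n_tables, N) array of integer bucket ids in [0, 2**n_bits).
    """
    rng = np.random.default_rng(seed)
    codes = np.empty((n_tables, X.shape[0]), dtype=np.int64)
    for t in range(n_tables):
        # n_bits random hyperplanes drawn from an isotropic Gaussian N(0, I).
        V = rng.standard_normal((X.shape[1], n_bits))
        bits = (X @ V) > 0                                    # h(x) = sign(x . v), one bit per hyperplane
        codes[t] = bits.astype(np.int64) @ (1 << np.arange(n_bits))  # pack sign bits into a bucket id
    return codes
```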

Subsequently, different hashing techniques have been proposed. Multi-probe LSH [13] aims to reduce the number of hash tables used for the projections, exploiting the fundamental principle of LSH that similar items are projected into the same or nearby buckets with high probability. During the search phase, multi-probe LSH also checks the buckets near the query bucket. In the end, this approach improves the final performance, but it increases the computational time.

3.2 Basic Algorithm for kNN Graph Construction

Given a dataset \(\mathcal {S} = \{s_1, \dots , s_N\}\), composed of N images, and a similarity measure \(\theta : \mathcal {S} \times \mathcal {S} \rightarrow \mathbb {R}\), the kNN graph for \(\mathcal {S}\) is an undirected graph G that contains an edge between nodes i and j weighted by the similarity measure \(\theta (s_i, s_j) = \theta (s_j, s_i)\). The similarity measure can be calculated in different ways depending on the application. In this case we use the cosine similarity, so the similarity is calculated through the dot product between the image descriptors.

Our approach, called LSH kNN graph, follows the idea of first subdividing the entire dataset into many subsets, based on the similarity between the images contained in the dataset. This subdivision is done through LSH projections and creates a set of buckets \(B = \{B_1, \dots , B_N\}\) from several hash tables \(L = \{L_1, \dots , L_M\}\). The number of buckets N depends on the number of bits used for the projection (\(\delta \)) and on the number of hash tables (M): \(N = 2^\delta \cdot M\). Each of these buckets contains the projected elements, e.g. \(B_1 = \{b_{11}, \dots , b_{1n}\}\). The result of this process is approximate because it is generally not possible to project every element of the dataset into the same bucket as all of its neighbours. It is therefore necessary to find a trade-off between the number of buckets of each hash table (\(2^\delta \)), controlled by the number of bits (\(\delta \)) used for hashing, and the number of hash tables (M) used for the projection. Using a small number of buckets projects more data into the same bucket: it reduces the computational time of this phase, but increases the time of the subsequent brute-force phase and thus of the overall approach. On the other hand, a high number of buckets in each hash table increases the computational time of this step, but reduces the overall computation.

For each bucket, containing a subset of the dataset, a brute-force subgraph with edges \(G_i = \{ (b_{ix},b_{iy}, \theta (b_{ix},b_{iy})) : (b_{ix},b_{iy}) \in B_i \}\) is constructed. Applying a brute-force construction on many small subsets is faster than applying brute force once on the entire dataset. In the end, all the subgraphs are merged into the final graph \(G = G_1 \cup \dots \cup G_N\).
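Putting the two phases together, a possible sketch of the LSH kNN graph construction is shown below; it reuses the lsh_hash_codes helper sketched earlier and stores each undirected edge once, but it is only an illustration of the strategy, not the paper's optimized C++ implementation.

```python
import numpy as np
from collections import defaultdict

def lsh_knn_graph(X: np.ndarray, n_bits: int = 6, n_tables: int = 20, seed: int = 0) -> dict:
    """Approximate kNN graph: brute-force similarities computed only inside LSH buckets.

    Returns a dict mapping undirected edges (i, j), with i < j, to cosine similarities.
    """
    Xn = X / np.linalg.norm(X, axis=1, keepdims=True)
    codes = lsh_hash_codes(Xn, n_bits, n_tables, seed)   # helper sketched in Sect. 3.1

    edges = {}
    for t in range(n_tables):
        # Group descriptor indices that fall into the same bucket of table t.
        buckets = defaultdict(list)
        for idx, code in enumerate(codes[t]):
            buckets[int(code)].append(idx)
        # Brute-force subgraph inside each bucket, merged directly into the final graph.
        for members in buckets.values():
            for a in range(len(members)):
                for b in range(a + 1, len(members)):
                    i, j = sorted((members[a], members[b]))
                    if (i, j) not in edges:               # duplicates across tables are inserted once
                        edges[(i, j)] = float(Xn[i] @ Xn[j])
    return edges
```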

Unlike the usual LSH-based method [25], the proposed approach does not follow the divide-and-conquer strategy exactly. LSH projections are applied to divide the dataset into subsets, but to reduce the computational time it is preferable to build the final graph directly, instead of creating many approximate kNN graphs and then merging them with the one-step neighbour propagation algorithm. This reduces the number of elements to sort in the kNN lists and the number of similarity scores to calculate, lowering the computational time without degrading the quality of the final graph or the retrieval accuracy.

3.3 Multi-probe LSH

We also propose a multi-probe version, called multi LSH kNN graph, to reduce the number of hash tables used. Unlike the classic multi-probe LSH algorithm [13], in which the system checks neighbouring buckets during the search phase, here the elements are also projected into the neighbouring buckets during the projection phase, but only in the 1-neighbourhood. This is the set of buckets whose codes differ by one bit from the analysed bucket (i.e. with Hamming distance \(H_d \le 1\)). More formally, the elements obtained with the application of the multi-probe LSH are the following:

$$\begin{aligned} B_{\text {multi-probe}} = \{b_{x1}, \dots , b_{xp} : H_d(b_{\text {query}},b_{xj}) \le 1 ,\, b_{xj} \in B,\, 0 \le x \le P \} \end{aligned}$$

Note that the number of neighbours of each bucket scales with the number of bits used for the projection as \(\sum _{i=0}^{l}\binom{\delta }{i}\). Even though it increases the final retrieval performance, this approach requires more time for the kNN graph creation than the previous one. To get a good trade-off between the computational time required for the similarity calculations and the quality of the final graph, only a percentage \(\gamma \) of the elements projected into the 1-neighbour buckets is retained. The best trade-off is reached with \(\gamma = 50\%\), which means that each element is also projected, at random, into half of its 1-neighbour buckets.

During the conquer phase, as in the previously proposed method, all pairs of image indices found in the same bucket are connected by computing their similarity measure.
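A sketch of this projection-time multi-probing for a single hash table is given below: each element is additionally placed into a random fraction \(\gamma\) of the buckets whose codes differ by one bit. The helper name and signature are illustrative.

```python
import numpy as np
from collections import defaultdict

def multi_probe_buckets(codes_t: np.ndarray, n_bits: int, gamma: float = 0.5, seed: int = 0) -> dict:
    """Fill the buckets of one hash table, projecting each element also into a random
    fraction gamma of its 1-neighbour buckets (codes at Hamming distance 1)."""
    rng = np.random.default_rng(seed)
    buckets = defaultdict(list)
    for idx, code in enumerate(codes_t):
        code = int(code)
        buckets[code].append(idx)                      # the element's own bucket
        # Each neighbouring code differs from `code` in exactly one of the n_bits bits.
        flips = rng.choice(n_bits, size=max(1, int(gamma * n_bits)), replace=False)
        for bit in flips:
            buckets[code ^ (1 << int(bit))].append(idx)
    return buckets
```

The conquer phase then runs the same per-bucket brute force as before on these enlarged buckets.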

4 Experimental Results

Previous works have evaluated methods for creating approximate kNN graphs by counting the edges in common between the approximate and the exact kNN graph. In our case, the kNN graph pipelines are evaluated after the diffusion and retrieval modules, in order to evaluate how effective (and efficient) our proposals are for the task in terms of retrieval accuracy when diffusion is applied. The diffusion approach and the R-MAC descriptors adopted are the same as in the work of Iscen et al. [11].

4.1 Datasets

There are many different image datasets for Content-Based Image Retrieval that are used to evaluate algorithms. The most used are:

  • Oxford5k [17], containing 5063 images subdivided into 11 classes. All the images are used as database images and there are 55 query images, which are cropped to make the querying phase more difficult;

  • Paris6k [18], containing 6412 images subdivided into 12 classes. All the images are used as database images and there are 55 query images, again cropped;

  • Flickr1M [8], containing 1 million Flickr images used for large-scale evaluation. The images are divided into multiple classes and are not specifically selected for image retrieval.

The Oxford105k dataset is a combination of Oxford5k and 100K distractors from Flickr1M.

4.2 Evaluation Metrics

Mean Average Precision (mAP) is used on all datasets to evaluate the retrieval accuracy. We use \(L_2\) distances to compare query images with the database ones.

4.3 Sparse Matrices for kNN Graph

It is worth emphasizing that there are many null values in the affinity matrix. In fact, on Oxford5k the approximate kNN graph constructed with the LSH kNN graph method has only 0.7% of the edges of the brute-force graph. Furthermore, not all the similarity measures are useful for the diffusion process, which suggests removing, or avoiding to insert, edges with weight less than a threshold (th), without jeopardizing the final retrieval performance. Hence, each element of the matrix can be represented as follows:

$$\begin{aligned} g_{ij} = \bigg \{ \begin{array}{ll} \theta (s_i, s_j) &{} \text { if } \theta (s_i, s_j) \ge th \\ 0 &{} \text { otherwise} \\ \end{array} \end{aligned}$$

From our experiments, this threshold can be set to 0.3. Given the high number of null values in the affinity matrix, sparse matrices can be used to reduce the computational time while still obtaining good results even on large datasets. Moreover, since the matrix is symmetric, only the upper (or lower) triangle of the matrix needs to be stored:

$$\begin{aligned} g_{ij} = \bigg \{ \begin{array}{ll} \theta (s_i, s_j) &{} \text { if } j \ge i \wedge \theta (s_i, s_j) \ge th \\ 0 &{} \text { otherwise } \\ \end{array} \end{aligned}$$

If a similarity value falls in the triangle that is not stored, the row and column indices are simply swapped before the lookup.
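A minimal sketch of this thresholded, upper-triangular storage (here with SciPy sparse matrices, which the paper does not necessarily use) and of the index swap at lookup time; the function names are illustrative.

```python
from scipy.sparse import coo_matrix

def build_sparse_affinity(edges: dict, n: int, th: float = 0.3):
    """Store only the upper triangle (j >= i) of the affinity matrix,
    dropping edges whose similarity is below the threshold th."""
    rows, cols, vals = [], [], []
    for (i, j), sim in edges.items():        # edges already satisfy i < j
        if sim >= th:
            rows.append(i)
            cols.append(j)
            vals.append(sim)
    return coo_matrix((vals, (rows, cols)), shape=(n, n)).tocsr()

def similarity(A, i: int, j: int) -> float:
    """Read g_ij from the upper-triangular storage; swap indices if needed."""
    if j < i:
        i, j = j, i                          # the matrix is symmetric
    return A[i, j]
```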

4.4 Implementation Details

Two different types of sparse matrix have been tested: the Compressed Row Storage (CRS) format and the Coordinate (COO) format [5]. A CRS sparse matrix is composed of three vectors: the non-zero values of the dense matrix; the column indexes of the elements contained in the values vector; and the positions in the values vector at which a new row begins. A COO sparse matrix is also composed of three vectors: the non-zero values, and the row and column coordinates of each value contained in the values vector. The second solution is simpler to implement than the first, but it requires more space on disk.

However, when using hash tables, the same edge weight may be inserted multiple times. Therefore, every time a new value is inserted into a CRS matrix, checking whether the value is already in the matrix would be a possible solution; unfortunately, this tends to be time-consuming. Conversely, using a COO matrix, all the values (including repeated ones) are inserted, and then the entries are sorted and duplicates are removed.
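A sketch of the COO strategy described above, assuming the triplets have already been collected from all hash tables: sort by (row, column) and keep one entry per pair. The helper name is illustrative.

```python
import numpy as np

def dedup_coo(rows, cols, vals):
    """COO assembly: insert all triplets (including repeats from different hash tables),
    then sort by (row, col) and drop duplicate coordinates."""
    rows = np.asarray(rows)
    cols = np.asarray(cols)
    vals = np.asarray(vals, dtype=np.float32)
    order = np.lexsort((cols, rows))               # sort by row, then by column
    rows, cols, vals = rows[order], cols[order], vals[order]
    # Keep the first occurrence of each (row, col) pair.
    keep = np.ones(len(rows), dtype=bool)
    keep[1:] = (rows[1:] != rows[:-1]) | (cols[1:] != cols[:-1])
    return rows[keep], cols[keep], vals[keep]
```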

4.5 Results on Oxford5k

Table 1 reports the retrieval results after diffusion for different kNN graph construction techniques. Note that changing the LSH parameters (\(\delta \) and L) produces different results. The best configuration is \(\delta = 6\) and \(L = 2\) with the multi LSH kNN graph approach, while the best trade-off between the computational time for kNN graph creation and the final retrieval performance is obtained by the LSH kNN graph with \(\delta = 6\) and \(L = 20\). NN-descent produces reasonable results, but it needs a lot of time for the graph creation (55 s) and does not reach results comparable to the other methods.

Table 1. Comparison of different approaches of kNN graph creation tested on Oxford5k. * indicates that the method is a C++ re-implementation.

RP-div [19] is very fast, but collecting random elements from the dataset for the divide step does not give good retrieval results after diffusion.

The method implemented by Wang et al. [21] obtains a different result at each execution, so the reported performance is the average of ten runs. The approach is very fast, but it did not achieve the best mAP. Note also that the brute-force method is executed on GPU, whereas all the other methods are executed on CPU.

We also performed some experiments with regional descriptors (Table 2). The use of regional descriptors yields an improvement in the final performance due to the high number of descriptors per image (usually 21). In this case the total number of descriptors used for the creation of the kNN graph is approximately 100K. Note that we omit testing RP-div and NN-descent here due to their poor accuracy/computation trade-off in the previous experiment.

Table 2. Comparison of different approaches of kNN graph creation tested on Oxford5k using regional R-MAC descriptors. * indicates that the method is a C++ re-implementation.

4.6 Results on Paris6k

Table 3 shows the results on the Paris6k dataset, which are similar to those obtained on Oxford5k.

Table 3. Comparison of different approaches of kNN graph creation tested on Paris6k. * indicates that the method is a C++ re-implementation.

In this case, however, LSH kNN graph is the fastest approach and also obtains the best retrieval performance after the application of diffusion. The multi LSH kNN method obtains a good result, but requires more time than the brute-force approach.

4.7 Results on Oxford105k

Table 4 reports the results of the experiments executed on Oxford105k; again, RP-div and NN-descent are not tested due to the poor trade-off previously obtained. Increasing the size of the dataset highlights the difference in accuracy and computational time between the proposed approach and brute force. The proposed approaches obtain better results and trade-offs than the other methods. In particular, LSH kNN graph (\(\delta = 6\) and \(L = 20\)) achieves 92.50% mAP in only 77 s for the graph creation process. The multi LSH kNN graph needs more time than the previous approach, but it reaches the best mAP on this dataset, equal to 92.85%.

Table 4. Comparison of different approaches of kNN graph creation tested on Oxford105k. * indicates that the method is a C++ re-implementation.

5 Conclusions

We presented an algorithm called LSH kNN graph for the creation of an approximate kNN graph exploiting LSH projections. First, the elements of the dataset are subdivided into several subsets using an unsupervised hashing function; then, for each subset a subgraph is created by applying the brute-force approach. The application of this algorithm with sparse matrices achieves very good results even on datasets with a large number of images. The proposed methods can generate a kNN graph faster than the brute-force approach and other state-of-the-art approaches, obtaining the same or better accuracy after diffusion. Furthermore, another version of the algorithm, called multi LSH kNN graph, was proposed, which uses multi-probe LSH instead of LSH for the subdivision of the elements into subsets, increasing the quality of the final graph thanks to the greater number of elements found in the buckets of the hash tables. In future work, we plan to distribute these approaches across several machines to allow processing of even larger datasets.