skip to main content
research-article

Automatic weight selection for multi-metric distances

Published: 30 June 2011 Publication History

Abstract

Content-Based Multimedia Information Retrieval retrieves multimedia documents based on their content (colors, edges, textures, etc.). The content of a whole multimedia document is represented by a global descriptor. The similarity of two multimedia documents can be defined as the distance between their descriptors. A multi-metric function that combines distances from many descriptors usually outperforms the effectiveness of any single descriptor. In this case, a different weight is assigned to each descriptor representing its relative importance in the combination. Usually, these sets of weights are fixed manually or by performing many effectiveness evaluations. In this work, we present three novel techniques for weighting multi-metrics: á-normalization, which is a generalization of the normalization by maximum distance that uses the histogram of distances, MID-weighting which selects weights that maximize intrinsic dimensionality, and MID-á-weighting that combines the two previous techniques. These techniques enable the selection of a set of weights with satisfactory effectiveness without performing any effectiveness evaluation. Thus, they are suitable when a ground truth does not exist or when it is expensive to perform an evaluation. We tested their effectiveness on a content-based copy detection corpus, and we analyzed the behavior of effectiveness and efficiency in a multi-metric space. We conclude that MID-á-weighting outperforms the widely used maximum distance normalization, and that it can be used as an automatic weight selection for further manual adjustment.

References

[1]
M. Batko, P. Kohoutkova, and D. Novak. Cophir image collection under the microscope. In Proc. of the intl. workshop on Similarity Search and Applications (SISAP), pages 47--54. IEEE, 2009.
[2]
H. Bay, A. Ess, T. Tuytelaars, and L. V. Gool. Speeded-up robust features (SURF). Computer Vision and Image Understanding, 110(3):346--359, 2008.
[3]
S. Brin. Near neighbor search in large metric spaces. In Proc. of the int. conf. on Very Large Databases (VLDB'95)., pages 574--584. Morgan Kauffman, 1995.
[4]
B. Bustos and T. Skopal. Dynamic similarity search in multi-metric spaces. In Proc. of the int. workshop on Multimedia Information Retrieval (MIR), pages 137--146. ACM, 2006.
[5]
E. Chávez, G. Navarro, R. Baeza-Yates, and J. L. Marroquín. Searching in metric spaces. ACM Computing Surveys, 33(3):273--321, 2001.
[6]
P. Ciaccia, M. Patella, and P. Zezula. M-tree: An efficient access method for similarity search in metric spaces. In Proc. of the int. conf. on Very Large Databases (VLDB'97)., pages 426--435. Morgan Kauffman, 1997.
[7]
T. Deselaers, T. Weyand, and H. Ney. Image retrieval and annotation using maximum entropy. In CLEF Workshop 2006, volume 4730 of LNCS, pages 725--734. Springer, 2007.
[8]
M. Douze, A. Gaidon, H. Jegou, M. Marszalek, and C. Schmid. Inria Lear's video copy detection system. In TRECVID. NIST, 2008.
[9]
V. Gupta, G. Boulianne, and P. Cardinal. CRIM's content-based audio copy detection system for TRECVID 2009. In Proc. of the int. workshop on Content-Based Multimedia Indexing (CBMI). IEEE, 2010.
[10]
A. Hampapur and R. Bolle. Comparison of distance measures for video copy detection. In Proc. of the IEEE int. conf. on Multimedia and Expo (ICME), pages 737--740. IEEE, 2001.
[11]
A. Joly, O. Buisson, and C. Frélicot. Content-based copy retrieval using distortion-based probabilistic similarity search. IEEE Transactions on Multimedia, 9(2):293--306, 2007.
[12]
C. Kim and B. Vasudev. Spatiotemporal sequence matching for efficient video copy detection. IEEE Transactions on Circuits and Systems for Video Technology, 15(1):127--132, 2005.
[13]
J. Law-To, O. Buisson, V. Gouet-Brunet, and N. Boujemaa. Robust voting algorithm based on labels of behavior for video copy detection. In Proc. of the int. conf. on Multimedia (MM), pages 835--844. ACM, 2006.
[14]
J. Law-To, A. Joly, and N. Boujemaa. MUSCLE-VCD-2007: a live benchmark for video copy detection, 2007. http://www-rocq.inria.fr/imedia/civr-bench/.
[15]
M. Lew, N. Sebe, C. Djeraba, and R. Jain. Content-based multimedia information retrieval: State of the art and challenges. ACM Transactions on Multimedia Computing, Communications and Applications, 2(1):1--19, 2006.
[16]
D. Lowe. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 60(2):91--110, 2004.
[17]
B. S. Manjunath, J.-R. Ohm, V. V. Vasudevan, and A. Yamada. Color and texture descriptors. IEEE Transactions on Circuits and Systems for Video Technology, 11(6):703--715, 2001.
[18]
J. Sivic and A. Zisserman. Video Google: A text retrieval approach to object matching in videos. In Proc. of the IEEE int. conf. on Computer Vision (ICCV), volume 2, pages 1470--1477. IEEE, 2003.
[19]
T. Skopal. Unified framework for fast exact and approximate search in dissimilarity spaces. ACM Transactions on Database Systems, 32(4):29--47, 2007.
[20]
A. W. M. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain. Content-based image retrieval at the end of the early years. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(12):1349--1380, 2000.
[21]
P. Zezula, G. Amato, V. Dohnal, and M. Batko. Similarity Search: The Metric Space Approach (Advances in Database Systems). Springer, 2005.

Cited By

View all
  • (2017)Comparative analysis of shape descriptors for 3D objectsMultimedia Tools and Applications10.1007/s11042-016-3330-576:5(6993-7040)Online publication date: 1-Mar-2017
  • (2017)Fusion Strategies for Large-Scale Multi-modal Image RetrievalTransactions on Large-Scale Data- and Knowledge-Centered Systems XXXIII10.1007/978-3-662-55696-2_5(146-184)Online publication date: 8-Aug-2017
  • (2015)Speeding up the combination of multiple descriptors for different boundary conditions2015 Latin American Computing Conference (CLEI)10.1109/CLEI.2015.7359983(1-11)Online publication date: Oct-2015
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
SISAP '11: Proceedings of the Fourth International Conference on SImilarity Search and APplications
June 2011
120 pages
ISBN:9781450307956
DOI:10.1145/1995412
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • J. T. Schwartz: J. T. Schwartz International School for Scientific Research - University of Catania
  • University of Catania

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 30 June 2011

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. content-based copy detection
  2. metric spaces
  3. multi-metric spaces
  4. multimedia information retrieval

Qualifiers

  • Research-article

Conference

SISAP '11
Sponsor:
  • J. T. Schwartz

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 23 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2017)Comparative analysis of shape descriptors for 3D objectsMultimedia Tools and Applications10.1007/s11042-016-3330-576:5(6993-7040)Online publication date: 1-Mar-2017
  • (2017)Fusion Strategies for Large-Scale Multi-modal Image RetrievalTransactions on Large-Scale Data- and Knowledge-Centered Systems XXXIII10.1007/978-3-662-55696-2_5(146-184)Online publication date: 8-Aug-2017
  • (2015)Speeding up the combination of multiple descriptors for different boundary conditions2015 Latin American Computing Conference (CLEI)10.1109/CLEI.2015.7359983(1-11)Online publication date: Oct-2015
  • (2014)Multi-measure Similarity Searching for Time SeriesJournal of Computers10.4304/jcp.9.10.2266-22739:10Online publication date: 30-Oct-2014
  • (2012)Visual image searchProceedings of the 5th international conference on Similarity Search and Applications10.1007/978-3-642-32153-5_13(177-191)Online publication date: 9-Aug-2012

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media