Retrieval evaluation and distance learning from perceived similarity between endomicroscopy videos

Barbara André; Tom Vercauteren; Anna M. Buchner; Michael B. Wallace; Nicholas Ayache

doi:10.1007/978-3-642-23626-6_37

Gastroenterology and Hepatology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

10 Scopus citations

Abstract

Evaluating content-based retrieval (CBR) is challenging because it requires an adequate ground-truth. When the available ground-truth is limited to textual metadata such as pathological classes, retrieval results can only be evaluated indirectly, for example in terms of classification performance. In this study, we first present a tool to generate perceived similarity ground-truth that enables direct evaluation of endomicroscopic video retrieval. This tool uses a four-point Likert scale and collects subjective pairwise similarities perceived by multiple expert observers. We then evaluate a previously developed dense bag-of-visual-words method for endomicroscopic video retrieval against the generated ground-truth. Confirming the results of previous indirect evaluation based on classification, our direct evaluation shows that this method significantly outperforms several other state-of-the-art CBR methods. In a second step, we propose to improve the CBR method by learning an adjusted similarity metric from the perceived similarity ground-truth. By minimizing a margin-based cost function that differentiates similar and dissimilar video pairs, we learn a weight vector applied to the visual word signatures of videos. Using cross-validation, we demonstrate that the learned similarity distance is significantly better correlated with the perceived similarity than the original visual-word-based distance.
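
The distance-learning step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the distance or the exact cost function, so the weighted squared Euclidean distance, the hinge-style cost with its `threshold` and `margin` parameters, the coding of the Likert scale as 1 to 4 with scores of 3 or more treated as "similar", the projected subgradient optimizer, and the synthetic data are all assumptions made for illustration. The function names (`pairwise_cost`, `learn_weights`, `make_toy_pairs`) are hypothetical.

```python
import numpy as np


def pairwise_cost(w, sq_diff, sign, threshold, margin):
    """Mean hinge cost over pairs for the weighted distance d_w = sq_diff @ w."""
    d = sq_diff @ w
    return float(np.mean(np.maximum(0.0, margin + sign * (d - threshold))))


def learn_weights(sig_a, sig_b, likert, threshold=1.0, margin=0.5,
                  lr=0.05, n_iter=300):
    """Learn non-negative per-visual-word weights by projected subgradient descent.

    sig_a, sig_b: (n_pairs, n_words) signatures of the two videos in each pair.
    likert: (n_pairs,) scores in {1, 2, 3, 4}; >= 3 is treated as "similar"
            (an assumed coding, not taken from the paper).
    """
    sq_diff = (sig_a - sig_b) ** 2
    sign = np.where(likert >= 3, 1.0, -1.0)  # +1 similar pair, -1 dissimilar pair
    w = np.ones(sq_diff.shape[1])            # start from the unweighted distance
    best_w = w.copy()
    best_cost = pairwise_cost(w, sq_diff, sign, threshold, margin)
    for _ in range(n_iter):
        d = sq_diff @ w
        active = margin + sign * (d - threshold) > 0   # pairs violating the margin
        grad = (sign[active][:, None] * sq_diff[active]).sum(axis=0) / len(sign)
        w = np.maximum(0.0, w - lr * grad)             # project onto w >= 0
        cost = pairwise_cost(w, sq_diff, sign, threshold, margin)
        if cost < best_cost:                           # keep the best iterate seen
            best_w, best_cost = w.copy(), cost
    return best_w


def make_toy_pairs(seed=0, n_pairs=200, n_words=8):
    """Synthetic pairs: only the first two 'visual words' drive the Likert score."""
    rng = np.random.default_rng(seed)
    sig_a = rng.random((n_pairs, n_words))
    sig_b = rng.random((n_pairs, n_words))
    informative = ((sig_a[:, :2] - sig_b[:, :2]) ** 2).sum(axis=1)
    # Map the informative distance onto a four-point scale (4 = very similar).
    cuts = np.quantile(informative, [0.25, 0.5, 0.75])
    likert = 4 - np.digitize(informative, cuts)
    return sig_a, sig_b, likert


sig_a, sig_b, likert = make_toy_pairs()
w = learn_weights(sig_a, sig_b, likert)
sq_diff = (sig_a - sig_b) ** 2
# Correlation between (negated) distance and perceived similarity, before and after.
corr_before = np.corrcoef(-sq_diff.sum(axis=1), likert)[0, 1]
corr_after = np.corrcoef(-(sq_diff @ w), likert)[0, 1]
print(f"correlation before: {corr_before:.3f}, after: {corr_after:.3f}")
```

The sketch measures correlation on the same pairs used for training, which is optimistic. The abstract reports results under cross-validation, so a faithful comparison would learn the weights on one subset of pairs and compute the correlation on held-out pairs.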

Original language	English (US)
Title of host publication	Medical Image Computing and Computer-Assisted Intervention, MICCAI 2011 - 14th International Conference, Proceedings
Pages	297-304
Number of pages	8
Edition	PART 3
DOIs	https://doi.org/10.1007/978-3-642-23626-6_37
State	Published - 2011
Event	14th International Conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2011 - Toronto, ON, Canada Duration: Sep 18 2011 → Sep 22 2011

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Number	PART 3
Volume	6893 LNCS
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Other

Other	14th International Conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2011
Country/Territory	Canada
City	Toronto, ON
Period	9/18/11 → 9/22/11

ASJC Scopus subject areas

Theoretical Computer Science
General Computer Science

Cite this

André, B., Vercauteren, T., Buchner, A. M., Wallace, M. B., & Ayache, N. (2011). Retrieval evaluation and distance learning from perceived similarity between endomicroscopy videos. In Medical Image Computing and Computer-Assisted Intervention, MICCAI 2011 - 14th International Conference, Proceedings (PART 3 ed., pp. 297-304). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 6893 LNCS, No. PART 3). https://doi.org/10.1007/978-3-642-23626-6_37

@inproceedings{4ea7bde281a24bd297d14dd23375e204,

title = "Retrieval evaluation and distance learning from perceived similarity between endomicroscopy videos",

author = "Barbara Andr{\'e} and Tom Vercauteren and Buchner, {Anna M.} and Wallace, {Michael B.} and Nicholas Ayache",

year = "2011",

doi = "10.1007/978-3-642-23626-6_37",

language = "English (US)",

isbn = "9783642236259",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

number = "PART 3",

pages = "297--304",

booktitle = "Medical Image Computing and Computer-Assisted Intervention, MICCAI 2011 - 14th International Conference, Proceedings",

edition = "PART 3",

note = "14th International Conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2011 ; Conference date: 18-09-2011 Through 22-09-2011",

}

TY - GEN

T1 - Retrieval evaluation and distance learning from perceived similarity between endomicroscopy videos

AU - André, Barbara

AU - Vercauteren, Tom

AU - Buchner, Anna M.

AU - Wallace, Michael B.

AU - Ayache, Nicholas

PY - 2011

Y1 - 2011

UR - http://www.scopus.com/inward/record.url?scp=82255164550&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=82255164550&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-23626-6_37

DO - 10.1007/978-3-642-23626-6_37

M3 - Conference contribution

C2 - 22003712

AN - SCOPUS:82255164550

SN - 9783642236259

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 297

EP - 304

BT - Medical Image Computing and Computer-Assisted Intervention, MICCAI 2011 - 14th International Conference, Proceedings

T2 - 14th International Conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2011

Y2 - 18 September 2011 through 22 September 2011

ER -
