A Visual Model Approach for Parsing Colonoscopy Videos

Yu Cao, Wallapak Tavanapong, Dalei Li, Junghwan Oh, Piet C. De Groen, Johnny Wong

Research output: Chapter in Book/Report/Conference proceedingChapter

9 Scopus citations

Abstract

Colonoscopy is an important screening procedure for colorectal cancer. During this procedure, the endoscopist visually inspects the colon. Currently, there is no content-based analysis and retrieval system that automatically analyzes videos captured from colonoscopic procedures and provides a user-friendly and efficient access to important content. Such a system will be valuable as an educational resource for endoscopic research, a platform to assess procedural skills for endoscopists, and a platform for mining for unknown abnormality patterns that may lead to colorectal cancer. The first necessary step for the analysis is parsing for semantic units. In this paper, we propose a new visual model approach that employs visual features extracted directly from compressed videos together with audio analysis to discover important semantic units called scenes. Our experimental results show average precision and recall of 93% and 85%, respectively.

Original languageEnglish (US)
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
EditorsPeter Enser, Yiannis Kompatsiaris, Noel E. O’Connor, Alan F. Smeaton, Arnold W. M. Smeulders
PublisherSpringer Verlag
Pages160-169
Number of pages10
ISBN (Print)3540225390, 9783540225393
DOIs
StatePublished - 2004

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume3115
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Fingerprint Dive into the research topics of 'A Visual Model Approach for Parsing Colonoscopy Videos'. Together they form a unique fingerprint.

  • Cite this

    Cao, Y., Tavanapong, W., Li, D., Oh, J., De Groen, P. C., & Wong, J. (2004). A Visual Model Approach for Parsing Colonoscopy Videos. In P. Enser, Y. Kompatsiaris, N. E. O’Connor, A. F. Smeaton, & A. W. M. Smeulders (Eds.), Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (pp. 160-169). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 3115). Springer Verlag. https://doi.org/10.1007/978-3-540-27814-6_22