Parsing and browsing tools for colonoscopy videos

Yu Cao, Dalei Li, Wallapak Tavanapong, Junghwan Oh, Johnny Wong, Piet C. De Groen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

20 Scopus citations

Abstract

Colonoscopy is an important screening tool for colorectal cancer. During a colonoscopic procedure, a tiny video camera at the tip of the endoscope generates a video signal of the internal mucosa of the colon. The video data are displayed on a monitor for real-time analysis by the endoscopist. We call videos captured from colonoscopic procedures colonoscopy videos. Because these videos possess unique characteristics, new types of semantic units and parsing techniques are required. In this paper, we define new semantic units called operation shots, each is a segment of visual and audio data that correspond to a therapeutic or biopsy operation. We introduce a new spatio-temporal analysis technique to detect operation shots. Our experiments on colonoscopy videos demonstrate that the technique does not miss any meaningful operation shots and incurs a small number of false operation shots. Our prototype parsing software implements the operation shot detection technique along with our other techniques previously developed for colonoscopy videos. Our browsing tool enables users to quickly locate operation shots of interest. The proposed technique and software are useful (1) for post-procedure reviews and analyses for causes of complications due to biopsy or therapeutic operations, (2) for developing an effective content-based retrieval system for colonoscopy videos to facilitate endoscopic research and education, and (3) for development of a systematic approach to assess endoscopists' procedural skills.

Original languageEnglish (US)
Title of host publicationACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia
PublisherAssociation for Computing Machinery (ACM)
Pages844-851
Number of pages8
ISBN (Print)1581138938, 9781581138931
DOIs
StatePublished - Jan 1 2004
EventACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia - New York, NY, United States
Duration: Oct 10 2004Oct 16 2004

Publication series

NameACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia

Other

OtherACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia
CountryUnited States
CityNew York, NY
Period10/10/0410/16/04

Keywords

  • Browsing
  • Image and audio analysis
  • Video segmentation

ASJC Scopus subject areas

  • Engineering(all)

Fingerprint Dive into the research topics of 'Parsing and browsing tools for colonoscopy videos'. Together they form a unique fingerprint.

  • Cite this

    Cao, Y., Li, D., Tavanapong, W., Oh, J., Wong, J., & De Groen, P. C. (2004). Parsing and browsing tools for colonoscopy videos. In ACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia (pp. 844-851). (ACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia). Association for Computing Machinery (ACM). https://doi.org/10.1145/1027527.1027723