Parsing and browsing tools for colonoscopy videos

Yu Cao, Dalei Li, Wallapak Tavanapong, Junghwan Oh, Johnny Wong, Piet C. De Groen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

20 Citations (Scopus)

Abstract

Colonoscopy is an important screening tool for colorectal cancer. During a colonoscopic procedure, a tiny video camera at the tip of the endoscope generates a video signal of the internal mucosa of the colon. The video data are displayed on a monitor for real-time analysis by the endoscopist. We call videos captured from colonoscopic procedures colonoscopy videos. Because these videos possess unique characteristics, new types of semantic units and parsing techniques are required. In this paper, we define new semantic units called operation shots, each is a segment of visual and audio data that correspond to a therapeutic or biopsy operation. We introduce a new spatio-temporal analysis technique to detect operation shots. Our experiments on colonoscopy videos demonstrate that the technique does not miss any meaningful operation shots and incurs a small number of false operation shots. Our prototype parsing software implements the operation shot detection technique along with our other techniques previously developed for colonoscopy videos. Our browsing tool enables users to quickly locate operation shots of interest. The proposed technique and software are useful (1) for post-procedure reviews and analyses for causes of complications due to biopsy or therapeutic operations, (2) for developing an effective content-based retrieval system for colonoscopy videos to facilitate endoscopic research and education, and (3) for development of a systematic approach to assess endoscopists' procedural skills.

Original languageEnglish (US)
Title of host publicationACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia
Pages844-851
Number of pages8
StatePublished - 2004
EventACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia - New York, NY, United States
Duration: Oct 10 2004Oct 16 2004

Other

OtherACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia
CountryUnited States
CityNew York, NY
Period10/10/0410/16/04

Fingerprint

Biopsy
Semantics
Content based retrieval
Endoscopy
Video cameras
Screening
Education
Experiments
Mucous Membrane

Keywords

  • Browsing
  • Image and audio analysis
  • Video segmentation

ASJC Scopus subject areas

  • Engineering(all)

Cite this

Cao, Y., Li, D., Tavanapong, W., Oh, J., Wong, J., & De Groen, P. C. (2004). Parsing and browsing tools for colonoscopy videos. In ACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia (pp. 844-851)

Parsing and browsing tools for colonoscopy videos. / Cao, Yu; Li, Dalei; Tavanapong, Wallapak; Oh, Junghwan; Wong, Johnny; De Groen, Piet C.

ACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia. 2004. p. 844-851.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Cao, Y, Li, D, Tavanapong, W, Oh, J, Wong, J & De Groen, PC 2004, Parsing and browsing tools for colonoscopy videos. in ACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia. pp. 844-851, ACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia, New York, NY, United States, 10/10/04.
Cao Y, Li D, Tavanapong W, Oh J, Wong J, De Groen PC. Parsing and browsing tools for colonoscopy videos. In ACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia. 2004. p. 844-851
Cao, Yu ; Li, Dalei ; Tavanapong, Wallapak ; Oh, Junghwan ; Wong, Johnny ; De Groen, Piet C. / Parsing and browsing tools for colonoscopy videos. ACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia. 2004. pp. 844-851
@inproceedings{1e1469f0adac4b4fa2cd0de0f88572d5,
title = "Parsing and browsing tools for colonoscopy videos",
abstract = "Colonoscopy is an important screening tool for colorectal cancer. During a colonoscopic procedure, a tiny video camera at the tip of the endoscope generates a video signal of the internal mucosa of the colon. The video data are displayed on a monitor for real-time analysis by the endoscopist. We call videos captured from colonoscopic procedures colonoscopy videos. Because these videos possess unique characteristics, new types of semantic units and parsing techniques are required. In this paper, we define new semantic units called operation shots, each is a segment of visual and audio data that correspond to a therapeutic or biopsy operation. We introduce a new spatio-temporal analysis technique to detect operation shots. Our experiments on colonoscopy videos demonstrate that the technique does not miss any meaningful operation shots and incurs a small number of false operation shots. Our prototype parsing software implements the operation shot detection technique along with our other techniques previously developed for colonoscopy videos. Our browsing tool enables users to quickly locate operation shots of interest. The proposed technique and software are useful (1) for post-procedure reviews and analyses for causes of complications due to biopsy or therapeutic operations, (2) for developing an effective content-based retrieval system for colonoscopy videos to facilitate endoscopic research and education, and (3) for development of a systematic approach to assess endoscopists' procedural skills.",
keywords = "Browsing, Image and audio analysis, Video segmentation",
author = "Yu Cao and Dalei Li and Wallapak Tavanapong and Junghwan Oh and Johnny Wong and {De Groen}, {Piet C.}",
year = "2004",
language = "English (US)",
isbn = "1581138938",
pages = "844--851",
booktitle = "ACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia",

}

TY - GEN

T1 - Parsing and browsing tools for colonoscopy videos

AU - Cao, Yu

AU - Li, Dalei

AU - Tavanapong, Wallapak

AU - Oh, Junghwan

AU - Wong, Johnny

AU - De Groen, Piet C.

PY - 2004

Y1 - 2004

N2 - Colonoscopy is an important screening tool for colorectal cancer. During a colonoscopic procedure, a tiny video camera at the tip of the endoscope generates a video signal of the internal mucosa of the colon. The video data are displayed on a monitor for real-time analysis by the endoscopist. We call videos captured from colonoscopic procedures colonoscopy videos. Because these videos possess unique characteristics, new types of semantic units and parsing techniques are required. In this paper, we define new semantic units called operation shots, each is a segment of visual and audio data that correspond to a therapeutic or biopsy operation. We introduce a new spatio-temporal analysis technique to detect operation shots. Our experiments on colonoscopy videos demonstrate that the technique does not miss any meaningful operation shots and incurs a small number of false operation shots. Our prototype parsing software implements the operation shot detection technique along with our other techniques previously developed for colonoscopy videos. Our browsing tool enables users to quickly locate operation shots of interest. The proposed technique and software are useful (1) for post-procedure reviews and analyses for causes of complications due to biopsy or therapeutic operations, (2) for developing an effective content-based retrieval system for colonoscopy videos to facilitate endoscopic research and education, and (3) for development of a systematic approach to assess endoscopists' procedural skills.

AB - Colonoscopy is an important screening tool for colorectal cancer. During a colonoscopic procedure, a tiny video camera at the tip of the endoscope generates a video signal of the internal mucosa of the colon. The video data are displayed on a monitor for real-time analysis by the endoscopist. We call videos captured from colonoscopic procedures colonoscopy videos. Because these videos possess unique characteristics, new types of semantic units and parsing techniques are required. In this paper, we define new semantic units called operation shots, each is a segment of visual and audio data that correspond to a therapeutic or biopsy operation. We introduce a new spatio-temporal analysis technique to detect operation shots. Our experiments on colonoscopy videos demonstrate that the technique does not miss any meaningful operation shots and incurs a small number of false operation shots. Our prototype parsing software implements the operation shot detection technique along with our other techniques previously developed for colonoscopy videos. Our browsing tool enables users to quickly locate operation shots of interest. The proposed technique and software are useful (1) for post-procedure reviews and analyses for causes of complications due to biopsy or therapeutic operations, (2) for developing an effective content-based retrieval system for colonoscopy videos to facilitate endoscopic research and education, and (3) for development of a systematic approach to assess endoscopists' procedural skills.

KW - Browsing

KW - Image and audio analysis

KW - Video segmentation

UR - http://www.scopus.com/inward/record.url?scp=13444310520&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=13444310520&partnerID=8YFLogxK

M3 - Conference contribution

SN - 1581138938

SN - 9781581138931

SP - 844

EP - 851

BT - ACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia

ER -