A Visual Model Approach for Parsing Colonoscopy Videos

Yu Cao; Wallapak Tavanapong; Dalei Li; Junghwan Oh; Piet C. De Groen; Johnny Wong

doi:10.1007/978-3-540-27814-6_22

A Visual Model Approach for Parsing Colonoscopy Videos

Yu Cao, Wallapak Tavanapong, Dalei Li, Junghwan Oh, Piet C. De Groen, Johnny Wong

Research output: Chapter in Book/Report/Conference proceeding › Chapter

9 Scopus citations

Abstract

Colonoscopy is an important screening procedure for colorectal cancer. During this procedure, the endoscopist visually inspects the colon. Currently, there is no content-based analysis and retrieval system that automatically analyzes videos captured from colonoscopic procedures and provides a user-friendly and efficient access to important content. Such a system will be valuable as an educational resource for endoscopic research, a platform to assess procedural skills for endoscopists, and a platform for mining for unknown abnormality patterns that may lead to colorectal cancer. The first necessary step for the analysis is parsing for semantic units. In this paper, we propose a new visual model approach that employs visual features extracted directly from compressed videos together with audio analysis to discover important semantic units called scenes. Our experimental results show average precision and recall of 93% and 85%, respectively.

Original language	English (US)
Title of host publication	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Editors	Peter Enser, Yiannis Kompatsiaris, Noel E. O’Connor, Alan F. Smeaton, Arnold W. M. Smeulders
Publisher	Springer Verlag
Pages	160-169
Number of pages	10
ISBN (Print)	3540225390, 9783540225393
DOIs	https://doi.org/10.1007/978-3-540-27814-6_22
State	Published - 2004

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	3115
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

ASJC Scopus subject areas

Theoretical Computer Science
General Computer Science

Access to Document

10.1007/978-3-540-27814-6_22

Cite this

Cao, Y., Tavanapong, W., Li, D., Oh, J., De Groen, P. C., & Wong, J. (2004). A Visual Model Approach for Parsing Colonoscopy Videos. In P. Enser, Y. Kompatsiaris, N. E. O’Connor, A. F. Smeaton, & A. W. M. Smeulders (Eds.), Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (pp. 160-169). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 3115). Springer Verlag. https://doi.org/10.1007/978-3-540-27814-6_22

A Visual Model Approach for Parsing Colonoscopy Videos. / Cao, Yu; Tavanapong, Wallapak; Li, Dalei et al.
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). ed. / Peter Enser; Yiannis Kompatsiaris; Noel E. O’Connor; Alan F. Smeaton; Arnold W. M. Smeulders. Springer Verlag, 2004. p. 160-169 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 3115).

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Cao, Y, Tavanapong, W, Li, D, Oh, J, De Groen, PC & Wong, J 2004, A Visual Model Approach for Parsing Colonoscopy Videos. in P Enser, Y Kompatsiaris, NE O’Connor, AF Smeaton & AWM Smeulders (eds), Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 3115, Springer Verlag, pp. 160-169. https://doi.org/10.1007/978-3-540-27814-6_22

Cao Y, Tavanapong W, Li D, Oh J, De Groen PC, Wong J. A Visual Model Approach for Parsing Colonoscopy Videos. In Enser P, Kompatsiaris Y, O’Connor NE, Smeaton AF, Smeulders AWM, editors, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Springer Verlag. 2004. p. 160-169. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-540-27814-6_22

Cao, Yu ; Tavanapong, Wallapak ; Li, Dalei et al. / A Visual Model Approach for Parsing Colonoscopy Videos. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). editor / Peter Enser ; Yiannis Kompatsiaris ; Noel E. O’Connor ; Alan F. Smeaton ; Arnold W. M. Smeulders. Springer Verlag, 2004. pp. 160-169 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inbook{f67f9333cefa404a8ae9fa3e2a062ef4,

title = "A Visual Model Approach for Parsing Colonoscopy Videos",

abstract = "Colonoscopy is an important screening procedure for colorectal cancer. During this procedure, the endoscopist visually inspects the colon. Currently, there is no content-based analysis and retrieval system that automatically analyzes videos captured from colonoscopic procedures and provides a user-friendly and efficient access to important content. Such a system will be valuable as an educational resource for endoscopic research, a platform to assess procedural skills for endoscopists, and a platform for mining for unknown abnormality patterns that may lead to colorectal cancer. The first necessary step for the analysis is parsing for semantic units. In this paper, we propose a new visual model approach that employs visual features extracted directly from compressed videos together with audio analysis to discover important semantic units called scenes. Our experimental results show average precision and recall of 93% and 85%, respectively.",

author = "Yu Cao and Wallapak Tavanapong and Dalei Li and Junghwan Oh and {De Groen}, {Piet C.} and Johnny Wong",

year = "2004",

doi = "10.1007/978-3-540-27814-6_22",

language = "English (US)",

isbn = "3540225390",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer Verlag",

pages = "160--169",

editor = "Peter Enser and Yiannis Kompatsiaris and O{\textquoteright}Connor, {Noel E.} and Smeaton, {Alan F.} and Smeulders, {Arnold W. M.}",

booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

}

TY - CHAP

T1 - A Visual Model Approach for Parsing Colonoscopy Videos

AU - Cao, Yu

AU - Tavanapong, Wallapak

AU - Li, Dalei

AU - Oh, Junghwan

AU - De Groen, Piet C.

AU - Wong, Johnny

PY - 2004

Y1 - 2004

N2 - Colonoscopy is an important screening procedure for colorectal cancer. During this procedure, the endoscopist visually inspects the colon. Currently, there is no content-based analysis and retrieval system that automatically analyzes videos captured from colonoscopic procedures and provides a user-friendly and efficient access to important content. Such a system will be valuable as an educational resource for endoscopic research, a platform to assess procedural skills for endoscopists, and a platform for mining for unknown abnormality patterns that may lead to colorectal cancer. The first necessary step for the analysis is parsing for semantic units. In this paper, we propose a new visual model approach that employs visual features extracted directly from compressed videos together with audio analysis to discover important semantic units called scenes. Our experimental results show average precision and recall of 93% and 85%, respectively.

AB - Colonoscopy is an important screening procedure for colorectal cancer. During this procedure, the endoscopist visually inspects the colon. Currently, there is no content-based analysis and retrieval system that automatically analyzes videos captured from colonoscopic procedures and provides a user-friendly and efficient access to important content. Such a system will be valuable as an educational resource for endoscopic research, a platform to assess procedural skills for endoscopists, and a platform for mining for unknown abnormality patterns that may lead to colorectal cancer. The first necessary step for the analysis is parsing for semantic units. In this paper, we propose a new visual model approach that employs visual features extracted directly from compressed videos together with audio analysis to discover important semantic units called scenes. Our experimental results show average precision and recall of 93% and 85%, respectively.

UR - http://www.scopus.com/inward/record.url?scp=35048844831&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=35048844831&partnerID=8YFLogxK

U2 - 10.1007/978-3-540-27814-6_22

DO - 10.1007/978-3-540-27814-6_22

M3 - Chapter

AN - SCOPUS:35048844831

SN - 3540225390

SN - 9783540225393

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 160

EP - 169

BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

A2 - Enser, Peter

A2 - Kompatsiaris, Yiannis

A2 - O’Connor, Noel E.

A2 - Smeaton, Alan F.

A2 - Smeulders, Arnold W. M.

PB - Springer Verlag

ER -

A Visual Model Approach for Parsing Colonoscopy Videos

Abstract

Publication series

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this