A Visual Model Approach for Parsing Colonoscopy Videos

Yu Cao; Wallapak Tavanapong; Dalei Li; Junghwan Oh; Piet C. De Groen; Johnny Wong

A Visual Model Approach for Parsing Colonoscopy Videos

Yu Cao, Wallapak Tavanapong, Dalei Li, Junghwan Oh, Piet C. De Groen, Johnny Wong

Medicine - Gastro, Hepatology, Nutrition Division

Research output: Contribution to journal › Article › peer-review

11 Scopus citations

Abstract

Colonoscopy is an important screening procedure for colorectal cancer. During this procedure, the endoscopist visually inspects the colon. Currently, there is no content-based analysis and retrieval system that automatically analyzes videos captured from colonoscopic procedures and provides a user-friendly and efficient access to important content. Such a system will be valuable as an educational resource for endoscopic research, a platform to assess procedural skills for endoscopists, and a platform for mining for unknown abnormality patterns that may lead to colorectal cancer. The first necessary step for the analysis is parsing for semantic units. In this paper, we propose a new visual model approach that employs visual features extracted directly from compressed videos together with audio analysis to discover important semantic units called scenes. Our experimental results show average precision and recall of 93% and 85%, respectively.

Original language	English (US)
Pages (from-to)	160-169
Number of pages	10
Journal	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	3115
State	Published - Dec 1 2004

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

OpenUrl availability

Full text

Cite this

@article{4c408c163d374721b7f23d7e880dbfe1,

title = "A Visual Model Approach for Parsing Colonoscopy Videos",

abstract = "Colonoscopy is an important screening procedure for colorectal cancer. During this procedure, the endoscopist visually inspects the colon. Currently, there is no content-based analysis and retrieval system that automatically analyzes videos captured from colonoscopic procedures and provides a user-friendly and efficient access to important content. Such a system will be valuable as an educational resource for endoscopic research, a platform to assess procedural skills for endoscopists, and a platform for mining for unknown abnormality patterns that may lead to colorectal cancer. The first necessary step for the analysis is parsing for semantic units. In this paper, we propose a new visual model approach that employs visual features extracted directly from compressed videos together with audio analysis to discover important semantic units called scenes. Our experimental results show average precision and recall of 93% and 85%, respectively.",

author = "Yu Cao and Wallapak Tavanapong and Dalei Li and Junghwan Oh and {De Groen}, {Piet C.} and Johnny Wong",

year = "2004",

month = dec,

day = "1",

language = "English (US)",

volume = "3115",

pages = "160--169",

journal = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

issn = "0302-9743",

publisher = "Springer Verlag",

}

TY - JOUR

T1 - A Visual Model Approach for Parsing Colonoscopy Videos

AU - Cao, Yu

AU - Tavanapong, Wallapak

AU - Li, Dalei

AU - Oh, Junghwan

AU - De Groen, Piet C.

AU - Wong, Johnny

PY - 2004/12/1

Y1 - 2004/12/1

N2 - Colonoscopy is an important screening procedure for colorectal cancer. During this procedure, the endoscopist visually inspects the colon. Currently, there is no content-based analysis and retrieval system that automatically analyzes videos captured from colonoscopic procedures and provides a user-friendly and efficient access to important content. Such a system will be valuable as an educational resource for endoscopic research, a platform to assess procedural skills for endoscopists, and a platform for mining for unknown abnormality patterns that may lead to colorectal cancer. The first necessary step for the analysis is parsing for semantic units. In this paper, we propose a new visual model approach that employs visual features extracted directly from compressed videos together with audio analysis to discover important semantic units called scenes. Our experimental results show average precision and recall of 93% and 85%, respectively.

AB - Colonoscopy is an important screening procedure for colorectal cancer. During this procedure, the endoscopist visually inspects the colon. Currently, there is no content-based analysis and retrieval system that automatically analyzes videos captured from colonoscopic procedures and provides a user-friendly and efficient access to important content. Such a system will be valuable as an educational resource for endoscopic research, a platform to assess procedural skills for endoscopists, and a platform for mining for unknown abnormality patterns that may lead to colorectal cancer. The first necessary step for the analysis is parsing for semantic units. In this paper, we propose a new visual model approach that employs visual features extracted directly from compressed videos together with audio analysis to discover important semantic units called scenes. Our experimental results show average precision and recall of 93% and 85%, respectively.

UR - http://www.scopus.com/inward/record.url?scp=35048844831&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=35048844831&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:35048844831

SN - 0302-9743

VL - 3115

SP - 160

EP - 169

JO - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

JF - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

ER -

A Visual Model Approach for Parsing Colonoscopy Videos

Abstract

UN SDGs

OpenUrl availability

Other files and links

Fingerprint

Cite this