SUN Colonoscopy Video Database
Update
- 2022 10/08 : Added supplementary data: Accuracy of the latest approved model (July 2022).
- 2022 07/11 : Revised notations of polyp location in Table 2.
- 2020 10/07 : Released SUN Colonoscopy Video Database.
Abstract
The SUN (Showa University and Nagoya University) Colonoscopy Video Database is a colonoscopy video database for evaluating automated colorectal polyp detection.
The database comprises still images extracted from videos collected at the Showa University Northern Yokohama Hospital. It was developed by Mori Laboratory, Graduate School of Informatics, Nagoya University.
Every frame in the database was annotated by expert endoscopists at Showa University.
Summary of database
The SUN database includes 49,136 polyp frames taken from 100 different polyps, all fully annotated with bounding boxes.
The database also includes 109,554 frames of non-polyp scenes. The characteristics of the database are summarized in Tables 1-3.
In frames containing a polyp, each polyp is annotated with a bounding box, as shown in Figure 1.
Images are provided as JPEG files and bounding boxes as a text file.
In the text file, each row describes one bounding box of a polyp in the form
"Filename min_Xcoordinate,min_Ycoordinate,max_Xcoordinate,max_Ycoordinate,class_id",
where a class_id of 0 denotes a polyp frame and a class_id of 1 denotes a non-polyp frame.
Here are some examples:
polyp1_00001.jpg 50,100,150,200,0
polyp1_00002.jpg 120,300,250,600,0
...
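The row format above can be read with a few lines of Python. The following is a minimal sketch, not part of the official distribution: the names BoxAnnotation, parse_annotation_line, and box_size are hypothetical, and it assumes that coordinates are integer pixel values and that each row holds exactly one box.

```python
from typing import NamedTuple, Tuple


class BoxAnnotation(NamedTuple):
    """One row of the annotation text file (hypothetical container)."""
    filename: str
    min_x: int
    min_y: int
    max_x: int
    max_y: int
    class_id: int  # 0 = polyp frame, 1 = non-polyp frame


def parse_annotation_line(line: str) -> BoxAnnotation:
    """Parse 'Filename min_X,min_Y,max_X,max_Y,class_id' into a BoxAnnotation."""
    # Split on the last run of whitespace so the filename stays intact.
    filename, fields = line.strip().rsplit(None, 1)
    # Assumption: all five comma-separated fields are integers.
    min_x, min_y, max_x, max_y, class_id = (int(v) for v in fields.split(","))
    if min_x > max_x or min_y > max_y:
        raise ValueError(f"inverted bounding box in line: {line!r}")
    return BoxAnnotation(filename, min_x, min_y, max_x, max_y, class_id)


def box_size(ann: BoxAnnotation) -> Tuple[int, int]:
    """Return (width, height) of the bounding box in pixels."""
    return ann.max_x - ann.min_x, ann.max_y - ann.min_y


# The two example rows given in the text.
sample_rows = [
    "polyp1_00001.jpg 50,100,150,200,0",
    "polyp1_00002.jpg 120,300,250,600,0",
]
annotations = [parse_annotation_line(row) for row in sample_rows]
```

If the distributed files use non-integer coordinates or several boxes per row, the field parsing would need to be adapted accordingly.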
Figure 1: Examples of annotated images. The bounding-box information is provided in a text file alongside the image files.
Table 1: Characteristics of the SUN database.
Patients registered in the SUN database (n = 99)
Sex (male / female) | 71 / 28
Median age (IQR) | 69 (58 – 74)
Lesions registered in the SUN database (n = 100)
Median size (IQR), mm | 5 (3 – 7)
Number of diminutive polyps (≤5mm) | 60
Morphology (protruded / flat) | 66 / 34
Location (right / left / rectum) | 47 / 44 / 8
Pathological diagnosis
Hyperplastic polyp | 7
Sessile serrated lesion | 4
Low-grade adenoma | 82
Traditional serrated adenoma | 2
High-grade adenoma | 4
Invasive carcinoma | 1
Table 2: Breakdown of polyp samples of SUN database.
ID | Number of frames | Shape | Size | Location | Pathological diagnosis
1 | 527 | Is | 6mm | Cecum | Low-grade adenoma
2 | 1,313 | Is | 18mm | Rectum | High-grade adenoma
3 | 292 | IIa | 3mm | Ascending colon | Low-grade adenoma
4 | 80 | Is | 4mm | Sigmoid colon | Low-grade adenoma
5 | 930 | IIa | 3mm | Transverse colon | Low-grade adenoma
6 | 491 | IIa | 3mm | Sigmoid colon | Low-grade adenoma
7 | 315 | IIa | 6mm | Descending colon | Low-grade adenoma
8 | 256 | Isp | 12mm | Sigmoid colon | Low-grade adenoma
9 | 136 | Is | 4mm | Sigmoid colon | Low-grade adenoma
10 | 436 | IIa | 3mm | Transverse colon | Low-grade adenoma
11 | 113 | IIa | 5mm | Descending colon | Low-grade adenoma
12 | 538 | Is | 5mm | Rectum | Low-grade adenoma
13 | 479 | Is | 5mm | Transverse colon | Low-grade adenoma
14 | 1,183 | IIa | 3mm | Sigmoid colon | Low-grade adenoma
15 | 487 | Is | 5mm | Transverse colon | Low-grade adenoma
16 | 199 | Is | 4mm | Transverse colon | Low-grade adenoma
17 | 304 | Is | 4mm | Sigmoid colon | Low-grade adenoma
18 | 243 | Is | 2mm | Sigmoid colon | Hyperplastic polyp
19 | 96 | IIa | 3mm | Transverse colon | Low-grade adenoma
20 | 3,159 | IIa | 3mm | Ascending colon | Low-grade adenoma
21 | 100 | IIa | 3mm | Sigmoid colon | Low-grade adenoma
22 | 314 | IIa | 2mm | Ascending colon | Low-grade adenoma
23 | 182 | Ip | 12mm | Ascending colon | Low-grade adenoma
24 | 973 | Ip | 15mm- | Sigmoid colon | Low-grade adenoma
25 | 338 | Is | 7mm | Sigmoid colon | Low-grade adenoma
26 | 370 | Is | 5mm | Descending colon | Low-grade adenoma
27 | 249 | Is | 5mm | Ascending colon | Hyperplastic polyp
28 | 195 | Is | 2mm | Transverse colon | Low-grade adenoma
29 | 377 | Isp | 13mm | Sigmoid colon | Low-grade adenoma
30 | 224 | IIa | 4mm | Sigmoid colon | Low-grade adenoma
31 | 183 | Ip | 12mm | Descending colon | Low-grade adenoma
32 | 981 | Ip | 15mm- | Ascending colon | Traditional serrated adenoma
33 | 594 | Is | 5mm | Sigmoid colon | Low-grade adenoma
34 | 245 | Is | 3mm | Ascending colon | Low-grade adenoma
35 | 1,212 | Ip | 15mm- | Sigmoid colon | High-grade adenoma
36 | 815 | IIa | 7mm | Sigmoid colon | Low-grade adenoma
37 | 448 | Is | 7mm | Transverse colon | Low-grade adenoma
38 | 509 | Is | 5mm | Ascending colon | Low-grade adenoma
39 | 713 | IIa | 13mm | Ascending colon | Low-grade adenoma
40 | 159 | IIa | 5mm | Transverse colon | Low-grade adenoma
41 | 108 | IIa | 3mm | Rectum | Low-grade adenoma
42 | 268 | Is | 7mm | Transverse colon | Low-grade adenoma
43 | 260 | Isp | 10mm | Ascending colon | Low-grade adenoma
44 | 745 | IIa | 5mm | Sigmoid colon | Low-grade adenoma
45 | 383 | Is | 3mm | Ascending colon | Low-grade adenoma
46 | 170 | IIa | 2mm | Transverse colon | Hyperplastic polyp
47 | 705 | Is | 5mm | Transverse colon | Low-grade adenoma
48 | 176 | Is | 3mm | Transverse colon | Low-grade adenoma
49 | 181 | IIa | 3mm | Transverse colon | Low-grade adenoma
50 | 740 | Ip | 10mm | Sigmoid colon | Low-grade adenoma
51 | 1,737 | IIa(LST-NG) | 15mm- | Cecum | Low-grade adenoma
52 | 207 | IIa | 6mm | Sigmoid colon | Low-grade adenoma
53 | 245 | Is | 4mm | Rectum | Hyperplastic polyp
54 | 345 | Is | 4mm | Sigmoid colon | Low-grade adenoma
55 | 700 | Is | 3mm | Ascending colon | Low-grade adenoma
56 | 248 | Is | 4mm | Sigmoid colon | Hyperplastic polyp
57 | 326 | Is | 5mm | Transverse colon | Low-grade adenoma
58 | 267 | IIa | 6mm | Transverse colon | Sessile serrated lesion
59 | 646 | Isp | 8mm | Sigmoid colon | Traditional serrated adenoma
60 | 146 | IIa | 8mm | Transverse colon | Low-grade adenoma
61 | 679 | Isp | 6mm | Ascending colon | Low-grade adenoma
62 | 351 | Is | 7mm | Ascending colon | Low-grade adenoma
63 | 632 | Is | 7mm | Rectum | Invasive cancer (T1b)
64 | 81 | IIa | 3mm | Sigmoid colon | Low-grade adenoma
65 | 222 | IIa | 3mm | Cecum | Low-grade adenoma
66 | 1,685 | Is | 6mm | Sigmoid colon | Low-grade adenoma
67 | 191 | IIa | 5mm | Transverse colon | Low-grade adenoma
68 | 1,319 | Is | 15mm- | Rectum | High-grade adenoma
69 | 130 | IIa | 3mm | Descending colon | Low-grade adenoma
70 | 264 | Ip | 15mm- | Sigmoid colon | Low-grade adenoma
71 | 1,021 | Is | 4mm | Ascending colon | Low-grade adenoma
72 | 774 | Is | 5mm | Ascending colon | Low-grade adenoma
73 | 1,285 | Is | 3mm | Cecum | Low-grade adenoma
74 | 276 | Isp | 5mm | Sigmoid colon | Low-grade adenoma
75 | 343 | Is | 3mm | Transverse colon | Low-grade adenoma
76 | 343 | Is | 3mm | Cecum | Low-grade adenoma
77 | 215 | Is | 4mm | Ascending colon | Low-grade adenoma
78 | 267 | Isp | 12mm | Sigmoid colon | High-grade adenoma
79 | 76 | Is | 4mm | Descending colon | Low-grade adenoma
80 | 1,192 | Is | 10mm | Sigmoid colon | Low-grade adenoma
81 | 427 | Is | 6mm | Sigmoid colon | Low-grade adenoma
82 | 111 | IIa | 3mm | Sigmoid colon | Sessile serrated lesion
83 | 795 | Isp | 13mm | Rectum | Low-grade adenoma
84 | 218 | Is | 5mm | Descending colon | Low-grade adenoma
85 | 1,393 | IIa | 8mm | Ascending colon | Low-grade adenoma
86 | 257 | IIa | 4mm | Sigmoid colon | Low-grade adenoma
87 | 454 | Is | 3mm | Cecum | Low-grade adenoma
88 | 249 | Is | 4mm | Ascending colon | Low-grade adenoma
89 | 149 | Ip | 5mm | Descending colon | Low-grade adenoma
90 | 479 | Is | 10mm | Ascending colon | Sessile serrated lesion
91 | 1,061 | IIa | 13mm | Ascending colon | Low-grade adenoma
92 | 391 | Is | 7mm | Descending colon | Low-grade adenoma
93 | 452 | Is | 7mm | Descending colon | Low-grade adenoma
94 | 136 | Is | 6mm | Sigmoid colon | Low-grade adenoma
95 | 606 | Isp | 8mm | Sigmoid colon | Low-grade adenoma
96 | 301 | Is | 5mm | Sigmoid colon | Hyperplastic polyp
97 | 431 | IIa | 15mm- | Cecum | Sessile serrated lesion
98 | 170 | IIa | 4mm | Transverse colon | Low-grade adenoma
99 | 161 | Is | 5mm | Sigmoid colon | Low-grade adenoma
100 | 188 | IIa | 3mm | Rectum | Hyperplastic polyp
Table 3: Breakdown of non-polyp samples of SUN database.
ID | Number of frames | Length of each video (seconds)
1 | 9,961 | 332.0
2 | 10,073 | 335.8
3 | 7,152 | 238.4
4 | 14,635 | 487.8
5 | 7,916 | 263.9
6 | 17,046 | 511.4
7 | 5,636 | 169.1
8 | 2,568 | 85.6
9 | 9,522 | 317.4
10 | 7,086 | 236.2
11 | 4,832 | 161.1
12 | 6,799 | 226.6
13 | 6,328 | 210.9
Supplementary data
Accuracy of the latest approved model (July 2022)
After the publication (Misawa M et al. Gastrointest Endosc 2021;93(4):960-967.e3), we updated EndoBRAIN-EYE (CADe). We therefore re-analyzed the database with the latest model to show its current performance. The following tables show the latest performance on the SUN Colonoscopy Video Database.
Table 4: Performance of the latest model and the previously reported model.
Metric | Latest model*: percent (95% confidence interval) | Latest model*: n/N | Reported model†: percent (95% confidence interval) | Reported model†: n/N
Sensitivity (per-lesion) | 98.0 (93.0-99.8) | 98/100 | 98.0 (93.0-99.8) | 98/100
Sensitivity (per-frame) | 91.5 (91.2-91.7) | 44,092/48,212 ‡,§ | 90.5 (90.2-90.7) | 44,472/49,140 ‡
Specificity | 98.2 (98.1-98.2) | 90,068/91,764 ‡,§ | 93.7 (93.5-93.8) | 88,075/94,039 ‡
*The latest model received regulatory approval in July 2022.
†See Misawa M et al. Gastrointest Endosc 2021;93(4):960-967.e3.
‡The numbers of frames are inconsistent because the database was released after the paper's publication with inappropriate frames removed.
§Frames identified as inappropriate by the CADe system were excluded from the analysis.
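The page does not state how the 95% confidence intervals in Table 4 were computed. As an illustration only, the sketch below assumes an exact (Clopper-Pearson) binomial interval and applies it to the per-lesion sensitivity of 98/100. The function names are hypothetical, and the bounds are found by simple bisection using only the Python standard library.

```python
from math import comb


def binom_cdf(k: int, n: int, p: float) -> float:
    """P(X <= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k + 1))


def _solve_increasing(func, target: float) -> float:
    """Find p in [0, 1] with func(p) == target, for func increasing in p."""
    lo, hi = 0.0, 1.0
    for _ in range(60):  # 60 halvings are far below float precision
        mid = (lo + hi) / 2
        if func(mid) < target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


def clopper_pearson(x: int, n: int, alpha: float = 0.05):
    """Exact two-sided (1 - alpha) interval for a binomial proportion x/n."""
    # Lower bound: P(X >= x | p) = alpha / 2
    lower = 0.0 if x == 0 else _solve_increasing(
        lambda p: 1 - binom_cdf(x - 1, n, p), alpha / 2)
    # Upper bound: P(X <= x | p) = alpha / 2, i.e. P(X > x | p) = 1 - alpha / 2
    upper = 1.0 if x == n else _solve_increasing(
        lambda p: 1 - binom_cdf(x, n, p), 1 - alpha / 2)
    return lower, upper


# Per-lesion sensitivity from Table 4: 98 detected polyps out of 100.
lower, upper = clopper_pearson(98, 100)
print(f"{100 * 98 / 100:.1f} ({100 * lower:.1f}-{100 * upper:.1f})")
```

Under this assumption the interval for 98/100 is close to the reported 93.0-99.8. The per-frame denominators are much larger, so a library routine for the beta distribution would be a better fit there than this direct summation.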
Table 5: Results of the positive videos (per-frame analysis).
Total number of frames | True positive* | False negative | False positive† | Unanalyzable frames‡
48,344 | 44,092 | 3,964 | 156 | 132
*Number of frames in which the IoU between the predicted bounding box and the ground truth is greater than or equal to 0.3.
†Number of frames for which the trained model outputs a bounding box but the IoU is less than 0.3.
‡Number of frames identified as inappropriate by the CADe system.
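The IoU criterion in the Table 5 footnotes can be illustrated with a short Python sketch. This is not the evaluation code used for the tables: the function names are hypothetical, boxes are assumed to be (min_x, min_y, max_x, max_y) tuples as in the annotation file, one predicted box per frame is assumed, and a frame with no predicted box is assumed to count as a false negative (the page does not define false negatives explicitly).

```python
from typing import Optional, Tuple

Box = Tuple[float, float, float, float]  # (min_x, min_y, max_x, max_y)

IOU_THRESHOLD = 0.3  # threshold stated in the Table 5 footnotes


def iou(box_a: Box, box_b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    # Width and height of the overlap; zero when the boxes do not intersect.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    return inter / union if union > 0 else 0.0


def classify_positive_frame(predicted: Optional[Box], ground_truth: Box) -> str:
    """Label one frame of a polyp video under the assumptions stated above."""
    if predicted is None:
        return "false negative"  # assumption: no output box on a polyp frame
    if iou(predicted, ground_truth) >= IOU_THRESHOLD:
        return "true positive"
    return "false positive"


# Two 10x10 boxes shifted by 5 pixels overlap with IoU = 50 / 150 = 1/3.
example_iou = iou((0, 0, 10, 10), (5, 0, 15, 10))
```

With a threshold of 0.3, the example pair above would count as a true positive, while a shift of 8 pixels (IoU = 20 / 180) would count as a false positive.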
Table 6: Results of the negative videos (per-frame analysis).
Total number of frames | False positive | True negative | Unanalyzable frames*
109,516 | 1,696 | 90,068 | 17,752
*Number of frames identified as inappropriate by the CADe system.
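The per-frame figures in Table 4 can be traced back to the counts in Tables 5 and 6 once the unanalyzable frames are excluded from the denominators. The sketch below reproduces that arithmetic; the variable names are illustrative, and the true-positive count is taken from the numerator reported in Table 4.

```python
# Counts for the positive (polyp) videos, from Table 5.
pos_total = 48_344
pos_unanalyzable = 132
true_positive = 44_092  # numerator of per-frame sensitivity in Table 4

# Counts for the negative (non-polyp) videos, from Table 6.
neg_total = 109_516
neg_unanalyzable = 17_752
true_negative = 90_068
false_positive_neg = 1_696

# Unanalyzable frames are excluded before computing the rates.
pos_analyzed = pos_total - pos_unanalyzable  # denominator for sensitivity
neg_analyzed = neg_total - neg_unanalyzable  # denominator for specificity

sensitivity = 100 * true_positive / pos_analyzed
specificity = 100 * true_negative / neg_analyzed

print(f"per-frame sensitivity: {sensitivity:.1f}% ({true_positive}/{pos_analyzed})")
print(f"specificity: {specificity:.1f}% ({true_negative}/{neg_analyzed})")
```

The analyzed negative frames also split exactly into true negatives and false positives, which is a quick consistency check on Table 6.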
Table 7: Results of the per-polyp analysis of the positive videos.
Number of polyp videos | True positive | False negative
100 | 98 | 2
Terms of use
Copyright
All intellectual property rights, including copyrights, relating to the information contained in this database are held by Showa University Northern Yokohama Hospital and Mori Lab., Graduate School of Informatics, Nagoya University.
Intended use
This database is available only for non-commercial use for research or educational purposes. As long as you use the database for these purposes, you may edit or process the images and annotations in this database. Commercial use of this dataset without permission from Mori Lab. is prohibited, even after copying, editing, processing, or any other operation on this database. Please contact us regarding commercial use or if you are unsure whether your use is permitted.
Citation
You may refer to or cite part of this database provided that you clearly indicate the following information. By downloading and using the SUN database, you agree to cite this database in any publication based on research in which it has been used.
- Development of a computer-aided detection system for colonoscopy and a publicly accessible large colonoscopy video database (with video). Masashi Misawa, Shin-ei Kudo, Yuichi Mori, Kinichi Hotta, Kazuo Ohtsuka, Takahisa Matsuda, Shoichi Saito, Toyoki Kudo, Toshiyuki Baba, Fumio Ishida, Hayato Itoh, Masahiro Oda, Kensaku Mori, Gastrointestinal Endoscopy, Vol. 93, Issue 4, pp. 960-967.e3, 2021. DOI: 10.1016/j.gie.2020.07.060
- SUN Colonoscopy Video Database. Hayato Itoh, Masashi Misawa, Yuichi Mori, Masahiro Oda, Shin-Ei Kudo, Kensaku Mori, 2020, http://amed8k.sundatabase.org/
Distribution
It is prohibited to sell, transfer, lend, lease, resell, or otherwise distribute this database, in whole or in part, whether as it is or after copying, editing, or processing.
Personal information
Personal information submitted to obtain the download path for this database is used only for notifications about this database and for its usage statistics. Information is managed in accordance with the Nagoya University Privacy Policy. Note that this website uses Google Analytics with cookies to analyze access. If you want to opt out of this analysis, please disable cookies in your web browser.
Immunity
Showa University, Nagoya University, and Mori Lab. are not responsible for any damage caused by downloading or using this database. We do not guarantee the accuracy, completeness, or usefulness of this database.
If you violate any of the above terms, or at the discretion of Showa University, Mori Lab., or Nagoya University, you may be prohibited from using the database even after downloading it.
Showa University, Mori Lab., and Nagoya University may revise this agreement at their discretion without the approval of registrants or data users.
Contact
* If you agree to all the terms of use, please send a request e-mail to hitoh (a t) mori.m.is.nagoya-u.ac.jp.