Johns Hopkins University
Computer vision techniques have the potential to improve the diagnostic performance of colonoscopy, but the lack of 3D colonoscopy datasets for training and validation hinders their development. This paper introduces C3VDv2, the second version (v2) of the high-definition Colonoscopy 3D Video Dataset, featuring enhanced realism designed to facilitate the quantitative evaluation of 3D colon reconstruction algorithms. 192 video sequences were captured by imaging 60 unique, high-fidelity silicone colon phantom segments. Ground truth depth, surface normals, optical flow, occlusion, six-degree-of-freedom pose, coverage maps, and 3D models are provided for 169 colonoscopy videos. Eight simulated screening colonoscopy videos acquired by a gastroenterologist are provided with ground truth poses. The dataset includes 15 videos featuring colon deformations for qualitative assessment. C3VDv2 emulates diverse and challenging scenarios for 3D reconstruction algorithms, including fecal debris, mucous pools, blood, debris obscuring the colonoscope lens, en-face views, and fast camera motion. The enhanced realism of C3VDv2 will allow for more robust and representative development and evaluation of 3D reconstruction algorithms.
Polyp cleaning with water jet followed by scope dipping in mucous pool and lens cleaning.
Fast Loop
Flowing red debris with dirty lens and lens cleaning. First half of camera trajectory mirrors the second half.
Exploratory Motion
En face to down the barrel motion
Synchronized clean and debris colon video pair.
Colon deformation video.
C3VDv2 consists of two distinct colon shapes (c1 and c2), each segmented into seven to eight anatomical regions, with each segment further having four unique textures and colors (t1, t2, t3, and t4). C3VDv2 contains 192 videos with a total of 169,371 frames. It comprises three different types of video sequences:
The dataset is publicly hosted on Johns Hopkins Research Data Repository. You can either directly download from the repository page or via links below (Dataverse API based). We have also provided a bash script to download data via Dataverse API calls.
For each registered video frame, the dataset includes:
Colon | Segment | Phantom Number | Video Number | # Frames | Preview | Download |
---|
Colon | Segment | Phantom Number | Video Number | # Frames | Preview | Download |
---|
Colon | Segment | Phantom Number | Video Number | # Frames | Preview | Download |
---|
Colon | Segment | Lumen Download | Mold Download |
---|
The spherical omnidirectional camera intrinsics are given in camera_intrinsics.txt. Additionally, two calibration sequences are provided for geometric and photometric calibration in the camera_calibration folder:
Colon | Segment | Texture | Video | # Frames | Preview | Old Name | Download |
---|
Colon | Segment | Texture | Video | # Frames | Preview | Download | Old Name |
---|
Colon | Segment | Lumen Download | Mold Download |
---|
Please consider citing our publications if you use code or data from this site.
@article{golhar2025c3vdv2,
title={C3VDv2--Colonoscopy 3D video dataset with enhanced realism},
author={Golhar, Mayank V and Fretes, Lucas Sebastian Galeano and Ayers, Loren and Akshintala, Venkata S and Bobrow, Taylor L and Durr, Nicholas J},
journal={arXiv preprint arXiv:2506.24074},
year={2025}
}
@article{bobrow2023,
title={Colonoscopy 3D video dataset with paired depth from 2D-3D registration},
author={Bobrow, Taylor L and Golhar, Mayank and Vijayan, Rohan and Akshintala, Venkata S and Garcia, Juan R and Durr, Nicholas J},
journal={Medical Image Analysis},
pages={102956},
year={2023},
publisher={Elsevier},
}
This work is licensed under CC BY-NC-SA 4.0