KLI

How Many Private Data Are Needed for Deep Learning in Lung Nodule Detection on CT Scans? A Retrospective Multicenter Study

Metadata Downloads
Abstract
Early detection of lung nodules is essential for preventing lung cancer. However, the number of radiologists who can diagnose lung nodules is limited, and considerable effort and time are required. To address this problem, researchers are investigating the automation of deep-learning-based lung nodule detection. However, deep learning requires large amounts of data, which can be difficult to collect. Therefore, data collection should be optimized to facilitate experiments at the beginning of lung nodule detection studies. We collected chest computed tomography scans from 515 patients with lung nodules from three hospitals and high-quality lung nodule annotations reviewed by radiologists. We conducted several experiments using the collected datasets and publicly available data from LUNA16. The object detection model, YOLOX was used in the lung nodule detection experiment. Similar or better performance was obtained when training the model with the collected data rather than LUNA16 with large amounts of data. We also show that weight transfer learning from pre-trained open data is very useful when it is difficult to collect large amounts of data. Good performance can otherwise be expected when reaching more than 100 patients. This study offers valuable insights for guiding data collection in lung nodules studies in the future.
Author(s)
Jeong Woo SonJi Young HongYoon KimWoo Jin KimDae-Yong ShinHyun-Soo ChoiSo Hyeon BakKyoung Min Moon
Issued Date
2022
Type
Article
Keyword
computed tomographydeep learninglung nodulenodule detectionpublicly available dataradiologisttransfer learning
DOI
10.3390/cancers14133174
URI
https://oak.ulsan.ac.kr/handle/2021.oak/15206
Publisher
Cancers
Language
영어
ISSN
2072-6694
Citation Volume
14
Citation Number
13
Citation Start Page
1
Citation End Page
19
Appears in Collections:
Medicine > Nursing
공개 및 라이선스
  • 공개 구분공개
파일 목록
  • 관련 파일이 존재하지 않습니다.

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.