Repeatability of radiomics studies in colorectal cancer: a systematic review
BMC Gastroenterology
(2023) 23:125
Liu et al. BMC Gastroenterology
https://doi.org/10.1186/s12876-023-02743-1
Open Access
RESEARCH ARTICLE
Repeatability of radiomics studies
in colorectal cancer: a systematic review
Ying Liu1†, Xiaoqin Wei1†, Xu Feng, Yan Liu2, Guiling Feng2 and Yong Du2*
Abstract
Background Recently, radiomics has been widely used in colorectal cancer, but many variable factors affect the
repeatability of radiomics research. This review aims to analyze the repeatability of radiomics studies in colorectal
cancer and to evaluate the current status of radiomics in the field of colorectal cancer.
Methods The included studies in this review by searching from the PubMed and Embase databases. Then each study
in our review was evaluated using the Radiomics Quality Score (RQS). We analyzed the factors that may affect the
repeatability in the radiomics workflow and discussed the repeatability of the included studies.
Results A total of 188 studies was included in this review, of which only two (2/188, 1.06%) studies controlled the
influence of individual factors. In addition, the median score of RQS was 11 (out of 36), range-1 to 27.
Conclusions The RQS score was moderately low, and most studies did not consider the repeatability of radiomics
features, especially in terms of Intra-individual, scanners, and scanning parameters. To improve the generalization of
the radiomics model, it is necessary to further control the variable factors of repeatability.
Keywords Colorectal cancer, Artificial intelligence, Radiomics, Repeatability, Machine learning
Background
Colorectal cancer (CRC) is one of the most common
clinical malignant tumors [1]. Medical imaging tools
have become crucial in CRC for staging and treatment
evaluation [2]. However, traditional radiology is mainly
dependent on the subjective qualitative interpretations
of the doctor [2], which often leads to suboptimal positive and negative predictive values [2, 3]. In recent years,
with the rapid development of image analysis methods
†
Ying Liu and Xiaoqin Wei contributed equally to this article, and both should
be considered first author.
*Correspondence:
Yong Du
1
School of Medical Imaging, North Sichuan Medical College, Sichuan
Province, Nanchong City 637000, China
2
Department of Radiology, the Affiliated Hospital of North
Sichuan Medical College, 1 Maoyuannan Road, Sichuan Province
637000 Nanchong City, China
and pattern recognition tools, there is a growing shift
away from qualitative to quantitative analysis of medical
images [2].
As a quantitative analysis tool, radiomics extracts features from medical images through high-throughput
computing and applies them to personalized clinical decisions to improve the accuracy of diagnosis and prognosis
[4]. In recent years, radiomics showed a unique advantage for staging, differential diagnosis, and prognosis [5].
Although an increasing amount of radiomics research
has been published, the comparability and repeatability of radiomics models remain a great challenge due to
the lack of standardization in the field of radiomics [6, 7].
Assessing the repeatability of radiomics is necessary to
achieve the clinical implementation of radiomics results
and to ensure a high predictive capability of the radiomics model for a variety of populations and institutions [8].
In addition, several factors that affect the repeatability
have been identified in the complicated workflow of radiomics, such as scanner [9–11], acquisition parameters
© The Author(s) 2023. Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which
permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the
original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or
other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line
to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory
regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this
licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativeco
mmons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
Liu et al. BMC Gastroenterology
(2023) 23:125
Page 2 of 12
[11–16], pretreatment method [17, 18], segmentation
method [19–22], inter/intra-observer variability [16, 17,
19], feature selection method [23], modeling method
[23].
Therefore, we conducted a systematic review to survey
the repeatability of radiomics research in CRC. Furthermore, we gave some suggestions to increase radiomics
repeatability for future research.
(RQS) [4]. The RQS was a unique quality assessment tool
in radiomics [25], which score was composed of 16 parts
with a total score of 36. A higher score represents better
quality of the article. There were great differences in the
methods used in the eligible studies, so the meta-analysis
did not conduct.
Methods
The common bias analysis tools were not applicable here
for the following reasons. First, the systematic review
aims to assess the repeatability of radiomics research
rather than the clinical purpose and outcomes. Second,
there is no strictly causal association between repeatability and outcomes (diagnostic or prognosis performance).
So the TRIPOD (Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis) and ROBINS (Risk Of Bias in Non-randomized
Studies) were not applicable. Finally, the purposes of the
eligible studies were highly heterogeneous, including
staging, diagnosis, prognosis, and evaluating treatment.
Thus, QUADAS-2(Quality Assessment of Diagnostic
Accuracy Studies), which assesses the risk of bias in diagnostic studies, and QUIPS (Quality in Prognosis Studies),
which assesses the risk of bias in prognostic studies, were
not applicable.
Quality assessment was conducted using the RQS.
Furthermore, the risk of bias in the eligible studies was
assessed by two reviewers from the following specific
aspects:
Review strategy
We conducted a systematic review according to the Preferred Reporting items for Systematic review and MetaAnalysis (PRISMA) checklist [24]. But the review was not
registered before. The systematic search was conducted
by two reviewers via PubMed and Embase databases
until Jul 4, 2022. The full search strategies from Additional Text 1.
Study selection
Population
We included primary research assessing the role of radiomics for diagnostic or prognostic with CRC patients.
However, studies consisting of animal subjects and other
types of articles than original articl (...truncated)