Performance of an Artificial Multi-observer Deep Neural Network for Fully Automated Segmentation of Polycystic Kidneys
J Digit Imaging
DOI 10.1007/s10278-017-9978-1
Performance of an Artificial Multi-observer Deep Neural
Network for Fully Automated Segmentation of Polycystic Kidneys
Timothy L. Kline 1 & Panagiotis Korfiatis 1 & Marie E. Edwards 2 & Jaime D. Blais 3 &
Frank S. Czerwiec 3 & Peter C. Harris 2 & Bernard F. King 1 & Vicente E. Torres 2 &
Bradley J. Erickson 1
# The Author(s) 2017. This article is an open access publication
Abstract Deep learning techniques are being rapidly applied
to medical imaging tasks—from organ and lesion segmentation to tissue and tumor classification. These techniques are
becoming the leading algorithmic approaches to solve inherently difficult image processing tasks. Currently, the most critical requirement for successful implementation lies in the need
for relatively large datasets that can be used for training the
deep learning networks. Based on our initial studies of MR
imaging examinations of the kidneys of patients affected by
polycystic kidney disease (PKD), we have generated a unique
database of imaging data and corresponding reference standard segmentations of polycystic kidneys. In the study of
PKD, segmentation of the kidneys is needed in order to measure total kidney volume (TKV). Automated methods to segment the kidneys and measure TKV are needed to increase
measurement throughput and alleviate the inherent variability
of human-derived measurements. We hypothesize that deep
learning techniques can be leveraged to perform fast, accurate,
reproducible, and fully automated segmentation of polycystic
kidneys. Here, we describe a fully automated approach for
segmenting PKD kidneys within MR images that simulates
a multi-observer approach in order to create an accurate and
robust method for the task of segmentation and computation
of TKV for PKD patients. A total of 2000 cases were used for
* Timothy L. Kline
1
Department of Radiology, Mayo Clinic College of Medicine, 200
First St SW, Rochester, MN 55905, USA
2
Division of Nephrology and Hypertension, Mayo Clinic College of
Medicine, Rochester, MN, USA
3
Otsuka Pharmaceutical Development & Commercialization Inc.,
Rockville, MD, USA
training and validation, and 400 cases were used for testing.
The multi-observer ensemble method had mean ± SD percent
volume difference of 0.68 ± 2.2% compared with the reference standard segmentations. The complete framework performs fully automated segmentation at a level comparable
with interobserver variability and could be considered as a
replacement for the task of segmentation of PKD kidneys by
a human.
Keywords Autosomal dominant polycystic kidney disease .
Deep learning . Magnetic resonance imaging . Planimetry .
Segmentation . Total kidney volume
Introduction
A particular section of machine learning, known as deep learning, is currently enjoying its renaissance in the area of artificial
intelligence [1]. For computer vision tasks, the primary motivation of deep learning techniques is the biomimicry of the
human visual system, allowing computers to learn from experience and formulate an understanding in terms of a hierarchy
of concepts. In the field of medical image processing, deep
learning approaches are providing computational solutions to
a wide range of automation and classification tasks [2]. For
instance, deep learning techniques have been used in organ [3]
and tumor segmentation tasks [4], as well as tissue and tumor
classification [5, 6]. The fundamental difference of deep learning methods is that they take a unique approach to solving
classical image processing tasks by allowing the computer to
identify image features of interest. This is in contrast to traditional machine learning that requires predefining the features
of interest (e.g., image edges, intensity, and/or texture). Based
on the successes of deep learning techniques, we sought to
explore their potential in solving the difficult task of
J Digit Imaging
segmenting the kidneys of patients affected by autosomal
dominant polycystic kidney disease (ADPKD).
In ADPKD, these phenotypic differences include renal size
(e.g., renal volumes can vary from ~200 ml to more than
7000 ml), shape, and composition (e.g., appearance of the
border of the kidneys in MR images has highly variable signal
intensities resulting from whether the border is composed of
simple and/or complex cysts, varying degrees of fibrosis, or
healthy renal parenchyma). The natural course of ADPKD is
highly variable and is characterized by progressive enlargement of cysts within the kidneys and is a leading cause of endstage renal disease (ESRD) [7–10]. Total kidney volume
(TKV) has become the main image-based biomarker for following ADPKD progression at early stages of the disease
[11–15]. Imaging methods such as ultrasound (US), computed
tomography (CT), and magnetic resonance imaging (MRI) are
employed to diagnose, monitor, and predict outcomes for patients affected by ADPKD [16–19]. MRI has become the imaging modality of choice due to its superior soft tissue contrast, non-ionizing radiation, and accuracy. Current methods to
manually measure TKV using MR images include volume
calculation by the ellipsoidal method [20], stereological approaches [21], and planimetry tracings [22, 23].
Due to the large time requirement of manual tracing, automated approaches to segment kidneys are desirable. However,
segmentation of ADPKD kidneys is challenging due to a
number of factors. For instance, the shapes of the kidneys
are highly irregular, and the contrast at the border of the kidney is highly variable at the interface of several different tissue
types including fluid-filled cysts, calcified cysts, renal parenchyma, and fibrotic tissue. In addition, MR acquisition parameters vary widely from institution, requiring a robust approach
which can handle not only the wide range of disease presentations but also the drastic difference in tissue contrast due to
how the images were acquired.
We previously developed both semi- and fully automated
segmentation approaches to allow accurate and reproducible
measurement of TKV in ADPKD patients [24, 25].
Fortunately, these developments have allowed for the creation
of a database of thousands of reference standard segmentations by which we have been able to explore novel, nextgeneration image processing techniques in order to finally
and fully address the problem of segmentation of the PKD
kidney in order to accurately and reproducibly derive TKV.
We have developed a deep neural network model that can
capture both local and global context within the image. This
model is based on a convolutional neural network (CNN)
approach that performs a series of downsampling (i.e., max
pooling operations which select the maximum value from a
patch of features which help to reduce the data dimensionality)
and upsampling procedures (similar to autoencoders [26],
which allow classification to be made at the voxel level).
The network also incorporates skip connections (similar to a
CNN architect (...truncated)