Data exploration in evolutionary reconstruction of PET images (pdf)

Article PDF cannot be displayed. You can download it here:

https://link.springer.com/content/pdf/10.1007%2Fs10710-018-9330-7.pdf

Data exploration in evolutionary reconstruction of PET images

Genetic Programming and Evolvable Machines https://doi.org/10.1007/s10710-018-9330-7 Data exploration in evolutionary reconstruction of PET images Cameron C. Gray1 · Shatha F. Al‑Maliki1,2 · Franck P. Vidal1 Received: 11 December 2017 / Revised: 5 July 2018 © The Author(s) 2018 Abstract This work is based on a cooperative co-evolution algorithm called ‘Fly Algorithm’, which is an evolutionary algorithm (EA) where individuals are called ‘flies’. It is a specific case of the ‘Parisian Approach’ where the solution of an optimisation problem is a set of individuals (e.g. the whole population) instead of a single individual (the best one) as in typical EAs. The optimisation problem considered here is tomography reconstruction in positron emission tomography (PET). It estimates the concentration of a radioactive substance (called a radiotracer) within the body. Tomography, in this context, is considered as a difficult ill-posed inverse problem. The Fly Algorithm aims at optimising the position of 3-D points that mimic the radiotracer. At the end of the optimisation process, the fly population is extracted as it corresponds to an estimate of the radioactive concentration. During the optimisation loop a lot of data is generated by the algorithm, such as image metrics, duration, and internal states. This data is recorded in a log file that can be post-processed and visualised. We propose using information visualisation and user interaction techniques to explore the algorithm’s internal data. Our aim is to better understand what happens during the evolutionary loop. Using an example, we demonstrate that it is possible to interactively discover when an early termination could be triggered. It is implemented in a new stopping criterion. It is tested on two other examples on which it leads to a 60% reduction of the number of iterations without any loss of accuracy. Keywords Fly Algorithm · Tomography reconstruction · Information visualisation · Data exploration · Artificial evolution · Parisian evolution Abbreviations EA Evolutionary algorithm PET Positron emission tomography ET Emission tomography * Franck P. Vidal Extended author information available on the last page of the article 13 Vol.:(0123456789) Genetic Programming and Evolvable Machines InfoVis Information visualisation CSV Comma-Separated Values CCEA Cooperative co-evolution algorithm CT Computed tomography SPECT Single-photon emission computed tomography SNR Signal-to-noise ratio MRI Magnetic resonance imaging keV Kiloelectron volt LOR Line of response MLEM Maximum-Likelihood Expectation-Maximization EM Expectation-Maximization OSEM Ordered Subset Expectation-Maximization voxel Volume element SVG Structured Vector Graphics MAE Mean absolute error MSE Mean squared error RMSE Root mean squared error ZNCC Zero-normalised cross-correlation PSNR Peak signal-to-noise ratio SSIM Structural similarity DSSIM Structural dissimilarity TV Total variation 1 Introduction This research is related to the use of evolutionary computing in nuclear medicine, more particularly positron emission tomography (PET) reconstruction. In this paper, we investigate the use of information visualisation (InfoVis) and data exploration to understand some of the behaviours of an evolutionary algorithm (EA). In particular, we want to assess if the algorithm could have been stopped earlier to get a reasonable solution instead of waiting until the algorithm ends and using the final solution as the problem answer. The combination of visualisation and evolutionary computing is still a relatively overlooked field. Two different approaches can be distinguished: • visualisation to understand an evolutionary algorithm [30, 38, 52, 53], and • interactive artificial evolution to improve the visualisation [9, 24, 36]. First attempts were reported at the end of the 90s. Early visualisations were using relatively basic techniques that mostly relied on plotting with limited or no interactivity. During the evolutionary PET reconstruction, multiple time series are recorded hundreds of thousands of times. Comparing these time series by hand using typical scatterplots and line charts with no interactivity is not practically feasible: 13 Genetic Programming and Evolvable Machines • The order of magnitude of each time series is different. They would need to be independently normalised before plotting. • Trial and error would be needed to choose the axis of interest because it is not necessarily straightforward to do so without a deep a priori understanding of the data. • Adjusting the data range visualised in the scatterplots would also need to be performed with trial and error. • Displaying selected images would need to be done manually. The use of Parallel Coordinate Plots is very popular to visualise high-dimensional geometry and analyse multivariate data [28], which is the type of data considered here. Interactivity using the brushing technique [37] makes it feasible to easily explore parts of this high-dimensional space and visually analyse this complex multivariate dataset. This is the approach we adopted here to develop an integrated visualisation framework dedicated to our evolutionary PET reconstruction algorithm. The emergence of information visualisation and data analytics is opening new doors for its use in the evolutionary computing domain. We adopt the first approach to analyse the evolutionary process of our image reconstruction algorithm for tomography in nuclear medicine. In typical evolutionary algorithms, the best individual of the final population is the solution of the optimisation problem. Our algorithm relies on the Parisian approach where the solution to the problem is a group of individuals, e.g. the whole population or a subset of the population. The population size progressively increases to improve the resolution of the output image. The algorithm is launched with input parameters such as the initial number of individuals, the final number of individuals, the probability of operators, etc., the final solution is extracted at the end of the optimisation process then converted into a problem-specific answer. A lot of the data is generated during the evolution process, in particular data based on error metrics and correlation measurements. Traditionally all this data is discarded at the end of the evolutionary process, as only the final population is considered. Our hypothesis is that intermediate populations and internal data should not be systematically discarded as they can be reviewed offline. They can be used to analyse the performance of the population over time. When using stagnation as the stopping criterion, the final population is not necessarily the best one due to oscillations around the minimal fitness value. In such a case, past generations will have to be accessible. Also, reaching the targeted number of individuals may not be necessary if the reconstructed image stops improving. Offline (...truncated)