A Distributed Privacy Preserved Federated Learning Approach for Revolutionizing Pneumonia Detection in Isolated Heterogenous Data Silos

International Journal of Mathematical, Engineering and Management Sciences, Jul 2025

Pneumonia is a respiratory lung contamination that ranges in severity from mild to lethal outcomes. The analysis of tomographic images is the most significant method of pneumonia detection. The image analysis requires expertise and proficiency to diagnose the disease correctly. The medical reports with multiple diseases have overlapping symptoms, which may lead to misdiagnosis and deferred identification. The misdiagnosis results in increased healthcare costs, worsened medical conditions, and legal implications. Centralized deep learning enhances the feature extraction process and optimally improves the prediction outcomes; however, these models have data privacy concerns due to centralized storage systems. The healthcare departments follow the Health Insurance Portability and Accountability Act. (HIPAA) to maintain the retaining of patient data and improve the portability and continuity of health insurance coverage. In the proposed work, federated learning has been utilized to enhance data privacy and deal with imbalanced and diverse data silos. This distributed privacy-preserved model has been employed with a pooled dataset curated from multiple sources in a 5-client architecture. The model was implemented with the FedAVG aggregation technique in independent and identically distributed (IID) and non-IID data distributions. The outcomes of the model exhibit 87.62% accuracy with IID and 86.15% accuracy with non-IID distributions. The comparison of these outcomes with the existing studies shows that the proposed model outperforms by exhibiting better performance and resulting in the minimum loss of 0.4041 and 0.4139 with IID and non-IID distributions, respectively.

A Distributed Privacy Preserved Federated Learning Approach for Revolutionizing Pneumonia Detection in Isolated Heterogenous Data Silos

International Journal of Mathematical, Engineering and Management Sciences Vol. 10, No. 5, 1324-1350, 2025 https://doi.org/10.33889/IJMEMS.2025.10.5.063 A Distributed Privacy Preserved Federated Learning Approach for Revolutionizing Pneumonia Detection in Isolated Heterogenous Data Silos Shagun Sharma Chitkara University Institute of Engineering and Technology, Chitkara University, Punjab, India. & School of Computing Science & Engineering, VIT Bhopal University, Sehore, Bhopal, Madhya Pradesh, India. E-mail: , Kalpna Guleria Chitkara University Institute of Engineering and Technology, Chitkara University, Punjab, India. Corresponding author: , (Received on February 4, 2025; Revised on April 15, 2025; Accepted on April 29, 2025) Abstract Pneumonia is a respiratory lung contamination that ranges in severity from mild to lethal outcomes. The analysis of tomographic images is the most significant method of pneumonia detection. The image analysis requires expertise and proficiency to diagnose the disease correctly. The medical reports with multiple diseases have overlapping symptoms, which may lead to misdiagnosis and deferred identification. The misdiagnosis results in increased healthcare costs, worsened medical conditions, and legal implications. Centralized deep learning enhances the feature extraction process and optimally improves the prediction outcomes; however, these models have data privacy concerns due to centralized storage systems. The healthcare departments follow the Health Insurance Portability and Accountability Act. (HIPAA) to maintain the retaining of patient data and improve the portability and continuity of health insurance coverage. In the proposed work, federated learning has been utilized to enhance data privacy and deal with imbalanced and diverse data silos. This distributed privacy-preserved model has been employed with a pooled dataset curated from multiple sources in a 5-client architecture. The model was implemented with the FedAVG aggregation technique in independent and identically distributed (IID) and non-IID data distributions. The outcomes of the model exhibit 87.62% accuracy with IID and 86.15% accuracy with non-IID distributions. The comparison of these outcomes with the existing studies shows that the proposed model outperforms by exhibiting better performance and resulting in the minimum loss of 0.4041 and 0.4139 with IID and nonIID distributions, respectively. Keywords- Deep learning, InceptionV3, Distributed architecture, Federated learning, Pneumonia detection, Centralized learning. 1. Introduction In 2019, the number of Covid-19 cases started to increase, resulting in a global health crisis and widespread transmission, hence WHO declaring it as a pandemic in March 2020 (Kafadar et al., 2022). It caused a rapid increase in mortality rates due to the unavailability of vaccines and medications. This pandemic directly impacted the patient care system and economic growth that limited medical resources and healthcare equipment’s worldwide (Holt et al., 2020). The sudden surge of Covid-19 cases and immediate shutdowns led to a decrease in the available healthcare assets and medicine supplies. Covid-19 affects the respiratory system and results in reducing surfactant secretion (Alipoor et al., 2020). The decrease in secretion causes the alveoli to collapse, resulting in pneumonia. In the past 20 years, the mortality cases caused by pediatric pneumonia have significantly decreased (Kanwal et al., 2024). However, these cases started to increase during the pandemic, when the available healthcare resources were insufficient for the treatment of Covid19. Pneumonia is respiratory lung contamination caused by bacteria and viruses and ranges from mild to 1324 | https://www.ijmems.in Sharma & Guleria: A Distributed Privacy Preserved Federated Learning Approach for … lethal outcomes (Adjei-Mensah et al., 2024). The advancement in pneumonia stages may affect other organs, such as the nervous system, heart, and lungs, along with blood vessels (Hatmi, 2021). The identification of pneumonia is challenging to the lower income countries due to the complex structure of the disease (Kundu et al., 2021). The other problem associated with this disease is its misdiagnosis due to the similar kinds of symptoms in tuberculosis, Covid-19, and lung cancer. As per the WHO report, pneumonia affects people over the age of 65 and patients with a weakened immune system the most (Pneumonia, n.d.). There are various other kinds of respiratory diseases, such as silicosis, chronic bronchitis, cystic fibrosis, asthma, lung cancer, and tuberculosis. The statistics presented in Goyal & Singh (2023), show that lung cancer has affected 8 million people to date; however, this patient count is less if compared to the 15-month cases of Covid-19 and pneumonia. Respiratory infections and diseases are detected using the tomography imaging process, specifically with X-ray scans; however, it requires expertise and proficiency to correctly diagnose the disease. Patients with multiple diseases may have overlapping symptoms in their reports, which creates complexity in accurately diagnosing the associated disease, and the obliviousness of the medical imaging process leads to misdiagnosis of the disease. The misdiagnosis of the disease may result in worse medical conditions, increased healthcare costs, psychological impact, and legal implications, along with lethal results. There are various existing studies that have used different kinds of artificial intelligence (AI) tools on medical images to accurately identify diseases. The deep learning (DL) models are also used for Alzheimer's, Parkinson’s, pneumonia, cancer, and Covid-19 disease detection (Hariri & Avşar, 2023; Kafadar et al., 2022; Kundu et al., 2021; Prakash et al., 2023b; Prakash et al., 2023c; Rahimzadeh & Attar, 2020). These models are highly efficient and optimal for providing improved performance of disease detection by performing proficient image processing and feature extraction. The Xception, InceptionV3, VGG16, VGG19, ResNet50, CNN, and ensemble models have been used by various researchers to predict pneumonia from lung CXRs (Alyasseri et al., 2022; Gulati et al., 2022; Hasan et al., 2021; Ieracitano et al., 2022; Jaiswal et al., 2019; Prakash et al., 2023a; Rehman et al., 2022). These models have hierarchical feature learning, robustness to variance, higher scalability, and capability due to the handling of millions of parameters to capture the image data with vast complexities. These models are trained with the centralized learning process, where the data processing and computational tasks are performed within the central server using the vast volume of the data. DL presents various efficient approaches for optimally analyzing the images, while it also poses significant challenges such as scalability issues, data privacy concerns, and high time complexity due to transferring the data to the centralized loca (...truncated)


This is a preview of a remote PDF: https://www.ijmems.in/cms/storage/app/public/uploads/volumes/63-IJMEMS-25-0064-10-5-1324-1350-2025.pdf
Article home page: https://doaj.org/article/93531c65e7924c469b186dd147e0871a

Shagun Sharma, Kalpna Guleria. A Distributed Privacy Preserved Federated Learning Approach for Revolutionizing Pneumonia Detection in Isolated Heterogenous Data Silos, International Journal of Mathematical, Engineering and Management Sciences, 2025, pp. 1324-1350, Volume 5, DOI: https://doi.org/10.33889/IJMEMS.2025.10.5.063