BMC Bioinformatics

BMC Bioinformatics is part of the BMC series which publishes subject-specific journals focused on the needs of individual research communities across all areas ...

List of Papers (Total 19,958)

TransST: transfer learning embedded spatial factor modeling of spatial transcriptomics data

Nov 2025 | Liu, Shuo Shuo, Wang, Shikun, Chen, Yuxuan, et al.

Spatial transcriptomics has emerged as a powerful tool in biomedical research because of its ability to capture both the spatial contexts and abundance of the complete RNA transcript profile in organs of interest. However, limitations of the technology such as the relatively low resolution and comparatively insufficient sequencing depth make it difficult to reliably extract real...

A lightweight single-view contrastive learning hypergraph neural network for food–microbe–disease association prediction

Nov 2025 | Hu, Jianqiang, Hu, Mingyi, Wu, Yangxiang, et al.

Identifying potential associations among food, gut microbiota and disease is fundamental for elucidating interaction mechanisms and advancing personalized healthy dietary strategies. While computational methods have been extensively applied to predict microbiota–disease associations, methods on predicting food–microbiota relationships remain limited, particularly regarding higher...

A shrinkage-based statistical method for testing group mean differences in quantitative bottom-up proteomics

Oct 2025 | Lee, Namgil, Yoo, Hojin, Kim, Juhyoung, et al.

In bottom-up proteomics using data-independent acquisition mass spectrometry (DIA-MS), quantitative measurements are obtained following multiple steps of protein fragmentation and ionization, which introduces cumulative errors and impairs the effectiveness of classical statistical methods. This study proposes an alternative statistical approach for testing group mean differences...

ESAE-SDA: ensemble sparse autoencoder framework for epigenomics-informed snoRNA-disease associations prediction

Oct 2025 | Jiang, Xinqing, Chen, Xiaojun, Xu, Lifeng, et al.

Small nucleolar RNAs (snoRNAs), a class of non-coding RNAs broadly distributed in eukaryotes, are emerging as pivotal regulators in the field of epigenomics. In addition to guiding 2’-O-methylation and pseudouridylation modifications at specific rRNA sites to maintain ribosomal stability and support protein synthesis, snoRNAs have been increasingly implicated in epigenetic...

A novel modality contribution confidence-enhanced multimodal deep learning framework for multiomics data

Oct 2025 | Zhang, Duoyi, Bashar, Md Abul, Nayak, Richi, et al.

Multimodal learning for classification tasks has recently gained significant attention in bioinformatics. Current approaches primarily concentrate on devising efficient deep learning architectures to capture features within and across modalities. However, they typically assume that each modality contributes equally to the classification objective, overlooking inherent biases...

Delineating markers of disease-disease interaction: a systematic methodology and its application to multiple diabetes-helminth cohorts

Oct 2025 | Subramanian, Nilesh, Philip, Philge, Rajamanickam, Anuradha, et al.

Understanding how the molecules in our body respond to the co-occurrence of two diseases in an individual (comorbidity) could lead to mechanistic insights into novel treatments for comorbid conditions. Studies have shown, for instance, that responses of our immune system to comorbid conditions could be more complex than the union of immune responses to each disease occurring...

MDG-DDI: multi-feature drug graph for drug-drug interaction prediction

Oct 2025 | Li, Wenjun, Zhou, Yiting, Ma, Wanjun, et al.

Drug–drug interactions (DDIs) frequently occur in combination therapy and may cause adverse effects or reduced efficacy. Existing computational approaches often fail to capture both the semantic information in drug sequences and the structural properties of drug molecules, limiting predictive power. We propose MDG-DDI, a deep learning framework that integrates a Frequent...

Fine-tuning a sentence transformer for DNA

Oct 2025 | Mokoatle, Mpho, Marivate, Vukosi, Mapiye, Darlington, et al.

Sentence-transformers is a library that provides easy methods for generating embeddings for sentences, paragraphs, and images. Sentiment analysis, retrieval, and clustering are among the applications made possible by the embedding of texts in a vector space where similar texts are located close to one another. This study fine-tunes a sentence transformer model designed for...

BioSet2Vec: extraction of k-mer dictionaries from multiple sets of biological sequences via big data technologies

Oct 2025 | Galluzzo, Ylenia, Giancarlo, Raffaele, Rombo, Simona E., et al.

In several contexts involving large collections of sets of biological sequences, a relevant problem is that of selecting significant groups of k-mers that characterize one set with regard to the others in the same collection. Here a software framework is proposed that implements a novel methodology for the extraction of k-mer dictionaries from multiple sets of biological sequences...

VaMiAnalyzer: an open source, Python-based application for analysis of 3D in vitro vasculogenic mimicry assays

Oct 2025 | Moore, Stephen P. G., Zou, Anqi, Zhang, Xinyu, et al.

Vasculogenic mimicry (VM) is the phenomenon whereby non-vascular tumor cells develop vascular-like structures. VM is linked to more aggressive tumor phenotypes including higher rates of metastasis and invasion and is potentially resistant to anti-angiogenic cancer therapies. VM is investigated in vitro using 3D assays with microscopy images capturing the resulting VM structures...

Denoising self-supervised learning for disease-gene association prediction

Oct 2025 | Zhang, Yan, Xiang, Ju, Li, Jianming

Understanding the interplay between diseases and genes is crucial for gaining deeper insights into disease mechanisms and optimizing therapeutic strategies. In recent years, various computational methods have been developed to uncover potential disease-gene associations. However, existing computational approaches for disease-gene association prediction still face two major...

Unveiling molecular moieties through hierarchical Grad-CAM graph explainability

Oct 2025 | Contino, Salvatore, Sortino, Paolo, Gulotta, Maria Rita, et al.

Virtual Screening (VS) has become an essential tool in drug discovery, enabling the rapid and cost-effective identification of potential bioactive molecules. Among recent advancements, Graph Neural Networks (GNNs) have gained prominence for their ability to model complex molecular structures using graph-based representations. However, the integration of explainable methods to...

Computationally efficient multi-sample flow cytometry data analysis using Gaussian mixture models

Oct 2025 | Rutten, Philip, Mocking, Tim R., Cloos, Jacqueline, et al.

An important challenge in flow cytometry (FCM) data analysis is making comparisons of corresponding cell populations across multiple FCM samples. An interesting solution is creating a statistical mixture model for multiple samples simultaneously, as such a multi-sample model can characterize a heterogeneous set of samples, and facilitates direct comparison of cell populations...

Spatiotemporal segmentation of contraction waves in the extra-embryonic membranes of the red flour beetle

Oct 2025 | Pereyra, Marc, Golden, Mariia, Lange, Zoë, et al.

In this paper, we introduce an image analysis approach for spatiotemporal segmentation, quantification, and visualization of movement or contraction patterns in 2D+t and 3D+t microscopy recordings of biological tissues. The development of this pipeline was motivated by the observation of contraction waves in the extra-embryonic membranes of the red flour beetle Tribolium...

VESNA: an open-source tool for automated 3D vessel segmentation and network analysis

Oct 2025 | Schüttler, Magdalena, Doğan, Leyla, Kirchner, Jana, et al.

Vasculature is an essential part of all tissues and organs and is involved in a wide range of different diseases. However, available software for blood vessel image analysis is often limited: Some only process two-dimensional data, others lack batch processing, putting a time burden on the user, while still others require tightly defined culturing methods and experimental...

Topology-aware functional similarity: integrating extended neighborhoods via exponential attenuation

Oct 2025 | Wang, Peng

The annotation of protein functions constitutes a key connection between genetic sequences, molecular conformations, and biochemical roles, driving progress in biomedical studies. Traditional experimental methods are time-consuming and resource-intensive, making it difficult to meet the demand for functional annotation of a vast number of proteins in the post-genomic era. The...

Big data dimensionality reduction-based supervised machine learning algorithms for NASH diagnosis

Oct 2025 | Tutsoy, Onder, Ozturk, Huseyin Ali, Sumbul, Hilmi Erdem

Identifying Non-Alcoholic Steatohepatitis (NASH), which can cause liver failure-related morbidity, remains a challenging research problem since there is not yet a confirmed and effective approach for its early and accurate diagnosis. A large amount of medical data is collected to diagnose NASH, most of which is redundant. This paper initially focuses on selecting the...

ASET: an end-to-end pipeline for quantification and visualization of allele specific expression

Oct 2025 | Wu, Weisheng, Shedden, Kerby, Vincenz, Claudius, et al.

Allele-specific expression (ASE) analyses from RNA-Seq data provide quantitative insights into genomic imprinting and the genetic variants that affect transcription. Robust ASE analysis requires the integration of multiple computational steps, including read alignment, read counting, data visualization, and statistical testing—this complexity creates challenges for...

MultiModalGraphics: an R package for graphical integration of multi-omics datasets

Oct 2025 | Mohammed, Foziya Ahmed, Fall, El Hadj Malick, Tune, Kula Kekeba, et al.

Multimodal visualizations are essential for identifying and interpreting complex relationships in diverse, high-dimensional biological datasets. However, existing visualization tools often lack native capabilities for embedding explicit statistical and computational annotations, hindering effective quantitative interpretation. We introduce MultiModalGraphics, an R package...

GRiNS: a python library for simulating gene regulatory network dynamics

Oct 2025 | Harlapur, Pradyumna, BV, Harshavardhan, Jolly, Mohit Kumar

The emergent dynamics of complex gene regulatory networks govern various cellular processes. However, understanding these dynamics is challenging due to the difficulty of parameterizing the computational models for these networks, especially as the network size increases. Here, we introduce a simulation library, Gene Regulatory Interaction Network Simulator (GRiNS), to address...

SMILES alignment: a dynamic programming approach for the alignment of metabolites and other small organic molecules

Oct 2025 | Tang, Alexis L., Liberles, David A.

There is a need for computational approaches to compare small organic molecules based on chemical similarity or for evaluating biochemical transformations. No tool currently exists to generate global molecular alignments for small organic molecules. The study introduces a new approach to molecular alignment in the Simplified Molecular Input Line Entry System (SMILES) format. This...

Direct construction of sparse suffix arrays with Libsais

Oct 2025 | Van de Vyver, Simon, Moortele, Tibo Vande, Dawyndt, Peter, et al.

Pattern matching is a fundamental challenge in bioinformatics, especially in the fields of genomics, transcriptomics and proteomics. Efficient indexing structures, such as suffix arrays, are critical for searching large datasets. A sparse suffix array (SSA) retains only suffixes at every k-th position in the text, where k is the sparseness factor. While sparse suffix arrays offer...
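
The definition of a sparse suffix array given in this abstract can be illustrated with a minimal Python sketch. It is a naive construction for illustration only, not the Libsais-based method of the paper; the function name `sparse_suffix_array` is hypothetical, and sampling is assumed to start at position 0.

```python
def sparse_suffix_array(text: str, k: int) -> list[int]:
    """Naive sketch: sort only the suffixes starting at positions 0, k, 2k, ...

    Assumption: sampling starts at position 0. This is an illustrative
    construction (slow on long texts), not the paper's algorithm.
    """
    if k < 1:
        raise ValueError("sparseness factor k must be >= 1")
    sampled_positions = range(0, len(text), k)
    # Order the sampled start positions by the suffix that begins there.
    return sorted(sampled_positions, key=lambda i: text[i:])


# With k = 2, only the suffixes "banana" (0), "nana" (2) and "na" (4) are kept.
print(sparse_suffix_array("banana", 2))  # [0, 4, 2]
# With k = 1 the result is the ordinary (full) suffix array.
print(sparse_suffix_array("banana", 1))  # [5, 3, 1, 0, 4, 2]
```

Keeping every k-th suffix shrinks the index roughly by a factor of k, at the cost of extra work when searching for matches that start at unsampled positions.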

JINet: easy and secure private data analysis for everyone

Oct 2025 | Lalli, Giada, Collier, James, Moreau, Yves, et al.

The barriers to effective data analysis are sometimes insurmountable. Concerns ranging from privacy and security to complexity can prevent researchers from using existing data analysis tools. JINet is a web browser-based platform intended to democratise access to advanced clinical and genomic data analysis software. It hosts numerous data analysis applications that are run in the...

Generalized probabilistic canonical correlation analysis for multi-modal data integration with full or partial observations

Oct 2025 | Yang, Tianjian, Li, Wei Vivian

The integration and analysis of multi-modal data are increasingly essential across various domains including bioinformatics. As the volume and complexity of such data grow, there is a pressing need for computational models that not only integrate diverse modalities but also leverage their complementary information to improve clustering accuracy and insights, especially when...

DCMF-PPI: a protein-protein interaction predictor based on dynamic condition and multi-feature fusion

Oct 2025 | Chen, Siqi, Zheng, Anhong, Yu, Weichi, et al.

The identification of protein-protein interaction (PPI) plays a crucial role in understanding the mechanisms of complex biological processes. Current research in predicting PPI has shown remarkable progress by integrating protein information with PPI topology structure. Nevertheless, these approaches frequently overlook the dynamic nature of protein and PPI structures during...
