BMC Bioinformatics

Papers
(The TQCC of BMC Bioinformatics is 10. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-06-01 to 2026-06-01.)
ArticleCitations
Linear programming based gene expression model (LPM-GEM) predicts the carbon source for Bacillus subtilis1572
Nonnegative matrix factorization analysis and multiple machine learning methods identified IL17C and ACOXL as novel diagnostic biomarkers for atherosclerosis346
SALON ontology for the formal description of sequence alignments164
CMIC: predicting DNA methylation inheritance of CpG islands with embedding vectors of variable-length k-mers143
Combining whole genome sequencing and non-adaptive group testing for large-scale ethnicity screens135
Weighted overlapping group lasso for integrating prior network knowledge into gene set analysis122
REDalign: accurate RNA structural alignment using residual encoder-decoder network121
SKiM-GPT: combining biomedical literature-based discovery with large language model hypothesis evaluation120
Prior knowledge on context-driven DNA fragmentation probabilities can improve de novo genome assembly algorithms120
A shrinkage-based statistical method for testing group mean differences in quantitative bottom-up proteomics118
A binary biclustering algorithm based on the adjacency difference matrix for gene expression data analysis110
SumStatsRehab: an efficient algorithm for GWAS summary statistics assessment and restoration108
CircWalk: a novel approach to predict CircRNA-disease association based on heterogeneous network representation learning104
Enabling personalised disease diagnosis by combining a patient’s time-specific gene expression profile with a biomedical knowledge base99
Prediction of hot spots in protein–DNA binding interfaces based on discrete wavelet transform and wavelet packet transform93
Machine learning for multi-omics data integration in crop improvement: a systematic review89
DualGCN-GE: integration of spatiotemporal representations from whole-blood expression data with dual-view graph convolution network to identify Parkinson’s disease subtypes73
Combining denoising of RNA-seq data and flux balance analysis for cluster analysis of single cells72
Correction: DeepSuccinylSite: a deep learning based approach for protein succinylation site prediction72
Multivariate estimation of factor structures of complex traits using SNP-based genomic relationships72
LDAGM: prediction lncRNA-disease asociations by graph convolutional auto-encoder and multilayer perceptron based on multi-view heterogeneous networks70
CoQUAD: a COVID-19 question answering dataset system, facilitating research, benchmarking, and practice69
A two-phase clustering procedure based on allele specific expression68
Mabs, a suite of tools for gene-informed genome assembly67
StackTTCA: a stacking ensemble learning-based framework for accurate and high-throughput identification of tumor T cell antigens66
HPC-T-Assembly: a pipeline for de novo transcriptome assembly of large multi-specie datasets66
A gene based combination test using GWAS summary data65
Hitac: a hierarchical taxonomic classifier for fungal ITS sequences compatible with QIIME263
Latent dirichlet allocation for double clustering (LDA-DC): discovering patients phenotypes and cell populations within a single Bayesian framework63
Locality-sensitive hashing enables efficient and scalable signal classification in high-throughput mass spectrometry raw data62
Abstraction-based segmental simulation of reaction networks using adaptive memoization62
Prediction of HIV-1 protease cleavage site from octapeptide sequence information using selected classifiers and hybrid descriptors61
Empowering the discovery of novel target-disease associations via machine learning approaches in the open targets platform61
A comparative analysis of topological domain callers over RNA-associated interactome60
PEPMatch: a tool to identify short peptide sequence matches in large sets of proteins57
Graph regularized non-negative matrix factorization with prior knowledge consistency constraint for drug–target interactions prediction56
Implementation of machine learning in the clinic: challenges and lessons in prospective deployment from the System for High Intensity EvaLuation During Radiation Therapy (SHIELD-RT) randomized control55
Multilayer network alignment based on topological assessment via embeddings55
Deep learning and multi-omics approach to predict drug responses in cancer54
Grace-AKO: a novel and stable knockoff filter for variable selection incorporating gene network structures54
Integrated analysis of the voltage-gated potassium channel-associated gene KCNH2 across cancers53
Not seeing the trees for the forest. The impact of neighbours on graph-based configurations in histopathology52
SVhound: detection of regions that harbor yet undetected structural variation51
DTIP-WINDGRU a novel drug-target interaction prediction with wind-enhanced gated recurrent unit51
A novel modality contribution confidence-enhanced multimodal deep learning framework for multiomics data50
Identification of cuproptosis-related lncRNAs to predict prognosis and immune infiltration characteristics in alimentary tract malignancies49
NODeJ: an ImageJ plugin for 3D segmentation of nuclear objects49
MTAGCN: predicting miRNA-target associations in Camellia sinensis var. assamica through graph convolution neural network49
A novel IVN-entropy based distance-driven MARCOS framework for evaluating and ranking global green hydrogen-producing countries49
MGATAF: multi-channel graph attention network with adaptive fusion for cancer-drug response prediction49
Assessment of deep learning and transfer learning for cancer prediction based on gene expression data47
INFLECT: an R-package for cytometry cluster evaluation using marker modality47
Optimal construction of a functional interaction network from pooled library CRISPR fitness screens45
MultiToxPred 1.0: a novel comprehensive tool for predicting 27 classes of protein toxins using an ensemble machine learning approach45
Impaired time-distance reconfiguration patterns in Alzheimer's disease: a dynamic functional connectivity study with 809 individuals from 7 sites45
SPAC: a scalable and integrated enterprise platform for single-cell spatial analysis44
Gene expression variability across cells and species shapes the relationship between renal resident macrophages and infiltrated macrophages44
Correction: Deep learning model integrating positron emission tomography and clinical data for prognosis prediction in non-small cell lung cancer patients44
UniAMP: enhancing AMP prediction using deep neural networks with inferred information of peptides42
DeepCAC: a deep learning approach on DNA transcription factors classification based on multi-head self-attention and concatenate convolutional neural network42
PreAcrs: a machine learning framework for identifying anti-CRISPR proteins41
Semantic interoperability: ontological unpacking of a viral conceptual model41
False discovery rate estimation using candidate peptides for each spectrum40
CCL-DTI: contributing the contrastive loss in drug–target interaction prediction40
BADASS: BActeriocin-Diversity ASsessment Software40
AutoPrompt-SAM3D: integrated generation and selection for SAM2-based 3D medical segmentation40
iDESC: identifying differential expression in single-cell RNA sequencing data with multiple subjects39
An adaptive multi-modal hybrid model for classifying thyroid nodules by combining ultrasound and infrared thermal images39
Fasta2Structure: a user-friendly tool for converting multiple aligned FASTA files to STRUCTURE format38
Can large language models understand molecules?38
VirPool: model-based estimation of SARS-CoV-2 variant proportions in wastewater samples38
DHGCMDA: a dual-view heterogeneous graph contrastive learning framework for miRNA-disease association type prediction38
GraphKM: machine and deep learning for KM prediction of wildtype and mutant enzymes38
Using empirical biological knowledge to infer regulatory networks from multi-omics data37
qRAT: an R-based stand-alone application for relative expression analysis of RT-qPCR data37
Ant colony optimization for the identification of dysregulated gene subnetworks from expression data37
LinG3D: visualizing the spatio-temporal dynamics of clonal evolution36
A prefix and attention map discrimination fusion guided attention for biomedical named entity recognition36
Propensity scores as a novel method to guide sample allocation and minimize batch effects during the design of high throughput experiments36
LincRNA ZNF529-AS1 inhibits hepatocellular carcinoma via FBXO31 and predicts the prognosis of hepatocellular carcinoma patients36
EZH2 as a prognostic-related biomarker in lung adenocarcinoma correlating with cell cycle and immune infiltrates35
A graph neural network framework for mapping histological topology in oral mucosal tissue35
‘gitana’ (phyloGenetic Imaging Tool for Adjusting Nodes and other Arrangements), a tool for plotting phylogenetic trees into ready-to-publish figures35
Identification of fish species through tRNA-based primer design35
CurvAGN: Curvature-based Adaptive Graph Neural Networks for Predicting Protein-Ligand Binding Affinity35
Integration of bulk RNA-seq pipeline metrics for assessing low-quality samples35
DiseaseNet: a transfer learning approach to noncommunicable disease classification34
Blastn2dotplots: multiple dot-plot visualizer for genome comparisons34
Glucostats: an efficient Python library for glucose time series feature extraction and visual analysis33
Piikun: an information theoretic toolkit for analysis and visualization of species delimitation metric space33
Deafness gene screening based on a multilevel cascaded BPNN model33
Informeasure: an R/bioconductor package for quantifying nonlinear dependence between variables in biological networks from an information theory perspective32
Combining single-cell ATAC and RNA sequencing for supervised cell annotation32
MR-GGI: accurate inference of gene–gene interactions using Mendelian randomization32
A clustering procedure for three-way RNA sequencing data using data transformations and matrix-variate Gaussian mixture models31
Inference of single-cell network using mutual information for scRNA-seq data analysis31
MAC-ErrorReads: machine learning-assisted classifier for filtering erroneous NGS reads31
refMLST: reference-based multilocus sequence typing enables universal bacterial typing31
Development of a TSR-based method for understanding structural relationships of cofactors and local environments in photosystem I31
SaeGraphDTI: drug–target interaction prediction based on sequence attribute extraction and graph neural network31
Conformal novelty detection for multiple metabolic networks31
A hybrid algorithm for clinical decision support in precision medicine based on machine learning31
Study on the prognosis, immune and drug resistance of m6A-related genes in lung cancer31
Fractal feature selection model for enhancing high-dimensional biological problems31
Designing multi-epitope vaccine against important colorectal cancer (CRC) associated pathogens based on immunoinformatics approach30
GSAMDA: a computational model for predicting potential microbe–drug associations based on graph attention network and sparse autoencoder30
Immunoinformatics design of multi-epitope vaccine using OmpA, OmpD and enterotoxin against non-typhoidal salmonellosis30
A seed expansion-based method to identify essential proteins by integrating protein–protein interaction sub-networks and multiple biological characteristics30
Reducing Boolean networks with backward equivalence30
A two-stage hybrid biomarker selection method based on ensemble filter and binary differential evolution incorporating binary African vultures optimization30
Statistical methods and resources for biomarker discovery using metabolomics29
Integrative analysis of TP53 mutations in lung adenocarcinoma for immunotherapies and prognosis29
BPFun: a deep learning framework for bioactive peptide function prediction using multi-label strategy by transformer-driven and sequence rich intrinsic information29
CircPrimer 2.0: a software for annotating circRNAs and predicting translation potential of circRNAs29
A novel bi-directional heterogeneous network selection method for disease and microbial association prediction29
In-vitro validated methods for encoding digital data in deoxyribonucleic acid (DNA)28
Serial KinderMiner (SKiM) discovers and annotates biomedical knowledge using co-occurrence and transformer models28
ORFeus: a computational method to detect programmed ribosomal frameshifts and other non-canonical translation events28
Single-cell spatial explorer: easy exploration of spatial and multimodal transcriptomics28
Integrated approach to generate artificial samples with low tumor fraction for somatic variant calling benchmarking28
MHESMMR: a multilevel model for predicting the regulation of miRNAs expression by small molecules28
The prognostic value of autophagy related genes with potential protective function in Ewing sarcoma27
A clinical knowledge graph-based framework to prioritize candidate genes for facilitating diagnosis of Mendelian diseases and rare genetic conditions27
Probabilistic quotient’s work and pharmacokinetics’ contribution: countering size effect in metabolic time series measurements27
PMFFRC: a large-scale genomic short reads compression optimizer via memory modeling and redundant clustering27
Prediction of mutation-induced protein stability changes based on the geometric representations learned by a self-supervised method27
The evaluation of transcription factor binding site prediction tools in human and Arabidopsis genomes27
DeShiftNet: a deformable-shifted cross-attention network for lightweight and robust organoid image segmentation27
A robust and accurate single-cell data trajectory inference method using ensemble pseudotime27
SeqForge: a scalable platform for alignment-based searches, motif detection, and sequence curation across meta/genomic datasets26
Exploring deep learning methods for recognizing rare diseases and their clinical manifestations from texts26
GVC: efficient random access compression for gene sequence variations26
Taxanorm: a novel taxa-specific normalization approach for microbiome data26
LOCC: a novel visualization and scoring of cutoffs for continuous variables with hepatocellular carcinoma prognosis as an example26
Clinical applications of machine learning in predicting 3D shapes of the human body: a systematic review26
Rendering protein mutation movies with MutAmore26
The FBA solution space kernel: introduction and illustrative examples26
ProTaxoVis—protein taxonomic visualisation of presence26
Extract antibody and antigen names from biomedical literature26
Image-centric compression of protein structures improves space savings26
Comprehensive analysis of cuproptosis-related lncRNAs in immune infiltration and prognosis in hepatocellular carcinoma25
Closha 2.0: a bio-workflow design system for massive genome data analysis on high performance cluster infrastructure25
An approach for proteins and their encoding genes synonyms integration based on protein ontology25
Ion-pumping microbial rhodopsin protein classification by machine learning approach25
Data-driven discovery of chemotactic migration of bacteria via coordinate-invariant machine learning25
Goistrat: gene-of-interest-based sample stratification for the evaluation of functional differences25
Prediction of anticancer drug sensitivity using an interpretable model guided by deep learning25
Identification of biomarkers predictive of metastasis development in early-stage colorectal cancer using network-based regularization25
DiCleave: a deep learning model for predicting human Dicer cleavage sites25
PerFSeeB: designing long high-weight single spaced seeds for full sensitivity alignment with a given number of mismatches25
Dual-approach co-expression analysis framework (D-CAF) enables identification of novel circadian co-regulation from multi-omic timeseries data25
Deep learning-enabled natural language processing to identify directional pharmacokinetic drug–drug interactions25
PhenoExam: gene set analyses through integration of different phenotype databases25
Detection of cell markers from single cell RNA-seq with sc2marker25
scSMD: a deep learning method for accurate clustering of single cells based on auto-encoder25
Multi-granularity transformer contrastive learning and feature reconstruction for prediction of disease-related miRNAs24
Robust classification of wound healing stages in both mice and humans for acute and burn wounds based on transcriptomic data24
Comparing neural models for nested and overlapping biomedical event detection24
BIOWATCH: a R shiny application for the detection of species of interest in metabarcoding datasets24
A new biomarker panel of ultraconserved long non-coding RNAs for bladder cancer prognosis by a machine learning based methodology24
GKLOMLI: a link prediction model for inferring miRNA–lncRNA interactions by using Gaussian kernel-based method on network profile and linear optimization algorithm24
MetageneCluster: a Python package for filtering conflicting signal trends in metagene plots23
Biocaiv: an integrative webserver for motif-based clustering analysis and interactive visualization of biological networks23
A fair experimental comparison of neural network architectures for latent representations of multi-omics for drug response prediction23
Prediction of diabetes disease using an ensemble of machine learning multi-classifier models23
DENSEN: a convolutional neural network for estimating chronological ages from panoramic radiographs23
AlphaFold Database Structure Extractor: a web server and API to download AlphaFold structures using common protein accessions23
A two-stage sperm holomorphological analysis method based on multi-output network construction23
Adaptive enhancement of chest X-ray images using tissue attenuation and local and global fusion23
FindCSV: a long-read based method for detecting complex structural variations22
PRED-LD: efficient imputation of GWAS summary statistics22
M01 tool: an automated, comprehensive computational tool for generating small molecule-peptide hybrids and docking them into curated protein structures22
Interpretable deep learning methods for multiview learning22
AMRViz enables seamless genomics analysis and visualization of antimicrobial resistance22
Optimize data-driven multi-agent simulation for COVID-19 transmission22
Pan-cancer integrative analysis of whole-genome De novo somatic point mutations reveals 17 cancer types22
Child-Sum EATree-LSTMs: enhanced attentive Child-Sum Tree-LSTMs for biomedical event extraction22
Boosting variant-calling performance with multi-platform sequencing data using Clair3-MP21
A voting-based machine learning approach for classifying biological and clinical datasets21
ScLSTM: single-cell type detection by siamese recurrent network and hierarchical clustering21
JCBIE: a joint continual learning neural network for biomedical information extraction21
KEGG orthology prediction of bacterial proteins using natural language processing21
Genealyzer: web application for the analysis and comparison of gene expression data21
CNVizard—a lightweight streamlit application for an interactive analysis of copy number variants21
Multi-objective data enhancement for deep learning-based ultrasound analysis21
Mdwgan-gp: data augmentation for gene expression data based on multiple discriminator WGAN-GP21
Fast and sensitive validation of fusion transcripts in whole-genome sequencing data21
Automatic block-wise genotype-phenotype association detection based on hidden Markov model21
Computational application of internationally harmonized defined approaches to skin sensitization: DASS App21
Phylo-rs: an extensible phylogenetic analysis library in rust21
C-ziptf: stable tensor factorization for zero-inflated multi-dimensional genomics data21
SEMgsa: topology-based pathway enrichment analysis with structural equation models21
Correction: On Bayesian modeling of censored data in JAGS21
Cancer detection via one-shot learning: integrating gene expression and genomic mutation analysis21
Constrained Fourier estimation of short-term time-series gene expression data reduces noise and improves clustering and gene regulatory network predictions20
An FPGA-based hardware accelerator supporting sensitive sequence homology filtering with profile hidden Markov models20
The effect of data balancing approaches on the prediction of metabolic syndrome using non-invasive parameters based on random forest20
HPC-T-Annotator: an HPC tool for de novo transcriptome assembly annotation20
ACVI-Med, an open source variant interpretation tool for medical genomics20
circGPA: circRNA functional annotation based on probability-generating functions20
Differential network connectivity analysis for microbiome data adjusted for clinical covariates using jackknife pseudo-values20
Topology-aware functional similarity: integrating extended neighborhoods via exponential attenuation20
MBECS: Microbiome Batch Effects Correction Suite20
Metacells untangle large and complex single-cell transcriptome networks20
VEBA: a modular end-to-end suite for in silico recovery, clustering, and analysis of prokaryotic, microeukaryotic, and viral genomes from metagenomes20
DANSE: a pipeline for dynamic modelling of time-series multi-omics data20
Automatic generation of pseudoknotted RNAs taxonomy19
GenMasterTable: a user-friendly desktop application for filtering, summarising, and visualising large-scale annotated genetic variants19
CDPMF-DDA: contrastive deep probabilistic matrix factorization for drug-disease association prediction19
DeepMethyGene: a deep-learning model to predict gene expression using DNA methylations19
Incorporating functional annotation with bilevel continuous shrinkage for polygenic risk prediction19
GNNs and ensemble models enhance the prediction of new sRNA-mRNA interactions in unseen conditions19
IMSE: interaction information attention and molecular structure based drug drug interaction extraction19
Implementation of ensemble machine learning algorithms on exome datasets for predicting early diagnosis of cancers19
Exploring gene-patient association to identify personalized cancer driver genes by linear neighborhood propagation19
Using individual barcodes to increase quantification power of massively parallel reporter assays19
ForestSubtype: a cancer subtype identifying approach based on high-dimensional genomic data and a parallel random forest19
Optimizing diabetes classification with a machine learning-based framework18
Graph convolution network based on meta-paths and mutual information for drug-target interaction prediction18
A multilayer dynamic perturbation analysis method for predicting ligand–protein interactions18
Drug response prediction using graph representation learning and Laplacian feature selection18
Ensemble feature selection with data-driven thresholding for Alzheimer's disease biomarker discovery18
DCI-SiteDTA: drug-target affinity prediction based on binding sites detection and site-aware dual cross-interaction block18
Denoising self-supervised learning for disease-gene association prediction18
Prediction of vancomycin initial dosage using artificial intelligence models applying ensemble strategy18
ENTAIL: yEt aNoTher amyloid fIbrils cLassifier18
GenErode: a bioinformatics pipeline to investigate genome erosion in endangered and extinct species18
GGAR: gradient guided adaptive regularization enhances deep learning classification of brassica species using codon usage bias18
BioEGRE: a linguistic topology enhanced method for biomedical relation extraction based on BioELECTRA and graph pointer neural network18
MFCADTI: improving drug-target interaction prediction by integrating multiple feature through cross attention mechanism18
ClearFinder: a Python GUI for annotating cells in cleared mouse brain17
Trade-off between conservation of biological variation and batch effect removal in deep generative modeling for single-cell transcriptomics17
ASV portal: an interface to DNA-based biodiversity data in the Living Atlas17
A novel two-way rebalancing strategy for identifying carbonylation sites17
A MATLAB-based app to improve LC–MS/MS data analysis for N-linked glycan peak identification17
Multi-task genomic prediction using gated residual variable selection neural networks17
EasyKASP: a simple and fast tool for KASP primer design17
cnnLSV: detecting structural variants by encoding long-read alignment information and convolutional neural network17
SeQual-Stream: approaching stream processing to quality control of NGS datasets17
BG2: Bayesian variable selection in generalized linear mixed models with nonlocal priors for non-Gaussian GWAS data17
Practical application of a Bayesian network approach to poultry epigenetics and stress17
An information-theoretic approach to single cell sequencing analysis17
ASET: an end-to-end pipeline for quantification and visualization of allele specific expression17
Prediction of disease-related miRNAs by voting with multiple classifiers17
Control of false discoveries in grouped hypothesis testing for eQTL data17
SynBioTools: a one-stop facility for searching and selecting synthetic biology tools17
M3S-GRPred: a novel ensemble learning approach for the interpretable prediction of glucocorticoid receptor antagonists using a multi-step stacking strategy17
Slideflow: deep learning for digital histopathology with real-time whole-slide visualization16
Feature selection followed by a novel residuals-based normalization that includes variance stabilization simplifies and improves single-cell gene expression analysis16
Cross-attention graph neural networks for inferring gene regulatory networks with skewed degree distribution16
0.46371293067932