OOIR: Observatory of International Research

Papers

(The TQCC of BMC Bioinformatics is 9. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-10-01 to 2025-10-01.)

Article	Citations
Graph regularized non-negative matrix factorization with prior knowledge consistency constraint for drug–target interactions prediction	1164
Mathematical modelling of SigE regulatory network reveals new insights into bistability of mycobacterial stress response	337
A novel nonparametric computational strategy for identifying differential methylation regions	264
REDalign: accurate RNA structural alignment using residual encoder-decoder network	228
Linear programming based gene expression model (LPM-GEM) predicts the carbon source for Bacillus subtilis	158
Nonnegative matrix factorization analysis and multiple machine learning methods identified IL17C and ACOXL as novel diagnostic biomarkers for atherosclerosis	135
Abstraction-based segmental simulation of reaction networks using adaptive memoization	96
Multivariate estimation of factor structures of complex traits using SNP-based genomic relationships	92
Grace-AKO: a novel and stable knockoff filter for variable selection incorporating gene network structures	91
Employing phylogenetic tree shape statistics to resolve the underlying host population structure	91
Locality-sensitive hashing enables efficient and scalable signal classification in high-throughput mass spectrometry raw data	90
SALON ontology for the formal description of sequence alignments	90
A drug repositioning algorithm based on a deep autoencoder and adaptive fusion	86
Prediction of hot spots in protein–DNA binding interfaces based on discrete wavelet transform and wavelet packet transform	80
Correction: DeepSuccinylSite: a deep learning based approach for protein succinylation site prediction	79
Topology preserving stratification of tissue neoplasticity using Deep Neural Maps and microRNA signatures	77
Not seeing the trees for the forest. The impact of neighbours on graph-based configurations in histopathology	74
CMIC: predicting DNA methylation inheritance of CpG islands with embedding vectors of variable-length k-mers	71
Latent dirichlet allocation for double clustering (LDA-DC): discovering patients phenotypes and cell populations within a single Bayesian framework	68
LDAGM: prediction lncRNA-disease asociations by graph convolutional auto-encoder and multilayer perceptron based on multi-view heterogeneous networks	67
Empowering the discovery of novel target-disease associations via machine learning approaches in the open targets platform	66
DualGCN-GE: integration of spatiotemporal representations from whole-blood expression data with dual-view graph convolution network to identify Parkinson’s disease subtypes	66
Combining whole genome sequencing and non-adaptive group testing for large-scale ethnicity screens	63
Deep learning and multi-omics approach to predict drug responses in cancer	62
Weighted overlapping group lasso for integrating prior network knowledge into gene set analysis	60

Implementation of machine learning in the clinic: challenges and lessons in prospective deployment from the System for High Intensity EvaLuation During Radiation Therapy (SHIELD-RT) randomized control	59
Exploring cell-specific miRNA regulation with single-cell miRNA-mRNA co-sequencing data	56
Integrated analysis of the voltage-gated potassium channel-associated gene KCNH2 across cancers	55
Predictive modeling of gene expression regulation	54
Examination of blood samples using deep learning and mobile microscopy	52
Mabs, a suite of tools for gene-informed genome assembly	52
CoQUAD: a COVID-19 question answering dataset system, facilitating research, benchmarking, and practice	51
Prediction of HIV-1 protease cleavage site from octapeptide sequence information using selected classifiers and hybrid descriptors	51
HPC-T-Assembly: a pipeline for de novo transcriptome assembly of large multi-specie datasets	50
SumStatsRehab: an efficient algorithm for GWAS summary statistics assessment and restoration	49
A gene based combination test using GWAS summary data	49
Combining denoising of RNA-seq data and flux balance analysis for cluster analysis of single cells	48
A binary biclustering algorithm based on the adjacency difference matrix for gene expression data analysis	48
PEPMatch: a tool to identify short peptide sequence matches in large sets of proteins	48
Multilayer network alignment based on topological assessment via embeddings	47
StackTTCA: a stacking ensemble learning-based framework for accurate and high-throughput identification of tumor T cell antigens	47
CircWalk: a novel approach to predict CircRNA-disease association based on heterogeneous network representation learning	47
SVDNVLDA: predicting lncRNA-disease associations by Singular Value Decomposition and node2vec	46
Enabling personalised disease diagnosis by combining a patient’s time-specific gene expression profile with a biomedical knowledge base	46
Hitac: a hierarchical taxonomic classifier for fungal ITS sequences compatible with QIIME2	46
A prefix and attention map discrimination fusion guided attention for biomedical named entity recognition	45
Ant colony optimization for the identification of dysregulated gene subnetworks from expression data	45
qRAT: an R-based stand-alone application for relative expression analysis of RT-qPCR data	43
SVhound: detection of regions that harbor yet undetected structural variation	43
UniAMP: enhancing AMP prediction using deep neural networks with inferred information of peptides	43
Can large language models understand molecules?	43
NODeJ: an ImageJ plugin for 3D segmentation of nuclear objects	41
EZH2 as a prognostic-related biomarker in lung adenocarcinoma correlating with cell cycle and immune infiltrates	41
DTIP-WINDGRU a novel drug-target interaction prediction with wind-enhanced gated recurrent unit	41
A learning-based method to predict LncRNA-disease associations by combining CNN and ELM	40
An adaptive multi-modal hybrid model for classifying thyroid nodules by combining ultrasound and infrared thermal images	40
GraphKM: machine and deep learning for KM prediction of wildtype and mutant enzymes	39
DeepCAC: a deep learning approach on DNA transcription factors classification based on multi-head self-attention and concatenate convolutional neural network	39
BADASS: BActeriocin-Diversity ASsessment Software	38
Benchmarking for biomedical natural language processing tasks with a domain specific ALBERT	38
A-Prot: protein structure modeling using MSA transformer	38
iDESC: identifying differential expression in single-cell RNA sequencing data with multiple subjects	38
Propensity scores as a novel method to guide sample allocation and minimize batch effects during the design of high throughput experiments	37
Fasta2Structure: a user-friendly tool for converting multiple aligned FASTA files to STRUCTURE format	37
‘gitana’ (phyloGenetic Imaging Tool for Adjusting Nodes and other Arrangements), a tool for plotting phylogenetic trees into ready-to-publish figures	36
MGATAF: multi-channel graph attention network with adaptive fusion for cancer-drug response prediction	36
Identification of cuproptosis-related lncRNAs to predict prognosis and immune infiltration characteristics in alimentary tract malignancies	35
Blastn2dotplots: multiple dot-plot visualizer for genome comparisons	35
Multi-view manifold regularized compact low-rank representation for cancer samples clustering on multi-omics data	35
Secondary structure specific simpler prediction models for protein backbone angles	34
Semantic interoperability: ontological unpacking of a viral conceptual model	34
LinG3D: visualizing the spatio-temporal dynamics of clonal evolution	33
MultiToxPred 1.0: a novel comprehensive tool for predicting 27 classes of protein toxins using an ensemble machine learning approach	33
INFLECT: an R-package for cytometry cluster evaluation using marker modality	33
CircMiMi: a stand-alone software for constructing circular RNA-microRNA-mRNA interactions across species	33

DiseaseNet: a transfer learning approach to noncommunicable disease classification	32
Assessment of deep learning and transfer learning for cancer prediction based on gene expression data	32
Identification of fish species through tRNA-based primer design	32
Correction: Deep learning model integrating positron emission tomography and clinical data for prognosis prediction in non-small cell lung cancer patients	32
Impaired time-distance reconfiguration patterns in Alzheimer's disease: a dynamic functional connectivity study with 809 individuals from 7 sites	32
Evaluation of tree-based statistical learning methods for constructing genetic risk scores	32
PreAcrs: a machine learning framework for identifying anti-CRISPR proteins	32
A deep learning approach to predict inter-omics interactions in multi-layer networks	32
CCL-DTI: contributing the contrastive loss in drug–target interaction prediction	32
Binding affinity prediction for protein–ligand complex using deep attention mechanism based on intermolecular interactions	32
MTAGCN: predicting miRNA-target associations in Camellia sinensis var. assamica through graph convolution neural network	32
VirPool: model-based estimation of SARS-CoV-2 variant proportions in wastewater samples	32
Gene expression variability across cells and species shapes the relationship between renal resident macrophages and infiltrated macrophages	31
Using empirical biological knowledge to infer regulatory networks from multi-omics data	31
LincRNA ZNF529-AS1 inhibits hepatocellular carcinoma via FBXO31 and predicts the prognosis of hepatocellular carcinoma patients	31
False discovery rate estimation using candidate peptides for each spectrum	31
A graph neural network framework for mapping histological topology in oral mucosal tissue	31
Optimal construction of a functional interaction network from pooled library CRISPR fitness screens	31
The evaluation of transcription factor binding site prediction tools in human and Arabidopsis genomes	30
Deafness gene screening based on a multilevel cascaded BPNN model	30
CurvAGN: Curvature-based Adaptive Graph Neural Networks for Predicting Protein-Ligand Binding Affinity	30
HGGA: hierarchical guided genome assembler	30
BPFun: a deep learning framework for bioactive peptide function prediction using multi-label strategy by transformer-driven and sequence rich intrinsic information	29
Prediction of mutation-induced protein stability changes based on the geometric representations learned by a self-supervised method	29
Exploring deep learning methods for recognizing rare diseases and their clinical manifestations from texts	29
A two-stage hybrid biomarker selection method based on ensemble filter and binary differential evolution incorporating binary African vultures optimization	29
Classifying chest CT images as COVID-19 positive/negative using a convolutional neural network ensemble model and uniform experimental design method	28
A novel bi-directional heterogeneous network selection method for disease and microbial association prediction	28
Serial KinderMiner (SKiM) discovers and annotates biomedical knowledge using co-occurrence and transformer models	28
Integrated approach to generate artificial samples with low tumor fraction for somatic variant calling benchmarking	28
SaeGraphDTI: drug–target interaction prediction based on sequence attribute extraction and graph neural network	28
Inference of single-cell network using mutual information for scRNA-seq data analysis	28
refMLST: reference-based multilocus sequence typing enables universal bacterial typing	28
MHESMMR: a multilevel model for predicting the regulation of miRNAs expression by small molecules	28
PMFFRC: a large-scale genomic short reads compression optimizer via memory modeling and redundant clustering	28
Predicting subcellular location of protein with evolution information and sequence-based deep learning	27
In-vitro validated methods for encoding digital data in deoxyribonucleic acid (DNA)	27
A seed expansion-based method to identify essential proteins by integrating protein–protein interaction sub-networks and multiple biological characteristics	27
MR-GGI: accurate inference of gene–gene interactions using Mendelian randomization	27
PyToxo: a Python tool for calculating penetrance tables of high-order epistasis models	27
Fractal feature selection model for enhancing high-dimensional biological problems	27
Combining single-cell ATAC and RNA sequencing for supervised cell annotation	26
Probabilistic quotient’s work and pharmacokinetics’ contribution: countering size effect in metabolic time series measurements	26
Glucostats: an efficient Python library for glucose time series feature extraction and visual analysis	26
A clinical knowledge graph-based framework to prioritize candidate genes for facilitating diagnosis of Mendelian diseases and rare genetic conditions	26
Reducing Boolean networks with backward equivalence	26
Development of a TSR-based method for understanding structural relationships of cofactors and local environments in photosystem I	26
A robust and accurate single-cell data trajectory inference method using ensemble pseudotime	26
CircPrimer 2.0: a software for annotating circRNAs and predicting translation potential of circRNAs	26
A clustering procedure for three-way RNA sequencing data using data transformations and matrix-variate Gaussian mixture models	26
A hybrid algorithm for clinical decision support in precision medicine based on machine learning	26
Study on the prognosis, immune and drug resistance of m6A-related genes in lung cancer	26
Piikun: an information theoretic toolkit for analysis and visualization of species delimitation metric space	26
The prognostic value of autophagy related genes with potential protective function in Ewing sarcoma	26
Conformal novelty detection for multiple metabolic networks	26
FragGeneScanRs: faster gene prediction for short reads	25
Classification of age-related macular degeneration using convolutional-neural-network-based transfer learning	25
A tensor-based bi-random walks model for protein function prediction	25
Single-cell spatial explorer: easy exploration of spatial and multimodal transcriptomics	25
Informeasure: an R/bioconductor package for quantifying nonlinear dependence between variables in biological networks from an information theory perspective	25
Investigation of improving the pre-training and fine-tuning of BERT model for biomedical relation extraction	25
MAC-ErrorReads: machine learning-assisted classifier for filtering erroneous NGS reads	25
Designing multi-epitope vaccine against important colorectal cancer (CRC) associated pathogens based on immunoinformatics approach	25
Integrative analysis of TP53 mutations in lung adenocarcinoma for immunotherapies and prognosis	24
Immunoinformatics design of multi-epitope vaccine using OmpA, OmpD and enterotoxin against non-typhoidal salmonellosis	24
Extract antibody and antigen names from biomedical literature	24
ORFeus: a computational method to detect programmed ribosomal frameshifts and other non-canonical translation events	24
GSAMDA: a computational model for predicting potential microbe–drug associations based on graph attention network and sparse autoencoder	24
Statistical methods and resources for biomarker discovery using metabolomics	24
Data-driven discovery of chemotactic migration of bacteria via coordinate-invariant machine learning	23
LOCC: a novel visualization and scoring of cutoffs for continuous variables with hepatocellular carcinoma prognosis as an example	23
LPMX: a pure rootless composable container system	23
Wavelet Screening: a novel approach to analyzing GWAS data	23
A consensus-based ensemble approach to improve transcriptome assembly	23
The FBA solution space kernel: introduction and illustrative examples	23
Comparative analysis of aneurysm subtypes associated genes based on protein–protein interaction network	23
DiCleave: a deep learning model for predicting human Dicer cleavage sites	23
Using amino acids co-occurrence matrices and explainability model to investigate patterns in dengue virus proteins	22
Using entropy-driven amplifier circuit response to build nonlinear model under the influence of Lévy jump	22
scSMD: a deep learning method for accurate clustering of single cells based on auto-encoder	22

PerFSeeB: designing long high-weight single spaced seeds for full sensitivity alignment with a given number of mismatches	22
Rendering protein mutation movies with MutAmore	22
Protein complexes detection based on node local properties and gene expression in PPI weighted networks	22
Closha 2.0: a bio-workflow design system for massive genome data analysis on high performance cluster infrastructure	22
Prediction of anticancer drug sensitivity using an interpretable model guided by deep learning	21
GVC: efficient random access compression for gene sequence variations	21
ProTaxoVis—protein taxonomic visualisation of presence	21
Dual-approach co-expression analysis framework (D-CAF) enables identification of novel circadian co-regulation from multi-omic timeseries data	21
A fair experimental comparison of neural network architectures for latent representations of multi-omics for drug response prediction	21
Biocaiv: an integrative webserver for motif-based clustering analysis and interactive visualization of biological networks	21
Clinical applications of machine learning in predicting 3D shapes of the human body: a systematic review	21
Taxanorm: a novel taxa-specific normalization approach for microbiome data	21
Goistrat: gene-of-interest-based sample stratification for the evaluation of functional differences	21
An approach for proteins and their encoding genes synonyms integration based on protein ontology	21
Image-centric compression of protein structures improves space savings	20
Comparing neural models for nested and overlapping biomedical event detection	20
PhenoExam: gene set analyses through integration of different phenotype databases	20
Bayesian variable selection for high-dimensional data with an ordinal response: identifying genes associated with prognostic risk group in acute myeloid leukemia	20
Deep learning-enabled natural language processing to identify directional pharmacokinetic drug–drug interactions	20
A new biomarker panel of ultraconserved long non-coding RNAs for bladder cancer prognosis by a machine learning based methodology	20
DENSEN: a convolutional neural network for estimating chronological ages from panoramic radiographs	20
Robust classification of wound healing stages in both mice and humans for acute and burn wounds based on transcriptomic data	20
Comprehensive analysis of cuproptosis-related lncRNAs in immune infiltration and prognosis in hepatocellular carcinoma	19
Prediction of diabetes disease using an ensemble of machine learning multi-classifier models	19
Child-Sum EATree-LSTMs: enhanced attentive Child-Sum Tree-LSTMs for biomedical event extraction	19
Correction to: Mining a stroke knowledge graph from literature	19
Identification of biomarkers predictive of metastasis development in early-stage colorectal cancer using network-based regularization	19
Impact of gene annotation choice on the quantification of RNA-seq data	19
SEMgsa: topology-based pathway enrichment analysis with structural equation models	19
Differential network connectivity analysis for microbiome data adjusted for clinical covariates using jackknife pseudo-values	19
Correction to: ncDLRES: a novel method for non‑coding RNAs family prediction based on dynamic LSTM and ResNet	19
Ion-pumping microbial rhodopsin protein classification by machine learning approach	19
Comparison of sequencing data processing pipelines and application to underrepresented African human populations	19
Detection of cell markers from single cell RNA-seq with sc2marker	19
MetageneCluster: a Python package for filtering conflicting signal trends in metagene plots	19
Optimize data-driven multi-agent simulation for COVID-19 transmission	19
GKLOMLI: a link prediction model for inferring miRNA–lncRNA interactions by using Gaussian kernel-based method on network profile and linear optimization algorithm	19
LaRA 2: parallel and vectorized program for sequence–structure alignment of RNA sequences	19
Constrained Fourier estimation of short-term time-series gene expression data reduces noise and improves clustering and gene regulatory network predictions	18
MFCADTI: improving drug-target interaction prediction by integrating multiple feature through cross attention mechanism	18
Correction: On Bayesian modeling of censored data in JAGS	18
KEGG orthology prediction of bacterial proteins using natural language processing	18
DeepMethyGene: a deep-learning model to predict gene expression using DNA methylations	18
FindCSV: a long-read based method for detecting complex structural variations	18
ForestSubtype: a cancer subtype identifying approach based on high-dimensional genomic data and a parallel random forest	18
Metacells untangle large and complex single-cell transcriptome networks	18
Multi-objective data enhancement for deep learning-based ultrasound analysis	18
ScLSTM: single-cell type detection by siamese recurrent network and hierarchical clustering	18
AMRViz enables seamless genomics analysis and visualization of antimicrobial resistance	18
GenErode: a bioinformatics pipeline to investigate genome erosion in endangered and extinct species	18
The effect of data balancing approaches on the prediction of metabolic syndrome using non-invasive parameters based on random forest	18
RSNET: inferring gene regulatory networks by a redundancy silencing and network enhancement technique	17
MBECS: Microbiome Batch Effects Correction Suite	17
Implementation of ensemble machine learning algorithms on exome datasets for predicting early diagnosis of cancers	17
BioEGRE: a linguistic topology enhanced method for biomedical relation extraction based on BioELECTRA and graph pointer neural network	17
CRPGCN: predicting circRNA-disease associations using graph convolutional network based on heterogeneous network	17
Pan-cancer integrative analysis of whole-genome De novo somatic point mutations reveals 17 cancer types	17
Expression-based species deconvolution and realignment removes misalignment error in multispecies single-cell data	17
Exploring gene-patient association to identify personalized cancer driver genes by linear neighborhood propagation	17
HGDTI: predicting drug–target interaction by using information aggregation based on heterogeneous graph neural network	17
An FPGA-based hardware accelerator supporting sensitive sequence homology filtering with profile hidden Markov models	17
Boosting variant-calling performance with multi-platform sequencing data using Clair3-MP	17
Automatic block-wise genotype-phenotype association detection based on hidden Markov model	17
BarWare: efficient software tools for barcoded single-cell genomics	17
Incorporating functional annotation with bilevel continuous shrinkage for polygenic risk prediction	17
C-ziptf: stable tensor factorization for zero-inflated multi-dimensional genomics data	17
M01 tool: an automated, comprehensive computational tool for generating small molecule-peptide hybrids and docking them into curated protein structures	17
Fast and sensitive validation of fusion transcripts in whole-genome sequencing data	17
ANINet: a deep neural network for skull ancestry estimation	17
CNVizard—a lightweight streamlit application for an interactive analysis of copy number variants	17
Clustering biological sequences with dynamic sequence similarity threshold	17
GNNs and ensemble models enhance the prediction of new sRNA-mRNA interactions in unseen conditions	17
MSA: reproducible mutational signature attribution with confidence based on simulations	16
CDPMF-DDA: contrastive deep probabilistic matrix factorization for drug-disease association prediction	16
HPC-T-Annotator: an HPC tool for de novo transcriptome assembly annotation	16
PRED-LD: efficient imputation of GWAS summary statistics	16
SeQual-Stream: approaching stream processing to quality control of NGS datasets	16
IMSE: interaction information attention and molecular structure based drug drug interaction extraction	16
Optimizing diabetes classification with a machine learning-based framework	16
Prediction of vancomycin initial dosage using artificial intelligence models applying ensemble strategy	16
A voting-based machine learning approach for classifying biological and clinical datasets	16
Mdwgan-gp: data augmentation for gene expression data based on multiple discriminator WGAN-GP	16
JCBIE: a joint continual learning neural network for biomedical information extraction	16
ImmunoDataAnalyzer: a bioinformatics pipeline for processing barcoded and UMI tagged immunological NGS data	16
EMDL_m6Am: identifying N6,2′-O-dimethyladenosine sites based on stacking ensemble deep learning	16
Drug response prediction using graph representation learning and Laplacian feature selection	16
Interpretable deep learning methods for multiview learning	16
Using individual barcodes to increase quantification power of massively parallel reporter assays	16
Genealyzer: web application for the analysis and comparison of gene expression data	16
Computational application of internationally harmonized defined approaches to skin sensitization: DASS App	16
Automatic generation of pseudoknotted RNAs taxonomy	16
circGPA: circRNA functional annotation based on probability-generating functions	16
A MATLAB-based app to improve LC–MS/MS data analysis for N-linked glycan peak identification	16
VEBA: a modular end-to-end suite for in silico recovery, clustering, and analysis of prokaryotic, microeukaryotic, and viral genomes from metagenomes	16
GenMasterTable: a user-friendly desktop application for filtering, summarising, and visualising large-scale annotated genetic variants	16