Biodata Mining

Papers
(The median citation count of Biodata Mining is 2. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-07-01 to 2025-07-01.)
ArticleCitations
MOCAT: multi-omics integration with auxiliary classifiers enhanced autoencoder324
Transcriptome-based network analysis related to regulatory T cells infiltration identified RCN1 as a potential biomarker for prognosis in clear cell renal cell carcinoma243
A new pipeline for structural characterization and classification of RNA-Seq microbiome data52
Investigating potential drug targets for IgA nephropathy and membranous nephropathy through multi-queue plasma protein analysis: a Mendelian randomization study based on SMR and co-localization analys43
Correction: Detection and classification of long terminal repeat sequences in plant LTR-retrotransposons and their analysis using explainable machine learning43
Processing imbalanced medical data at the data level with assisted-reproduction data as an example41
Deep joint learning diagnosis of Alzheimer’s disease based on multimodal feature fusion33
Comparing new tools of artificial intelligence to the authentic intelligence of our global health students26
Machine learning approaches to identify systemic lupus erythematosus in anti-nuclear antibody-positive patients using genomic data and electronic health records25
Unsupervised clustering based coronary artery segmentation22
Personalized single-cell networks: a framework to predict the response of any gene to any drug for any patient22
Ten simple rules for providing bioinformatics support within a hospital22
Neural network methods for diagnosing patient conditions from cardiopulmonary exercise testing data22
Polygenic risk modeling of tumor stage and survival in bladder cancer21
Colorectal cancer subtype identification from differential gene expression levels using minimalist deep learning19
Detection and classification of long terminal repeat sequences in plant LTR-retrotransposons and their analysis using explainable machine learning17
Decoding dynamic miRNA:ceRNA interactions unveils therapeutic insights and targets across predominant cancer landscapes17
The biomedical knowledge graph of symptom phenotype in coronary artery plaque: machine learning-based analysis of real-world clinical data14
Supervised multiple kernel learning approaches for multi-omics data integration13
Genetics and precision health: the ecological fallacy and artificial intelligence solutions13
Comparison of 16S and whole genome dog microbiomes using machine learning10
Correction: Motif clustering and digital biomarker extraction for free-living physical activity analysis9
Gaussian noise up-sampling is better suited than SMOTE and ADASYN for clinical decision making9
m1A-Ensem: accurate identification of 1-methyladenosine sites through ensemble models9
Advancing preeclampsia prediction: a tailored machine learning pipeline integrating resampling and ensemble models for handling imbalanced medical data9
Using GPT-4 to write a scientific review article: a pilot evaluation study9
Interpreting drug synergy in breast cancer with deep learning using target-protein inhibition profiles8
Effective hybrid feature selection using different bootstrap enhances cancers classification performance8
GenoVault: a cloud based genomics repository8
Assessment of the causal relationship between gut microbiota and cardiovascular diseases: a bidirectional Mendelian randomization analysis8
Motif clustering and digital biomarker extraction for free-living physical activity analysis8
Predictive modeling of ALS progression: an XGBoost approach using clinical features8
From COVID-19 to monkeypox: a novel predictive model for emerging infectious diseases8
Understanding predictions of drug profiles using explainable machine learning models7
Reference-free phylogeny from sequencing data7
Ensemble feature selection and tabular data augmentation with generative adversarial networks to enhance cutaneous melanoma identification and interpretability7
An unsupervised image segmentation algorithm for coronary angiography7
Machine learning models for reinjury risk prediction using cardiopulmonary exercise testing (CPET) data: optimizing athlete recovery7
Disclosing transcriptomics network-based signatures of glioma heterogeneity using sparse methods7
Detection of iron deficiency anemia by medical images: a comparative study of machine learning algorithms7
A machine learning approach using conditional normalizing flow to address extreme class imbalance problems in personal health records6
Open challenges and opportunities in federated foundation models towards biomedical healthcare6
Deep learning-based Emergency Department In-hospital Cardiac Arrest Score (Deep EDICAS) for early prediction of cardiac arrest and cardiopulmonary resuscitation in the emergency department6
Saliency-driven explainable deep learning in medical imaging: bridging visual explainability and statistical quantitative analysis6
6mA-StackingCV: an improved stacking ensemble model for predicting DNA N6-methyladenine site6
Humans and machines in biomedical knowledge curation: hypertrophic cardiomyopathy molecular mechanisms’ representation6
Changing word meanings in biomedical literature reveal pandemics and new technologies5
Correction: A prognostic model based on seven immune-related genes predicts the overall survival of patients with hepatocellular carcinoma5
Evaluation of network-guided random forest for disease gene discovery5
Biological knowledge-slanted random forest approach for the classification of calcified aortic valve stenosis5
Optimizing age-related hearing risk predictions: an advanced machine learning integration with HHIE-S5
Identification of immune-associated biomarkers of diabetes nephropathy tubulointerstitial injury based on machine learning: a bioinformatics multi-chip integrated analysis5
MultiChem: predicting chemical properties using multi-view graph attention network5
The Matthews correlation coefficient (MCC) should replace the ROC AUC as the standard metric for assessing binary classification5
ChatGPT and large language models in academia: opportunities and challenges5
Endoscopy-based IBD identification by a quantized deep learning pipeline5
Revealing third-order interactions through the integration of machine learning and entropy methods in genomic studies5
A Gated Recurrent Unit based architecture for recognizing ontology concepts from biological literature4
Integrating pathway knowledge with deep neural networks to reduce the dimensionality in single-cell RNA-seq data4
Machine Learning Algorithms for understanding the determinants of under-five Mortality4
A regularized Cox hierarchical model for incorporating annotation information in predictive omic studies4
Network-based multi-omics integrative analysis methods in drug discovery: a systematic review4
ScInfoVAE: interpretable dimensional reduction of single cell transcription data with variational autoencoders and extended mutual information regularization4
Polymorphisms in the mTOR-PI3K-Akt pathway, energy balance-related exposures and colorectal cancer risk in the Netherlands Cohort Study4
Analysis of risk factors progression of preterm delivery using electronic health records4
TGNet: tensor-based graph convolutional networks for multimodal brain network analysis4
Construction and application of medication reminder system: intelligent generation of universal medication schedule4
Machine-learning-based models to predict cardiovascular risk using oculomics and clinic variables in KNHANES4
Prediction of MoRFs based on sequence properties and convolutional neural networks4
iGlioSub: an integrative transcriptomic and epigenomic classifier for glioblastoma molecular subtypes3
Novel digital approaches to the assessment of problematic opioid use3
subMG automates data submission for metagenomics studies3
eQTpLot: a user-friendly R package for the visualization of colocalization between eQTL and GWAS signals3
Enhancing hepatopathy clinical trial efficiency: a secure, large language model-powered pre-screening pipeline3
Machine learning based study for the classification of Type 2 diabetes mellitus subtypes3
Electronic medical records imputation by temporal Generative Adversarial Network3
Priority-Elastic net for binary disease outcome prediction based on multi-omics data3
Deciphering the tissue-specific functional effect of Alzheimer risk SNPs with deep genome annotation3
PAGER: A novel genotype encoding strategy for modeling deviations from additivity in complex trait association studies3
ParticleChromo3D: a Particle Swarm Optimization algorithm for chromosome 3D structure prediction from Hi-C data3
mSRFR: a machine learning model using microalgal signature features for ncRNA classification3
Algorithm-based detection of acute kidney injury according to full KDIGO criteria including urine output following cardiac surgery: a descriptive analysis3
QIGTD: identifying critical genes in the evolution of lung adenocarcinoma with tensor decomposition3
Quantum analysis of squiggle data2
Comparison of cancer subtype identification methods combined with feature selection methods in omics data analysis2
Leveraging mixed-effects regression trees for the analysis of high-dimensional longitudinal data to identify the low and high-risk subgroups: simulation study with application to genetic study2
Inverse problem for parameters identification in a modified SIRD epidemic model using ensemble neural networks2
iSuc-ChiDT: a computational method for identifying succinylation sites using statistical difference table encoding and the chi-square decision table classifier2
Agenda setting for health equity assessment through the lenses of social determinants of health using machine learning approach: a framework and preliminary pilot study2
Private pathological assessment via machine learning and homomorphic encryption2
Towards a potential pan-cancer prognostic signature for gene expression based on probesets and ensemble machine learning2
Distinct network patterns emerge from Cartesian and XOR epistasis models: a comparative network science analysis2
Automated quantitative trait locus analysis (AutoQTL)2
Influenza, dengue and common cold detection using LSTM with fully connected neural network and keywords selection2
Deep learning-based approaches for multi-omics data integration and analysis2
Feature graphs for interpretable unsupervised tree ensembles: centrality, interaction, and application in disease subtyping2
Taxonomy-based data representation for data mining: an example of the magnitude of risk associated with H. pylori infection2
Assessing the limitations of relief-based algorithms in detecting higher-order interactions2
AI as an accelerator for defining new problems that transcends boundaries2
Enhanced labor pain monitoring using machine learning and ECG waveform analysis for uterine contraction-induced pain2
Predicting molecular initiating events using chemical target annotations and gene expression2
Learning the therapeutic targets of acute myeloid leukemia through multiscale human interactome network and community analysis2
Overlapping filter bank convolutional neural network for multisubject multicategory motor imagery brain-computer interface2
0.06938099861145