Biodata Mining

Papers
(The TQCC of Biodata Mining is 6. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-11-01 to 2025-11-01.)
ArticleCitations
Investigating potential drug targets for IgA nephropathy and membranous nephropathy through multi-queue plasma protein analysis: a Mendelian randomization study based on SMR and co-localization analys380
Correction: Detection and classification of long terminal repeat sequences in plant LTR-retrotransposons and their analysis using explainable machine learning333
Transcriptome-based network analysis related to regulatory T cells infiltration identified RCN1 as a potential biomarker for prognosis in clear cell renal cell carcinoma74
MOCAT: multi-omics integration with auxiliary classifiers enhanced autoencoder67
Exploring the common genetic basis of metabolic syndrome-related diseases and chronic kidney disease: insights from extensive genome-wide cross-trait analyses55
Processing imbalanced medical data at the data level with assisted-reproduction data as an example54
A simple guide to the use of Student’s t-test, Mann-Whitney U test, Chi-squared test, and Kruskal-Wallis test in biostatistics36
Deep joint learning diagnosis of Alzheimer’s disease based on multimodal feature fusion30
Polygenic risk modeling of tumor stage and survival in bladder cancer28
Unsupervised clustering based coronary artery segmentation25
circGPAcorr: an integrative tool for functional annotation of circular RNAs using expression data23
Skin in the game: a review of computational models of the skin23
Neural network methods for diagnosing patient conditions from cardiopulmonary exercise testing data21
Machine learning approaches to identify systemic lupus erythematosus in anti-nuclear antibody-positive patients using genomic data and electronic health records20
Comparing new tools of artificial intelligence to the authentic intelligence of our global health students20
Ten simple rules for providing bioinformatics support within a hospital17
The biomedical knowledge graph of symptom phenotype in coronary artery plaque: machine learning-based analysis of real-world clinical data17
Colorectal cancer subtype identification from differential gene expression levels using minimalist deep learning16
Decoding dynamic miRNA:ceRNA interactions unveils therapeutic insights and targets across predominant cancer landscapes16
Detection and classification of long terminal repeat sequences in plant LTR-retrotransposons and their analysis using explainable machine learning15
Genetics and precision health: the ecological fallacy and artificial intelligence solutions14
Mapping the evolving trend of research on efferocytosis: a comprehensive data-mining-based study13
Gaussian noise up-sampling is better suited than SMOTE and ADASYN for clinical decision making13
Supervised multiple kernel learning approaches for multi-omics data integration12
FISM: harnessing deep learning and reinforcement learning for precision detection of microaneurysms and retinal exudates for early diabetic retinopathy diagnosis12
Advancing preeclampsia prediction: a tailored machine learning pipeline integrating resampling and ensemble models for handling imbalanced medical data12
m1A-Ensem: accurate identification of 1-methyladenosine sites through ensemble models11
Correction: Motif clustering and digital biomarker extraction for free-living physical activity analysis11
Using GPT-4 to write a scientific review article: a pilot evaluation study11
Effective hybrid feature selection using different bootstrap enhances cancers classification performance10
Motif clustering and digital biomarker extraction for free-living physical activity analysis10
From COVID-19 to monkeypox: a novel predictive model for emerging infectious diseases10
An unsupervised image segmentation algorithm for coronary angiography9
MediNet: ensemble transfer learning approach for classification of medical drugs-related text reviews using significant combined-embeddings9
Interpreting drug synergy in breast cancer with deep learning using target-protein inhibition profiles9
Assessment of the causal relationship between gut microbiota and cardiovascular diseases: a bidirectional Mendelian randomization analysis9
Machine learning models for reinjury risk prediction using cardiopulmonary exercise testing (CPET) data: optimizing athlete recovery9
Understanding predictions of drug profiles using explainable machine learning models9
Predictive modeling of ALS progression: an XGBoost approach using clinical features9
Open challenges and opportunities in federated foundation models towards biomedical healthcare8
Disclosing transcriptomics network-based signatures of glioma heterogeneity using sparse methods8
Reference-free phylogeny from sequencing data8
Ensemble feature selection and tabular data augmentation with generative adversarial networks to enhance cutaneous melanoma identification and interpretability8
6mA-StackingCV: an improved stacking ensemble model for predicting DNA N6-methyladenine site7
Saliency-driven explainable deep learning in medical imaging: bridging visual explainability and statistical quantitative analysis7
A machine learning approach using conditional normalizing flow to address extreme class imbalance problems in personal health records7
Identification of immune-associated biomarkers of diabetes nephropathy tubulointerstitial injury based on machine learning: a bioinformatics multi-chip integrated analysis7
Detection of iron deficiency anemia by medical images: a comparative study of machine learning algorithms7
Deep learning-based Emergency Department In-hospital Cardiac Arrest Score (Deep EDICAS) for early prediction of cardiac arrest and cardiopulmonary resuscitation in the emergency department7
Decoding ancestry-specific genetic risk: interpretable deep feature selection reveals prostate cancer SNP disparities in diverse populations7
Changing word meanings in biomedical literature reveal pandemics and new technologies6
Endoscopy-based IBD identification by a quantized deep learning pipeline6
ChatGPT and large language models in academia: opportunities and challenges6
Optimizing age-related hearing risk predictions: an advanced machine learning integration with HHIE-S6
The Matthews correlation coefficient (MCC) should replace the ROC AUC as the standard metric for assessing binary classification6
0.29764413833618