Biodata Mining

Papers
(The median citation count of Biodata Mining is 2. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-09-01 to 2025-09-01.)
ArticleCitations
Investigating potential drug targets for IgA nephropathy and membranous nephropathy through multi-queue plasma protein analysis: a Mendelian randomization study based on SMR and co-localization analys359
Correction: Detection and classification of long terminal repeat sequences in plant LTR-retrotransposons and their analysis using explainable machine learning296
Deep joint learning diagnosis of Alzheimer’s disease based on multimodal feature fusion60
Processing imbalanced medical data at the data level with assisted-reproduction data as an example51
MOCAT: multi-omics integration with auxiliary classifiers enhanced autoencoder49
Exploring the common genetic basis of metabolic syndrome-related diseases and chronic kidney disease: insights from extensive genome-wide cross-trait analyses47
A simple guide to the use of Student’s t-test, Mann-Whitney U test, Chi-squared test, and Kruskal-Wallis test in biostatistics41
Transcriptome-based network analysis related to regulatory T cells infiltration identified RCN1 as a potential biomarker for prognosis in clear cell renal cell carcinoma33
Ten simple rules for providing bioinformatics support within a hospital27
Neural network methods for diagnosing patient conditions from cardiopulmonary exercise testing data26
Polygenic risk modeling of tumor stage and survival in bladder cancer23
Comparing new tools of artificial intelligence to the authentic intelligence of our global health students23
circGPAcorr: an integrative tool for functional annotation of circular RNAs using expression data20
Unsupervised clustering based coronary artery segmentation19
Skin in the game: a review of computational models of the skin17
Machine learning approaches to identify systemic lupus erythematosus in anti-nuclear antibody-positive patients using genomic data and electronic health records17
The biomedical knowledge graph of symptom phenotype in coronary artery plaque: machine learning-based analysis of real-world clinical data14
Colorectal cancer subtype identification from differential gene expression levels using minimalist deep learning14
Detection and classification of long terminal repeat sequences in plant LTR-retrotransposons and their analysis using explainable machine learning13
Decoding dynamic miRNA:ceRNA interactions unveils therapeutic insights and targets across predominant cancer landscapes12
Gaussian noise up-sampling is better suited than SMOTE and ADASYN for clinical decision making11
Genetics and precision health: the ecological fallacy and artificial intelligence solutions11
Mapping the evolving trend of research on efferocytosis: a comprehensive data-mining-based study11
Supervised multiple kernel learning approaches for multi-omics data integration11
Advancing preeclampsia prediction: a tailored machine learning pipeline integrating resampling and ensemble models for handling imbalanced medical data10
Assessment of the causal relationship between gut microbiota and cardiovascular diseases: a bidirectional Mendelian randomization analysis9
Interpreting drug synergy in breast cancer with deep learning using target-protein inhibition profiles9
Correction: Motif clustering and digital biomarker extraction for free-living physical activity analysis9
Motif clustering and digital biomarker extraction for free-living physical activity analysis9
Machine learning models for reinjury risk prediction using cardiopulmonary exercise testing (CPET) data: optimizing athlete recovery9
Using GPT-4 to write a scientific review article: a pilot evaluation study9
From COVID-19 to monkeypox: a novel predictive model for emerging infectious diseases9
m1A-Ensem: accurate identification of 1-methyladenosine sites through ensemble models9
Understanding predictions of drug profiles using explainable machine learning models8
Predictive modeling of ALS progression: an XGBoost approach using clinical features8
Open challenges and opportunities in federated foundation models towards biomedical healthcare8
Effective hybrid feature selection using different bootstrap enhances cancers classification performance8
An unsupervised image segmentation algorithm for coronary angiography8
6mA-StackingCV: an improved stacking ensemble model for predicting DNA N6-methyladenine site7
Detection of iron deficiency anemia by medical images: a comparative study of machine learning algorithms7
Ensemble feature selection and tabular data augmentation with generative adversarial networks to enhance cutaneous melanoma identification and interpretability7
Humans and machines in biomedical knowledge curation: hypertrophic cardiomyopathy molecular mechanisms’ representation7
Reference-free phylogeny from sequencing data7
Deep learning-based Emergency Department In-hospital Cardiac Arrest Score (Deep EDICAS) for early prediction of cardiac arrest and cardiopulmonary resuscitation in the emergency department7
Saliency-driven explainable deep learning in medical imaging: bridging visual explainability and statistical quantitative analysis7
Disclosing transcriptomics network-based signatures of glioma heterogeneity using sparse methods7
Changing word meanings in biomedical literature reveal pandemics and new technologies6
Optimizing age-related hearing risk predictions: an advanced machine learning integration with HHIE-S6
Identification of immune-associated biomarkers of diabetes nephropathy tubulointerstitial injury based on machine learning: a bioinformatics multi-chip integrated analysis6
A machine learning approach using conditional normalizing flow to address extreme class imbalance problems in personal health records6
The Matthews correlation coefficient (MCC) should replace the ROC AUC as the standard metric for assessing binary classification6
A Gated Recurrent Unit based architecture for recognizing ontology concepts from biological literature5
Correction: A prognostic model based on seven immune-related genes predicts the overall survival of patients with hepatocellular carcinoma5
Exo-Tox: Identifying Exotoxins from secreted bacterial proteins5
Evaluation of network-guided random forest for disease gene discovery5
ChatGPT and large language models in academia: opportunities and challenges5
TGNet: tensor-based graph convolutional networks for multimodal brain network analysis5
Endoscopy-based IBD identification by a quantized deep learning pipeline5
Network-based multi-omics integrative analysis methods in drug discovery: a systematic review4
Machine Learning Algorithms for understanding the determinants of under-five Mortality4
Machine-learning-based models to predict cardiovascular risk using oculomics and clinic variables in KNHANES4
Analysis of risk factors progression of preterm delivery using electronic health records4
Revealing third-order interactions through the integration of machine learning and entropy methods in genomic studies4
Construction and application of medication reminder system: intelligent generation of universal medication schedule4
The ethics of data mining in healthcare: challenges, frameworks, and future directions4
Polymorphisms in the mTOR-PI3K-Akt pathway, energy balance-related exposures and colorectal cancer risk in the Netherlands Cohort Study4
MultiChem: predicting chemical properties using multi-view graph attention network4
Integrating pathway knowledge with deep neural networks to reduce the dimensionality in single-cell RNA-seq data4
ScInfoVAE: interpretable dimensional reduction of single cell transcription data with variational autoencoders and extended mutual information regularization4
A regularized Cox hierarchical model for incorporating annotation information in predictive omic studies4
Deciphering the tissue-specific functional effect of Alzheimer risk SNPs with deep genome annotation3
subMG automates data submission for metagenomics studies3
Novel digital approaches to the assessment of problematic opioid use3
QIGTD: identifying critical genes in the evolution of lung adenocarcinoma with tensor decomposition3
Priority-Elastic net for binary disease outcome prediction based on multi-omics data3
Electronic medical records imputation by temporal Generative Adversarial Network3
Quantum analysis of squiggle data3
Algorithm-based detection of acute kidney injury according to full KDIGO criteria including urine output following cardiac surgery: a descriptive analysis3
Enhancing hepatopathy clinical trial efficiency: a secure, large language model-powered pre-screening pipeline3
ParticleChromo3D: a Particle Swarm Optimization algorithm for chromosome 3D structure prediction from Hi-C data3
mSRFR: a machine learning model using microalgal signature features for ncRNA classification3
Towards a potential pan-cancer prognostic signature for gene expression based on probesets and ensemble machine learning3
Machine learning based study for the classification of Type 2 diabetes mellitus subtypes3
PAGER: A novel genotype encoding strategy for modeling deviations from additivity in complex trait association studies3
AI as an accelerator for defining new problems that transcends boundaries2
Enhanced labor pain monitoring using machine learning and ECG waveform analysis for uterine contraction-induced pain2
Influenza, dengue and common cold detection using LSTM with fully connected neural network and keywords selection2
Automated quantitative trait locus analysis (AutoQTL)2
Private pathological assessment via machine learning and homomorphic encryption2
Can open source large language models be used for tumor documentation in Germany?—An evaluation on urological doctors’ notes2
A deep learning approach for classifying and predicting children's nutritional status in Ethiopia using LSTM-FC neural networks2
Overlapping filter bank convolutional neural network for multisubject multicategory motor imagery brain-computer interface2
Comparison of cancer subtype identification methods combined with feature selection methods in omics data analysis2
Short- and long-term weekly patient-reported outcomes prediction undergoing radiotherapy: single-patient time series model vs. transformer-based multi-patient time series model2
Agenda setting for health equity assessment through the lenses of social determinants of health using machine learning approach: a framework and preliminary pilot study2
Inverse problem for parameters identification in a modified SIRD epidemic model using ensemble neural networks2
Feature graphs for interpretable unsupervised tree ensembles: centrality, interaction, and application in disease subtyping2
Deep learning-based approaches for multi-omics data integration and analysis2
iSuc-ChiDT: a computational method for identifying succinylation sites using statistical difference table encoding and the chi-square decision table classifier2
Distinct network patterns emerge from Cartesian and XOR epistasis models: a comparative network science analysis2
Assessing the limitations of relief-based algorithms in detecting higher-order interactions2
Learning the therapeutic targets of acute myeloid leukemia through multiscale human interactome network and community analysis2
Leveraging mixed-effects regression trees for the analysis of high-dimensional longitudinal data to identify the low and high-risk subgroups: simulation study with application to genetic study2
Correction: Predictive modeling of ALS progression: an XGBoost approach using clinical features2
Predicting molecular initiating events using chemical target annotations and gene expression2
0.040585041046143