Statistical Analysis and Data Mining

Papers
(The TQCC of Statistical Analysis and Data Mining is 2. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-06-01 to 2025-06-01.)
ArticleCitations
430
Semi‐Parametric Least‐Area Linear‐Circular Regression Through Möbius Transformation27
Sample selection bias in evaluation of prediction performance of causal models19
Data‐drivensparse partial least squares17
Predictive models with end user preference15
Modeling and inference for mixtures of simple symmetric exponential families of ‐dimensional distributions for vectors with binary coordinates15
Randomized multiarm bandits: An improved adaptive data collection method12
12
Survival trees based on heterogeneity in time‐to‐event and censoring distributions using parameter instability test12
Issue Information11
Model Averaging for Regression Kink Models10
Some Bayesian biclustering methods: Modeling and inference9
Data Twinning9
Kernel learning with nonconvex ramp loss9
CLADAG 2021 special issue: Selected papers on classification and data analysis8
BayesMultiomics: An R Package for Bayesian Shrinkage Models for Integration and Analysis of Multi‐Platform High‐Dimensional Genomics Data8
Optimal ratio for data splitting7
An efficientk‐modes algorithm for clustering categorical datasets6
Negative binomial graphical model with excess zeros6
Multi‐node Expectation–Maximization algorithm for finite mixture models5
Bayesian shrinkage models for integration and analysis of multiplatform high‐dimensional genomics data5
Weighted AutoEncoding recommender system5
Integrative learning of structuredhigh‐dimensionaldata from multiple datasets5
5
Tracking clusters and anomalies in evolving data streams5
Issue Information5
An ImprovedD2GAN‐based oversampling algorithm for imbalanced data classification4
A tree‐based gene–environment interaction analysis with rare features4
Comparison of merging strategies for building machine learning models on multiple independent gene expression data sets4
Robust deep neural network surrogate models with uncertainty quantification via adversarial training4
Model‐Based Recursive Partitioning for Discrete Event Times3
Issue Information3
Bayesian inference for nonprobability samples with nonignorable missingness3
Multivariate contaminated normal mixture regression modeling of longitudinal data based on jointmean‐covariancemodel3
Input‐response space‐filling designs incorporating response uncertainty3
A finely tuned deep transfer learning algorithm to compare outsole images3
Nonparametric clustering of RNA‐sequencing data3
The analysis of association rules: Latent class analysis3
Estimating basis functions in massive fields under the spatial mixed effects model3
3
Sparse Bayesian variable selection in high‐dimensional logistic regression models with correlated priors3
Local influence analysis for the sliced average third‐moment estimation3
The fairness‐accuracy Pareto front3
On difference‐based gradient estimation in nonparametric regression3
Development and validation of models for two‐week mortality of inpatients with COVID‐19 infection: A large prospective cohort study2
Issue Information2
A new formulation of sparse multiple kernel k$$ k $$‐means clustering and its applications2
Bayesian Hybrid Model Search and Averaging for Sparse Gaussian Process Regression2
A deep learning factor analysis model based on importance‐weighted variational inference and normalizing flow priors: Evaluation within a set of multidimensional performance assessments in youth elite2
Data‐driven stochastic model for quantifying the interplay between amyloid‐beta and calcium levels in Alzheimer's disease2
A Conversational Assistant for Democratization of Data Visualization: A Comparative Study of Two Approaches of Interaction2
Interaction Tests With Covariate‐Adaptive Randomization2
Sketched Stochastic Dictionary Learning for large‐scale data and application to high‐throughput mass spectrometry2
Issue Information2
Density estimation via measure transport: Outlook for applications in the biological sciences2
Adversarially robust subspace learning in the spiked covariance model2
Semiparametric detection of changepoints in location, scale, and copula2
2
Driving mode analysis—How uncertain functional inputs propagate to an output2
A Novel Approach for APT Detection Based on Ensemble Learning Model2
Bayesian modeling of location, scale, and shape parameters in skew‐normal regression models2
eRPCA: Robust Principal Component Analysis for Exponential Family Distributions2
Biclustering high‐frequency financial time series based on information theory2
Bayesian Posterior Interval Calibration to Improve the Interpretability of Observational Studies2
Simplicial depth and its median: Selected properties and limitations2
Cost‐sensitive classification with time constraint on incomplete data2
0.10657286643982