Statistical Analysis and Data Mining

Papers
(The median citation count of Statistical Analysis and Data Mining is 0. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-05-01 to 2026-05-01.)
ArticleCitations
Randomized multiarm bandits: An improved adaptive data collection method28
Deep Learning for Variable Selection in Censored Quantile Regression Models20
Semi‐Parametric Least‐Area Linear‐Circular Regression Through Möbius Transformation20
Testing for the Important Components of Predictive Variance18
10
BayesMultiomics : An R Package for Bayesian Shrinkage Models for Integration and Analysis of Multi‐Platform High‐Dimensional Genomics Data9
Issue Information9
CLADAG 2021 special issue: Selected papers on classification and data analysis9
Kernel learning with nonconvex ramp loss8
Model Averaging for Regression Kink Models7
7
Transfer Learning for High‐Dimensional Beta Regression Models7
Bayesian shrinkage models for integration and analysis of multiplatform high‐dimensional genomics data7
Integrative learning of structuredhigh‐dimensionaldata from multiple datasets6
Rank‐Based Inference for Conditional Independence Graph With Missing Values6
Testing Hypotheses of Covariate Effects on Topics of Discourse5
An ImprovedD2GAN‐based oversampling algorithm for imbalanced data classification5
Robust Model‐Based Semi‐Supervised Clustering of Incomplete Records5
Issue Information5
Variational Autoencoder With Gamma Mixture for Clustering High‐Dimensional Right‐Skewed Data5
On difference‐based gradient estimation in nonparametric regression5
Robust and Differentially Private Principal Component Analysis5
A finely tuned deep transfer learning algorithm to compare outsole images4
Issue Information4
Extracting Genetically‐Imputed Causal Features From ECG Data4
Multivariate contaminated normal mixture regression modeling of longitudinal data based on jointmean‐covariancemodel4
Model‐Based Recursive Partitioning for Discrete Event Times4
Bayesian Dirichlet Process Copula Mixtures for Heterogeneous Multi‐Cluster Data: Methods and an NBA Player Stats Application4
Robust deep neural network surrogate models with uncertainty quantification via adversarial training4
Nonparametric clustering of RNA‐sequencing data4
Bayesian inference for nonprobability samples with nonignorable missingness4
Input‐response space‐filling designs incorporating response uncertainty4
Sparse Bayesian variable selection in high‐dimensional logistic regression models with correlated priors3
The analysis of association rules: Latent class analysis3
Issue Information3
Issue Information3
eRPCA: Robust Principal Component Analysis for Exponential Family Distributions3
Recovering the Number of Clusters From a Laplacian Matrix by Nuclear Norm Penalization3
Stabilizing Inference in Dirichlet Regression via Ridge‐Penalized Model3
Driving mode analysis—How uncertain functional inputs propagate to an output3
Transfer Learning for Linearized Maximum Rank Correlation Estimation3
A new formulation of sparse multiple kernel k$$ k $$‐means clustering and its applications3
Interaction Tests With Covariate‐Adaptive Randomization3
A Novel Approach for APT Detection Based on Ensemble Learning Model3
Low‐Dimensional Adaptive Neural Network Regression With Directional Change Detection via Nuclear Norm Penalization2
A deep learning factor analysis model based on importance‐weighted variational inference and normalizing flow priors: Evaluation within a set of multidimensional performance assessments in youth elite2
Semiparametric estimation of average treatment effects in observational studies2
Constructing Cell‐Type Taxonomy by Optimal Transport With Relaxed Marginal Constraints2
2
Cost‐sensitive classification with time constraint on incomplete data2
2
Data‐driven stochastic model for quantifying the interplay between amyloid‐beta and calcium levels in Alzheimer's disease2
Simplicial depth and its median: Selected properties and limitations2
Density estimation via measure transport: Outlook for applications in the biological sciences2
Online Updating Composite Quantile Regression for Streaming Data2
Bayesian Hybrid Model Search and Averaging for Sparse Gaussian Process Regression2
Semiparametric detection of changepoints in location, scale, and copula2
Model Average Estimation of Parameters in Linear Model With Multiple Change Points2
Large Multi‐Response Linear Regression Estimation Based on Low‐Rank Pre‐Smoothing2
A Conversational Assistant for Democratization of Data Visualization: A Comparative Study of Two Approaches of Interaction2
Adaptive Weighted Regularized QRGRU Algorithm and Its Application in Stock Price Prediction2
1
Bayesian Posterior Interval Calibration to Improve the Interpretability of Observational Studies1
1
Issue Information1
Characterizing climate pathways using feature importance on echo state networks1
Hierarchy‐assisted gene expression regulatory network analysis1
A Hybrid Model for Imbalanced Data Classification Using Dynamic Threshold Tuning and Particle Swarm Optimization‐Enhanced Kernel Transformations1
An automated alignment algorithm for identification of the source of footwear impressions with common class characteristics1
Score Tests for Overdispersion in Marginalized Zero‐Inflated Poisson Regression Based on Marginalized Zero‐Inflated Generalized Poisson Model1
Coupled support tensor machine classification for multimodal neuroimaging data1
Online Variable Selection and Parameter Estimation for Massive Data via Square Root Lasso1
A new parametric approach to gender gap with application to EUSILC data in Poland and Italy1
Convolutional Sparse Coding for Time Series Via a ℓ0 Penalty: An Efficient Algorithm With Statistical Guarantees1
Transfer learning under the Cox model with interval‐censored data1
Confidence bounds for threshold similarity graph in random variable network1
Cluster analysis via random partition distributions1
Modeling matrix variate time series via hidden Markov models with skewed emissions1
Recursive Random Binning to Detect and Display Pairwise Dependence1
Individualized image region detection with total variation1
Buckley–Jamesestimation of generalized additive accelerated lifetime model with ultrahigh‐dimensional data1
An Efficient Filtering Approach for Model Estimation in Sparse Regression1
Wasserstein Centroid‐Based Binary Classification for Distributional Data1
Dynamic Clustering of Multivariate Time Series: Modeling Time‐Varying Memberships1
Persistent Classification: Understanding Adversarial Attacks by Studying Decision Boundary Dynamics1
Quantifying Epistemic Uncertainty in Binary Classification via Accuracy Gain1
1
An Adaptable Bayesian Quantile‐Based Regression Framework for Asymmetric Data Structures and Extreme Points1
1
Triangulation‐Based Spatial Clustering for Adjacent Data With Heterogeneous Density1
Corrigendum1
Estimation of disease progression for ischemic heart disease using latent Markov with covariates1
Bayesian batch optimization for molybdenum versus tungsten inertial confinement fusion double shell target design1
Gaussian process selections in semiparametric multi‐kernel machine regression for multi‐pathway analysis1
Issue Information1
Imputed quantile vector autoregressive model for multivariate spatial–temporal data1
Issue Information1
A machine learning oracle for parameter estimation1
Simplicial depth: Characterization and reconstruction1
1
Share density‐based clustering of income data1
1
1
Survival on Image Regression With Application to Partially Functional Distributional Representation of Physical Activity1
A Homogeneity Test for Ordinal Receiver Operating Characteristic Regression With Application to Facial Recognition Accuracy Assessment1
Same/Other/All K‐Fold Cross‐Validation for Estimating Similarity of Patterns in Data Subsets1
Benchmarking of Clustering Validity Measures Revisited0
A treeless absolutely random forest with closed‐form estimators of expected proximities0
The Classification Algorithm Based on Functional Logistic Regression Model With Spatial Effects and Its Application in Air Quality Analysis0
The generalized hyperbolic family and automatic model selection through the multiple‐choiceLASSO0
A novel two‐step extrapolation‐insertion risk model based on the Expectile under the Pareto‐type distribution0
A new logarithmic multiplicative distortion for correlation analysis0
Subsampling under distributional constraints0
Error‐controlled feature selection for ultrahigh‐dimensional and highly correlated feature space using deep learning0
Prior effective sample size for exponential family distributions with multiple parameters0
Rarity updated ensemble with oversampling: An ensemble approach to classification of imbalanced data streams0
Issue Information0
Classifier Performance on Long‐Tail Distributions0
Frequentist model averaging for zero‐inflated Poisson regression models0
Nonlinear Superpopulation Model Inference for Non‐Probability Samples With Nonignorable Missingness0
Issue Information0
Clustering of Longitudinal Data: A Tutorial on a Variety of Approaches0
Nonparametric mean and variance adaptive classification rule for high‐dimensional data with heteroscedastic variances0
Correspondence Analysis From the Viewpoint of Compositional Tables0
Issue Information0
Stratified learning: A general‐purpose statistical method for improved learning under covariate shift0
Identifying Nuclear Data Correlated Through Predicting Bias in Integral Experiments via Applying Principal Component Analysis to Random Forest0
Issue Information0
Noise‐Augmented ℓ0 Regularization of Tensor Regression With Tucker Decomposition0
An auxiliary Part‐of‐Speech tagger for blog and microblog cyber‐slang0
The finite mixture model for the tails of distribution: Monte Carlo experiment and empirical applications0
Neural Estimation of Treatment Bridge Functions for Proximal Causal Inference0
Cumulative Differences Between Paired Samples0
Sequential metamodel‐based approaches to level‐set estimation under heteroscedasticity0
A Transformation‐Based Direction Combination Association Test for GWAS Summary Statistics0
Compositional variable selection in quantile regression for microbiome data with false discovery rate control0
Randomized algorithms for tensor response regression0
Issue Information0
Issue Information0
Evaluation and interpretation of driving risks: Automobile claim frequency modeling with telematics data0
0
Towards accelerating particle‐resolved direct numerical simulation with neural operators0
Node Centrality Inference via Hypothesis Testing0
Analysis of Correlated Image Features Using Scalar‐On‐Matrix Regression0
0
0
Local support vector machine based dimension reduction0
Categorical classifiers in multiclass classification with imbalanced datasets0
Greenwood Statistic Under Distortion Measurement Errors0
A linear time method for the detection of collective and point anomalies0
Sequence Outlier Detection and Application of Gated Recurrent Unit Autoencoder Gaussian Mixture Model Based on Various Loss Optimization0
Pointwise Entropy Distributions for Community‐Level Hypothesis Testing in High‐Dimensional and Sparse Microbiome Data0
Nonparametric Bayesian functional clustering with applications to racial disparities in breast cancer0
Issue Information0
Specifying composites in structural equation modeling: A refinement of the Henseler–Ogasawara specification0
Synthetic Anchoring Under the Specific Source Problem0
Adaptive boosting for ordinal target variables using neural networks0
Issue Information0
Computational Improvements to the Kernel k$$ k $$‐Means Clustering Algorithm0
A deep learning approach for the comparison of handwritten documents using latent feature vectors0
Online learning for streaming data classification in nonstationary environments0
Issue Information0
Deep Symbolic Learning for Histogram‐Valued Regression Data0
Application of nonparametric quantifiers for online handwritten signature verification: A statistical learning approach0
0
Variable Selection in Nonparametric Additive Models via Data Splitting0
A novel Bayesian method for variable selection and estimation in binary quantile regression0
Weighted SurvClipper : Nonlinear Prognostic Biomarker Selection Incorporating Historical Information for Survival Risk With Controlled 0
Spatially‐correlated time series clustering using location‐dependent Dirichlet process mixture model0
0
0
Issue Information0
Out‐of‐bag stability estimation for k‐means clustering0
Two‐sample testing for random graphs0
Lq regularization for fair artificial intelligence robust to covariate shift0
Issue Information0
Statistical Shape Analysis of Human Bodies0
Advancing Coefficient of Variation Estimation Under Additive Distortion Measurement Errors: Exploring New Avenues Beyond Independence Assumptions0
An Adaptive Microbiome‐Based Truncated Test0
0
Bayesian Quantile Semiparametric Mixed‐Effects Double Regression Models for Analyzing Longitudinal Data With Non‐Ignorable Missing Responses0
Distributionally Conservative Stochastic Dominance via Subsampling0
Distributed dimension reduction with nearly oracle rate0
Artificial Neural Network Optimization to Estimate Radon in Soil0
Expert‐in‐the‐loop design of integral nuclear data experiments0
Issue Information0
0
A random forest approach for interval selection in functional regression0
Semi‐supervised multi‐label learning with missing labels by exploiting feature‐label correlations0
Marginal clustered multistate models for longitudinal progressive processes with informative cluster size0
Study of a bounded interval perks distribution with quantile regression analysis0
Ensemble learning for score likelihood ratios under the common source problem0
BOSTONPUPA : A Bayesian Online Spatio‐Temporal Outbreak Detection Framework With Prior Updating and 0
Online embedding and clustering of evolving data streams0
Robust multitask learning in high dimensions under memory constraint0
0
Machine learning and neural network based model predictions of soybean export shares from US Gulf to China0
Model selection with bootstrap validation0
Issue Information0
Issue Information0
Nonparametric Expectile Regression Meets Deep Neural Networks: A Robust Nonlinear Variable Selection method0
0
Bilateral‐WeightedOnline Adaptive Isolation Forest for anomaly detection in streaming data0
Assessment of the real‐time pattern recognition capability of machine learning algorithms0
Association rules and decision rules0
On an Empirical Likelihood Based Solution to the Approximate Bayesian Computation Problem0
Hub‐aware random walk graph embedding methods for classification0
Smart data augmentation: One equation is all you need0
Issue Information0
Boosting diversity in regression ensembles0
Nonparametric Linear Discriminant Analysis for High Dimensional Matrix‐Valued Data0
Exact Score Vector and Hessian Matrix for Mixtures of Matrix‐Variate Normals0
Factor analysis of mixed data for anomaly detection0
Transfer Learning Analysis of the Cox Mixture Model0
Robustifying Marginal Linear Models for Correlated Responses Using a Constructive Multivariate Huber Distribution0
Counterfactual Uncertainty Quantification of Factual Estimand of Efficacy From Before‐and‐After Treatment Repeated Measures Randomized Controlled Trials0
Using Neural Networks to Identify Mixture Components in Hyperspectral Reflectance Data0
Modeling subpopulations for hierarchically structured data0
0
Bayesian relative composite quantile regression approach of ordinal latent regression model with L1/2 regularization0
A neutral zone classifier for three classes with an application to text mining0
Issue Information0
Considerations in Bayesian agent‐based modeling for the analysis of COVID‐19 data0
Traditional kriging versus modern Gaussian processes for large‐scale mining data0
0
Issue Information0
Copula‐Based Deep Learning Models for Competing Risks0
Feature screening of ultrahigh dimensional longitudinal data based on the C‐statistic0
Non‐uniform active learning for Gaussian process models with applications to trajectory informed aerodynamic databases0
Revisiting Winnow: A modified online feature selection algorithm for efficient binary classification0
Doubly robust estimation for non‐probability samples with modified intertwined probabilistic factors decoupling0
Residuals and diagnostics for multinomial regression models0
Neural interval‐censored survival regression with feature selection0
On Algorithms and Approximations for Progressively Type‐I Censoring Schemes0
Conformal Multi‐Target Hyperrectangles0
Detection of Unknown Functional Departure in Generalized Functional Regression0
Trees, forests, chickens, and eggs: when and why to prune trees in a random forest0
0
0.18102288246155