Statistical Analysis and Data Mining

Papers
(The median citation count of Statistical Analysis and Data Mining is 0. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-04-01 to 2025-04-01.)
Article | Citations
Issue Information | 401
 | 26
Semi‐Parametric Least‐Area Linear‐Circular Regression Through Möbius Transformation | 23
Sequence Outlier Detection and Application of Gated Recurrent Unit Autoencoder Gaussian Mixture Model Based on Various Loss Optimization | 15
A practical extension of the recursive multi‐fidelity model for the emulation of hole closure experiments | 13
Predictive models with end user preference | 12
Issue Information | 12
 | 11
 | 11
Issue Information | 9
Modeling and inference for mixtures of simple symmetric exponential families of ‐dimensional distributions for vectors with binary coordinates | 9
Analyzing relevance vector machines using a single penalty approach | 8
Issue Information | 8
Factor analysis for high‐dimensional time series: Consistent estimation and efficient computation | 7
 | 7
Regrouped design in privacy analysis for multinomial microdata | 7
 | 7
Estimation of disease progression for ischemic heart disease using latent Markov with covariates | 6
Adaptive boosting for ordinal target variables using neural networks | 6
eRPCA: Robust Principal Component Analysis for Exponential Family Distributions | 6
Evaluating causal‐based feature selection for fuel property prediction models | 5
Randomized multiarm bandits: An improved adaptive data collection method | 5
Online learning for streaming data classification in nonstationary environments | 5
Marginal clustered multistate models for longitudinal progressive processes with informative cluster size | 5
Nonparametric clustering of RNA‐sequencing data | 4
Bilateral‐Weighted Online Adaptive Isolation Forest for anomaly detection in streaming data | 4
Issue Information | 3
Sample selection bias in evaluation of prediction performance of causal models | 3
Survival trees based on heterogeneity in time‐to‐event and censoring distributions using parameter instability test | 3
Local influence analysis for the sliced average third‐moment estimation | 3
A new logarithmic multiplicative distortion for correlation analysis | 3
Sketched Stochastic Dictionary Learning for large‐scale data and application to high‐throughput mass spectrometry | 3
Issue Information | 3
Imputed quantile vector autoregressive model for multivariate spatial–temporal data | 3
Data‐driven sparse partial least squares | 3
Specifying composites in structural equation modeling: A refinement of the Henseler–Ogasawara specification | 3
Rarity updated ensemble with oversampling: An ensemble approach to classification of imbalanced data streams | 3
Weighted validation of heteroscedastic regression models for better selection | 3
Application of the Cox proportional hazards model and competing risks models to critical illness insurance data | 3
Buckley–James estimation of generalized additive accelerated lifetime model with ultrahigh‐dimensional data | 3
Subsampling under distributional constraints | 3
High‐dimensional classification based on nonparametric maximum likelihood estimation under unknown and inhomogeneous variances | 2
Gaussian process selections in semiparametric multi‐kernel machine regression for multi‐pathway analysis | 2
Conformal Multi‐Target Hyperrectangles | 2
Smart data augmentation: One equation is all you need | 2
The fairness‐accuracy Pareto front | 2
Bayesian modeling of location, scale, and shape parameters in skew‐normal regression models | 2
Individualized image region detection with total variation | 2
Assessment of the real‐time pattern recognition capability of machine learning algorithms | 2
Modal linear regression models with multiplicative distortion measurement errors | 2
Corrigendum | 2
The generalized hyperbolic family and automatic model selection through the multiple‐choice LASSO | 2
Confidence bounds for threshold similarity graph in random variable network | 2
Machine learning and neural network based model predictions of soybean export shares from US Gulf to China | 2
The analysis of association rules: Latent class analysis | 2
Sparse Bayesian variable selection in high‐dimensional logistic regression models with correlated priors | 2
A new formulation of sparse multiple kernel k‐means clustering and its applications | 2
Characterizing climate pathways using feature importance on echo state networks | 2
Cluster analysis via random partition distributions | 2
Driving mode analysis—How uncertain functional inputs propagate to an output | 1
Issue Information | 1
Detection of Unknown Functional Departure in Generalized Functional Regression | 1
 | 1
An automated alignment algorithm for identification of the source of footwear impressions with common class characteristics | 1
Portability analysis of data mining models for fog events forecasting | 1
Efficient importance sampling imputation algorithms for quantile and composite quantile regression | 1
 | 1
Issue Information | 1
Measure inducing classification and regression trees for functional data | 1
Coupled support tensor machine classification for multimodal neuroimaging data | 1
Some Bayesian biclustering methods: Modeling and inference | 1
Issue Information | 1
BayesMultiomics: An R Package for Bayesian Shrinkage Models for Integration and Analysis of Multi‐Platform High‐Dimensional Genomics Data | 1
Robustifying Marginal Linear Models for Correlated Responses Using a Constructive Multivariate Huber Distribution | 1
Multi‐scale affinities with missing data: Estimation and applications | 1
Two‐sample testing for random graphs | 1
A new parametric approach to gender gap with application to EUSILC data in Poland and Italy | 1
Issue Information | 1
Nonparametric Expectile Regression Meets Deep Neural Networks: A Robust Nonlinear Variable Selection method | 1
 | 1
CLADAG 2021 special issue: Selected papers on classification and data analysis | 1
Development and validation of models for two‐week mortality of inpatients with COVID‐19 infection: A large prospective cohort study | 1
A Novel Approach for APT Detection Based on Ensemble Learning Model | 1
 | 1
Interaction Tests With Covariate‐Adaptive Randomization | 1
Fourier neural networks as function approximators and differential equation solvers | 1
Kernel learning with nonconvex ramp loss | 1
Error‐controlled feature selection for ultrahigh‐dimensional and highly correlated feature space using deep learning | 1
Association rules and decision rules | 1
Data Twinning | 1
Convolutional Sparse Coding for Time Series Via a ℓ0 Penalty: An Efficient Algorithm With Statistical Guarantees | 1
An Improved D2GAN‐based oversampling algorithm for imbalanced data classification | 0
Modeling subpopulations for hierarchically structured data | 0
Feature selection for imbalanced data with deep sparse autoencoders ensemble | 0
Issue Information | 0
Penalized composite likelihood for colored graphical Gaussian models | 0
Node Centrality Inference via Hypothesis Testing | 0
Distributed dimension reduction with nearly oracle rate | 0
Bayesian batch optimization for molybdenum versus tungsten inertial confinement fusion double shell target design | 0
 | 0
 | 0
Coefficient tree regression for generalized linear models | 0
 | 0
Issue Information | 0
Boosting diversity in regression ensembles | 0
Trees, forests, chickens, and eggs: when and why to prune trees in a random forest | 0
Issue Information | 0
Tracking clusters and anomalies in evolving data streams | 0
A deep learning approach for the comparison of handwritten documents using latent feature vectors | 0
On difference‐based gradient estimation in nonparametric regression | 0
Input‐response space‐filling designs incorporating response uncertainty | 0
Weighted AutoEncoding recommender system | 0
A treeless absolutely random forest with closed‐form estimators of expected proximities | 0
Intuitively adaptable outlier detector | 0
 | 0
Modeling matrix variate time series via hidden Markov models with skewed emissions | 0
Nonparametric Bayesian functional clustering with applications to racial disparities in breast cancer | 0
Issue Information | 0
Non‐uniform active learning for Gaussian process models with applications to trajectory informed aerodynamic databases | 0
Biclustering high‐frequency financial time series based on information theory | 0
A random forest approach for interval selection in functional regression | 0
Ensembled sparse‐input hierarchical networks for high‐dimensional datasets | 0
Feature screening of ultrahigh dimensional longitudinal data based on the C‐statistic | 0
Adversarially robust subspace learning in the spiked covariance model | 0
Supervised compression of big data | 0
A linear time method for the detection of collective and point anomalies | 0
Issue Information | 0
A tutorial on generative adversarial networks with application to classification of imbalanced data | 0
Markov chain to analyze web usability of a university website using eye tracking data | 0
Weighted pivot coordinates for partial least squares‐based marker discovery in high‐throughput compositional data | 0
Frequentist model averaging for zero‐inflated Poisson regression models | 0
Issue Information | 0
Lq regularization for fair artificial intelligence robust to covariate shift | 0
Stratified learning: A general‐purpose statistical method for improved learning under covariate shift | 0
Handwriting identification using random forests and score‐based likelihood ratios | 0
Out‐of‐bag stability estimation for k‐means clustering | 0
Bag of little bootstraps for massive and distributed longitudinal data | 0
Neural interval‐censored survival regression with feature selection | 0
On Algorithms and Approximations for Progressively Type‐I Censoring Schemes | 0
Noise‐Augmented ℓ0 Regularization of Tensor Regression With Tucker Decomposition | 0
Semi‐supervised multi‐label learning with missing labels by exploiting feature‐label correlations | 0
Imbalanced classification: A paradigm‐based review | 0
A novel Bayesian method for variable selection and estimation in binary quantile regression | 0
Online embedding and clustering of evolving data streams | 0
Traditional kriging versus modern Gaussian processes for large‐scale mining data | 0
Semiparametric estimation of average treatment effects in observational studies | 0
 | 0
A machine learning oracle for parameter estimation | 0
Multivariate contaminated normal mixture regression modeling of longitudinal data based on joint mean‐covariance model | 0
Quantifying Epistemic Uncertainty in Binary Classification via Accuracy Gain | 0
Integrative learning of structured high‐dimensional data from multiple datasets | 0
Robust multitask learning in high dimensions under memory constraint | 0
Expert‐in‐the‐loop design of integral nuclear data experiments | 0
Hierarchy‐assisted gene expression regulatory network analysis | 0
 | 0
Issue Information | 0
Residuals and diagnostics for multinomial regression models | 0
Bayesian inference for nonprobability samples with nonignorable missingness | 0
 | 0
A study of the impact of COVID‐19 on the Chinese stock market based on a new textual multiple ARMA model | 0
Using Neural Networks to Identify Mixture Components in Hyperspectral Reflectance Data | 0
Comparison of merging strategies for building machine learning models on multiple independent gene expression data sets | 0
Regression‐based Bayesian estimation and structure learning for nonparanormal graphical models | 0
An efficient k‐modes algorithm for clustering categorical datasets | 0
Application of nonparametric quantifiers for online handwritten signature verification: A statistical learning approach | 0
Issue Information | 0
A Conversational Assistant for Democratization of Data Visualization: A Comparative Study of Two Approaches of Interaction | 0
A family of mixture models for biclustering | 0
Greenwood Statistic Under Distortion Measurement Errors | 0
Revisiting Winnow: A modified online feature selection algorithm for efficient binary classification | 0
The finite mixture model for the tails of distribution: Monte Carlo experiment and empirical applications | 0
Compositional variable selection in quantile regression for microbiome data with false discovery rate control | 0
A deep learning factor analysis model based on importance‐weighted variational inference and normalizing flow priors: Evaluation within a set of multidimensional performance assessments in youth elite | 0
Precision aggregated local models | 0
Multi‐node Expectation–Maximization algorithm for finite mixture models | 0
Factor analysis of mixed data for anomaly detection | 0
Doubly robust estimation for non‐probability samples with modified intertwined probabilistic factors decoupling | 0
Semiparametric detection of changepoints in location, scale, and copula | 0
 | 0
Neural‐network transformation models for counting processes | 0
Ensemble learning for score likelihood ratios under the common source problem | 0
Transfer learning under the Cox model with interval‐censored data | 0
Randomized algorithms for tensor response regression | 0
Negative binomial graphical model with excess zeros | 0
Optimal ratio for data splitting | 0
Parallel coordinate order for high‐dimensional data | 0
Categorical classifiers in multiclass classification with imbalanced datasets | 0
Cost‐sensitive classification with time constraint on incomplete data | 0
Considerations in Bayesian agent‐based modeling for the analysis of COVID‐19 data | 0
A general iterative clustering algorithm | 0
A tree‐based gene–environment interaction analysis with rare features | 0
Towards accelerating particle‐resolved direct numerical simulation with neural operators | 0
 | 0
An Adaptive Microbiome‐Based Truncated Test | 0
 | 0
Simplicial depth: Characterization and reconstruction | 0
A modified least angle regression algorithm for interaction selection with heredity | 0
Prior effective sample size for exponential family distributions with multiple parameters | 0
Erratum to “Data‐driven dimension reduction in functional principal component analysis identifying the change‐point in functional data” | 0
Power grid frequency prediction using spatiotemporal modeling | 0
Data‐driven stochastic model for quantifying the interplay between amyloid‐beta and calcium levels in Alzheimer's disease | 0
Issue Information | 0
A fast and efficient Modal EM algorithm for Gaussian mixtures | 0
Adaptive batching for Gaussian process surrogates with application in noisy level set estimation | 0
On an Empirical Likelihood Based Solution to the Approximate Bayesian Computation Problem | 0
CLADAG 2019 Special Issue: Selected Papers on Classification and Data Analysis | 0
 | 0
Issue Information | 0
Robust deep neural network surrogate models with uncertainty quantification via adversarial training | 0
The Classification Algorithm Based on Functional Logistic Regression Model With Spatial Effects and Its Application in Air Quality Analysis | 0
A neutral zone classifier for three classes with an application to text mining | 0
Hub‐aware random walk graph embedding methods for classification | 0
Local support vector machine based dimension reduction | 0
Issue Information | 0
 | 0
An Efficient Filtering Approach for Model Estimation in Sparse Regression | 0
Issue Information | 0
Bayesian relative composite quantile regression approach of ordinal latent regression model with L1/2 regularization | 0
 | 0
 | 0
Study of a bounded interval perks distribution with quantile regression analysis | 0
Issue Information | 0
An auxiliary Part‐of‐Speech tagger for blog and microblog cyber‐slang | 0
Estimating basis functions in massive fields under the spatial mixed effects model | 0
Model selection with bootstrap validation | 0
Share density‐based clustering of income data | 0
Bayesian Posterior Interval Calibration to Improve the Interpretability of Observational Studies | 0
 | 0
Residual's influence index (RINFIN), bad leverage and unmasking in high dimensional L2‐regression | 0
Nonparametric mean and variance adaptive classification rule for high‐dimensional data with heteroscedastic variances | 0
Evaluation and interpretation of driving risks: Automobile claim frequency modeling with telematics data | 0
A finely tuned deep transfer learning algorithm to compare outsole images | 0
Spatially‐correlated time series clustering using location‐dependent Dirichlet process mixture model | 0
Density estimation via measure transport: Outlook for applications in the biological sciences | 0
Multivariate Gaussian RBF‐net for smooth function estimation and variable selection | 0
Bayesian shrinkage models for integration and analysis of multiplatform high‐dimensional genomics data | 0
A novel two‐step extrapolation‐insertion risk model based on the Expectile under the Pareto‐type distribution | 0
Persistent Classification: Understanding Adversarial Attacks by Studying Decision Boundary Dynamics | 0
Sequential metamodel‐based approaches to level‐set estimation under heteroscedasticity | 0
Exponential calibration for correlation coefficient with additive distortion measurement errors | 0
Simplicial depth and its median: Selected properties and limitations | 0