Annals of Statistics

Papers
(The TQCC of Annals of Statistics is 6. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-04-01 to 2025-04-01.)
ArticleCitations
Convergence of de Finetti’s mixing measure in latent structure models for observed exchangeable sequences127
On singular values of data matrices with general independent columns79
Optimal estimation and computational limit of low-rank Gaussian mixtures66
Ridge regression revisited: Debiasing, thresholding and bootstrap64
Change-point analysis with irregular signals62
Online inference with multi-modal likelihood functions61
On the approximation accuracy of Gaussian variational inference61
Statistical inference for rough volatility: Minimax theory59
One-step estimation of differentiable Hilbert-valued parameters46
Online change-point detection for matrix-valued time series with latent two-way factor structure44
Generalization error bounds of dynamic treatment regimes in penalized regression-based learning41
Orthogonal statistical learning33
Foundations of structural causal models with cycles and latent variables33
Maximum likelihood for high-noise group orbit estimation and single-particle cryo-EM31
High-dimensional nonparametric density estimation via symmetry and shape constraints30
Inference in Ising models on dense regular graphs27
Extreme value inference for heterogeneous power law data27
False discovery rate control with unknown null distribution: Is it possible to mimic the oracle?27
Complexity analysis of Bayesian learning of high-dimensional DAG models and their equivalence classes27
Dispersal density estimation across scales27
Parametric copula adjusted for non- and semiparametric regression26
Carving model-free inference25
Off-policy evaluation in partially observed Markov decision processes under sequential ignorability24
The Stein effect for Fréchet means24
A cross-validation framework for signal denoising with applications to trend filtering, dyadic CART and beyond24
Principal components in linear mixed models with general bulk24
Scalable estimation and inference for censored quantile regression process24
MARS via LASSO22
On the disjoint and sliding block maxima method for piecewise stationary time series21
Adaptive estimation in multivariate response regression with hidden variables21
Locally simultaneous inference21
Metric statistics: Exploration and inference for random objects with distance profiles21
Robust k-means clustering for distributions with two moments21
The adaptive Wynn algorithm in generalized linear models with univariate response20
A sieve stochastic gradient descent estimator for online nonparametric regression in Sobolev ellipsoids20
Large-scale inference with block structure19
Estimation and inference for minimizer and minimum of convex functions: Optimality, adaptivity and uncertainty principles19
Optimization hierarchy for fair statistical decision problems19
Optimal disclosure risk assessment19
Half-trek criterion for identifiability of latent variable models18
Are deviations in a gradually varying mean relevant? A testing approach based on sup-norm estimators17
Choosing between persistent and stationary volatility17
ℓ2 inference for change points in high-dimensional time series via a Two-Way MOSUM17
Variable selection consistency of Gaussian process regression16
Convergence rates of oblique regression trees for flexible function libraries16
On universally consistent and fully distribution-free rank tests of vector independence16
Boosted nonparametric hazards with time-dependent covariates16
Parameter estimation in nonlinear multivariate stochastic differential equations based on splitting schemes16
On the sample complexity of entropic optimal transport15
On high-dimensional Poisson models with measurement error: Hypothesis testing for nonlinear nonconvex optimization15
Nonlinear global Fréchet regression for random objects via weak conditional expectation15
Empirical partially Bayes multiple testing and compound χ2 decisions15
Tensor clustering with planted structures: Statistical optimality and computational limits15
A statistical framework of watermarks for large language models: Pivot, detection efficiency and optimal rules15
Measuring dependence in the Wasserstein distance for Bayesian nonparametric models14
On an extension of the promotion time cure model14
Minimax optimality of permutation tests14
Stein’s method of normal approximation: Some recollections and reflections14
Testability of high-dimensional linear models with nonsparse structures14
Infinite-dimensional gradient-based descent for alpha-divergence minimisation14
Universal rank inference via residual subsampling with application to large networks14
Adaptive robust estimation in sparse vector model14
The completion of covariance kernels14
Consistent nonparametric estimation for heavy-tailed sparse graphs13
Marginal singularity and the benefits of labels in covariate-shift13
Nonparametric Bayesian inference for reversible multidimensional diffusions13
Sharp global convergence guarantees for iterative nonconvex optimization with random data12
Learning models with uniform performance via distributionally robust optimization12
Gaussian approximation for nonstationary time series with optimal rate and explicit construction12
Deep learning for the partially linear Cox model12
A Gaussian process approach to model checks12
A conformal test of linear models via permutation-augmented regressions12
An optimal statistical and computational framework for generalized tensor estimation12
Efficiency in local differential privacy12
Distribution and quantile functions, ranks and signs in dimension d: A measure transportation approach11
Bridging factor and sparse models11
E-values: Calibration, combination and applications11
Semiparametric latent-class models for multivariate longitudinal and survival data11
Conformal prediction beyond exchangeability11
AutoRegressive approximations to nonstationary time series with inference and applications11
Average treatment effects in the presence of unknown interference11
The curse of overparametrization in adversarial training: Precise analysis of robust generalization for random features regression11
Editorial: Memorial issue for Charles Stein10
Asymptotic accuracy of the saddlepoint approximation for maximum likelihood estimation10
The projected covariance measure for assumption-lean variable significance testing10
Factor-driven two-regime regression10
Optimal nonparametric testing of Missing Completely At Random and its connections to compatibility10
Nonparametric conditional local independence testing10
Volatility coupling10
Rate-optimal cluster-randomized designs for spatial interference10
Correction note: “Asymptotic spectral theory for nonlinear time series”10
On fixed-domain asymptotics, parameter estimation and isotropic Gaussian random fields with Matérn covariance functions9
Adaptive and robust multi-task learning9
Cube root weak convergence of empirical estimators of a density level set9
Adaptive estimation of multivariate piecewise polynomials and bounded variation functions by optimal decision trees9
Bayesian fixed-domain asymptotics for covariance parameters in a Gaussian process model9
Charles Stein and invariance: Beginning with the Hunt–Stein theorem9
Settling the sample complexity of model-based offline reinforcement learning8
Analysis of “learn-as-you-go” (LAGO) studies8
An asymptotic test for constancy of the variance under short-range dependence8
What is resolution? A statistical minimax testing perspective on superresolution microscopy8
Density deconvolution under general assumptions on the distribution of measurement errors8
Statistical inference for decentralized federated learning8
A general characterization of optimal tie-breaker designs8
Graphical models for nonstationary time series8
Estimation of mixed fractional stable processes using high-frequency data8
Asymptotic properties of penalized spline estimators in concave extended linear models: Rates of convergence8
Asymptotic normality for eigenvalue statistics of a general sample covariance matrix when p/n→∞ and applications7
Learning sparse graphons and the generalized Kesten–Stigum threshold7
Nonregular and minimax estimation of individualized thresholds in high dimension with binary responses7
Early stopping for L2-boosting in high-dimensional linear models7
Universal Bayes consistency in metric spaces7
Inference on the maximal rank of time-varying covariance matrices using high-frequency data7
Minimax rates for heterogeneous causal effect estimation7
Consistent inference for diffusions from low frequency measurements7
Learning low-dimensional nonlinear structures from high-dimensional noisy data: An integral operator approach7
Optimal linear discriminators for the discrete choice model in growing dimensions7
High-dimensional inference for dynamic treatment effects7
Linearized two-layers neural networks in high dimension7
SuperMix: Sparse regularization for mixtures6
Limit theorems for distributions invariant under groups of transformations6
Uniform convergence of local Fréchet regression with applications to locating extrema and time warping for metric space valued trajectories6
Admissible ways of merging p-values under arbitrary dependence6
Testing high-dimensional regression coefficients in linear models6
Higher criticism to compare two large frequency tables, with sensitivity to possible rare and weak differences6
Minimax estimation of smooth densities in Wasserstein distance6
Asymptotic analysis of synchrosqueezing transform—toward statistical inference with nonlinear-type time-frequency analysis6
Existence and uniqueness of the Kronecker covariance MLE6
Approximate and exact designs for total effects6
Optimality of spectral clustering in the Gaussian mixture model6
Statistical inference for principal components of spiked covariance matrices6
Improved central limit theorem and bootstrap approximations in high dimensions6
Integrative methods for post-selection inference under convex constraints6
How do noise tails impact on deep ReLU networks?6
Efficiency of delayed-acceptance random walk Metropolis algorithms6
The interpolation phase transition in neural networks: Memorization and generalization under lazy training6
Local convexity of the TAP free energy and AMP convergence for Z2-synchronization6
Deflated HeteroPCA: Overcoming the curse of ill-conditioning in heteroskedastic PCA6
Inference for low-rank models6
Joint sequential detection and isolation for dependent data streams6
Adaptive test of independence based on HSIC measures6
0.11169195175171