Annals of Statistics

Papers
(The median citation count of Annals of Statistics is 2. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-06-01 to 2025-06-01.)
ArticleCitations
On high-dimensional Poisson models with measurement error: Hypothesis testing for nonlinear nonconvex optimization136
Estimation and inference for minimizer and minimum of convex functions: Optimality, adaptivity and uncertainty principles84
A sieve stochastic gradient descent estimator for online nonparametric regression in Sobolev ellipsoids72
Inference in Ising models on dense regular graphs70
Efficiency in local differential privacy68
Robust k-means clustering for distributions with two moments68
Measuring dependence in the Wasserstein distance for Bayesian nonparametric models64
Half-trek criterion for identifiability of latent variable models52
Online inference with multi-modal likelihood functions42
Universal rank inference via residual subsampling with application to large networks41
Foundations of structural causal models with cycles and latent variables35
Scalable estimation and inference for censored quantile regression process33
Parametric copula adjusted for non- and semiparametric regression33
Deep learning for the partially linear Cox model32
Learning sparse graphons and the generalized Kesten–Stigum threshold31
A general characterization of optimal tie-breaker designs31
Adaptive and robust multi-task learning29
Asymptotic distributions of high-dimensional distance correlation inference28
Consistent inference for diffusions from low frequency measurements27
Inference for low-rank tensors—no need to debias26
Inference for low-rank models26
Asymptotic analysis of synchrosqueezing transform—toward statistical inference with nonlinear-type time-frequency analysis26
Community detection with dependent connectivity26
Admissible ways of merging p-values under arbitrary dependence25
Rate-optimal estimation of mixed semimartingales24
Uniform convergence of local Fréchet regression with applications to locating extrema and time warping for metric space valued trajectories24
Refined Cramér-type moderate deviation theorems for general self-normalized sums with applications to dependent random variables and winsorized mean22
Order-of-addition orthogonal arrays to study the effect of treatment ordering22
Nonparametric classification with missing data22
Environment invariant linear least squares22
Supervised homogeneity fusion: A combinatorial approach21
Sharp optimality for high-dimensional covariance testing under sparse signals21
Testing goodness-of-fit and conditional independence with approximate co-sufficient sampling21
On posterior consistency of data assimilation with Gaussian process priors: The 2D-Navier–Stokes equations21
Rank and factor loadings estimation in time series tensor factor model by pre-averaging20
Spectral estimation of Hawkes processes from count data20
Consistent order selection for ARFIMA processes20
A causal bootstrap19
Is infinity that far? A Bayesian nonparametric perspective of finite mixture models19
Change-point inference in high-dimensional regression models under temporal dependence19
Limiting distributions for eigenvalues of sample correlation matrices from heavy-tailed populations19
Spatial dependence and space–time trend in extreme events18
A nonparametric test for elliptical distribution based on kernel embedding of probabilities18
New Edgeworth-type expansions with finite sample guarantees18
Wilks’ theorem for semiparametric regressions with weakly dependent data18
General spatio-temporal factor models for high-dimensional random fields on a lattice17
Iterative algorithm for discrete structure recovery17
Plugin estimation of smooth optimal transport maps17
Toward theoretical understandings of robust Markov decision processes: Sample complexity and asymptotics17
Consistency of Bayesian inference for multivariate max-stable distributions17
Computational lower bounds for graphon estimation via low-degree polynomials16
Sup-norm adaptive drift estimation for multivariate nonreversible diffusions16
Learning mixtures of permutations: Groups of pairwise comparisons and combinatorial method of moments16
Time-uniform central limit theory and asymptotic confidence sequences16
Distributed nonparametric function estimation: Optimal rate of convergence and cost of adaptation16
Consistency of invariance-based randomization tests15
Optimal rates for independence testing via U-statistic permutation tests15
Wald tests when restrictions are locally singular15
Backfitting for large scale crossed random effects regressions15
A shrinkage principle for heavy-tailed data: High-dimensional robust low-rank matrix recovery15
The Lasso with general Gaussian designs with applications to hypothesis testing14
Minimax rate of distribution estimation on unknown submanifolds under adversarial losses14
Surprises in high-dimensional ridgeless least squares interpolation14
Edgeworth expansions for network moments14
Projected state-action balancing weights for offline reinforcement learning14
Detecting multiple replicating signals using adaptive filtering procedures13
Minimax nonparametric estimation of pure quantum states13
Testing for independence in high dimensions based on empirical copulas13
Transfer learning for contextual multi-armed bandits13
Linear biomarker combination for constrained classification13
A new approach to tests and confidence bands for distribution functions13
On least squares estimation under heteroscedastic and heavy-tailed errors12
Sharp adaptive and pathwise stable similarity testing for scalar ergodic diffusions12
Dimension free ridge regression12
Testing nonparametric shape restrictions12
Interactive versus noninteractive locally differentially private estimation: Two elbows for the quadratic functional12
Change acceleration and detection12
Noisy linear inverse problems under convex constraints: Exact risk asymptotics in high dimensions12
Confidence regions near singular information and boundary points with applications to mixed models12
Carving model-free inference11
Finite-sample complexity of sequential Monte Carlo estimators11
Dispersal density estimation across scales11
Approximate Message Passing algorithms for rotationally invariant matrices11
On universally consistent and fully distribution-free rank tests of vector independence11
Adaptive transfer learning11
Testing community structure for hypergraphs11
ℓ2 inference for change points in high-dimensional time series via a Two-Way MOSUM10
ARK: Robust knockoffs inference with coupling10
Ridge regression revisited: Debiasing, thresholding and bootstrap10
On the sample complexity of entropic optimal transport10
The Stein effect for Fréchet means10
Learning models with uniform performance via distributionally robust optimization10
Adaptive estimation in multivariate response regression with hidden variables10
Tensor clustering with planted structures: Statistical optimality and computational limits10
Nonlinear global Fréchet regression for random objects via weak conditional expectation10
Marginal singularity and the benefits of labels in covariate-shift10
Bridging factor and sparse models9
Joint sequential detection and isolation for dependent data streams9
Statistical inference for principal components of spiked covariance matrices9
Sparse high-dimensional linear regression. Estimating squared error and a phase transition9
Correction note: “Asymptotic spectral theory for nonlinear time series”9
Testing high-dimensional regression coefficients in linear models9
Large-dimensional independent component analysis: Statistical optimality and computational tractability9
Matching recovery threshold for correlated random graphs9
Local convexity of the TAP free energy and AMP convergence for Z2-synchronization9
Existence and uniqueness of the Kronecker covariance MLE9
SuperMix: Sparse regularization for mixtures9
Heavy-tailed Bayesian nonparametric adaptation8
On robustness and local differential privacy8
Analysis of generalized Bregman surrogate algorithms for nonsmooth nonconvex statistical learning8
Inference for a two-stage enrichment design8
Multivariate trend filtering for lattice data8
A nonparametric doubly robust test for a continuous treatment effect8
Affine-equivariant inference for multivariate location under Lp loss functions8
Rerandomization with diminishing covariate imbalance and diverging number of covariates8
Global and individualized community detection in inhomogeneous multilayer networks8
Ensemble projection pursuit for general nonparametric regression8
Spectral analysis of gram matrices with missing at random observations: Convergence, central limit theorems, and applications in statistical inference8
Adaptive variational Bayes: Optimality, computation and applications8
Bootstrapping persistent Betti numbers and other stabilizing statistics7
Peskun–Tierney ordering for Markovian Monte Carlo: Beyond the reversible scenario7
The online closure principle7
Two-level parallel flats designs7
General and feasible tests with multiply-imputed datasets7
Universal regression with adversarial responses7
Post-selection inference via algorithmic stability7
Local permutation tests for conditional independence7
Estimating a density near an unknown manifold: A Bayesian nonparametric approach7
Rates of estimation for high-dimensional multireference alignment7
Adaptive novelty detection with false discovery rate guarantee7
Optimal signal detection in some spiked random matrix models: Likelihood ratio tests and linear spectral statistics7
Efficient estimation of the maximal association between multiple predictors and a survival outcome7
Embedding distributional data7
A general framework to quantify deviations from structural assumptions in the analysis of nonstationary function-valued processes7
Optimal false discovery rate control for large scale multiple testing with auxiliary information7
Some theory about efficient dimension reduction regarding the interaction between two responses7
Total positivity in multivariate extremes7
Approximation error from discretizations and its applications6
Statistical inference in sparse high-dimensional additive models6
Grouped variable selection with discrete optimization: Computational and statistical perspectives6
Estimation of the spectral measure from convex combinations of regularly varying random vectors6
Rate-optimal robust estimation of high-dimensional vector autoregressive models6
Improved covariance estimation: Optimal robustness and sub-Gaussian guarantees under heavy tails6
S-estimation in linear models with structured covariance matrices6
Stereographic Markov chain Monte Carlo6
Bootstrap-assisted inference for generalized Grenander-type estimators6
Central limit theorem and bootstrap approximation in high dimensions: Near 1/n rates via implicit smoothing6
Gaussian process regression in the flat limit6
Statistical complexity and optimal algorithms for nonlinear ridge bandits6
A new and flexible design construction for orthogonal arrays for modern applications6
Deep neural networks for nonparametric interaction models with diverging dimension6
On the existence of powerful p-values and e-values for composite hypotheses6
Convex regression in multidimensions: Suboptimality of least squares estimators6
Conditional calibration for false discovery rate control under dependence6
Conditional predictive inference for stable algorithms6
On minimax optimality of sparse Bayes predictive density estimates6
Model selection in the space of Gaussian models invariant by symmetry6
Optimal subgroup selection6
Strong selection consistency of Bayesian vector autoregressive models based on a pseudo-likelihood approach6
One-step estimation of differentiable Hilbert-valued parameters5
Augmented minimax linear estimation5
Causal discovery in heavy-tailed models5
MARS via LASSO5
A statistical framework of watermarks for large language models: Pivot, detection efficiency and optimal rules5
Infinite-dimensional gradient-based descent for alpha-divergence minimisation5
A conformal test of linear models via permutation-augmented regressions5
Skewed Bernstein–von Mises theorem and skew-modal approximations5
Distributed adaptive Gaussian mean estimation with unknown variance: Interactive protocol helps adaptation5
False discovery rate control with unknown null distribution: Is it possible to mimic the oracle?5
Boosted nonparametric hazards with time-dependent covariates5
Extreme value inference for heterogeneous power law data5
Editorial: Memorial issue for Charles Stein5
Consistent nonparametric estimation for heavy-tailed sparse graphs5
A study of orthogonal array-based designs under a broad class of space-filling criteria5
Convergence of de Finetti’s mixing measure in latent structure models for observed exchangeable sequences5
Variable selection consistency of Gaussian process regression5
The curse of overparametrization in adversarial training: Precise analysis of robust generalization for random features regression5
Stein’s method of normal approximation: Some recollections and reflections5
Learning low-dimensional nonlinear structures from high-dimensional noisy data: An integral operator approach5
High-dimensional inference for dynamic treatment effects4
The generalization error of max-margin linear classifiers: Benign overfitting and high dimensional asymptotics in the overparametrized regime4
Early stopping for L2-boosting in high-dimensional linear models4
Spectral statistics of sample block correlation matrices4
The edge of discovery: Controlling the local false discovery rate at the margin4
Statistical guarantees for Bayesian uncertainty quantification in nonlinear inverse problems with Gaussian process priors4
Optimal difference-based variance estimators in time series: A general framework4
Optimality of spectral clustering in the Gaussian mixture model4
Higher criticism to compare two large frequency tables, with sensitivity to possible rare and weak differences4
Integrative methods for post-selection inference under convex constraints4
Random graph asymptotics for treatment effect estimation under network interference4
Non-independent component analysis4
Deep approximate policy iteration4
Concentration of discrepancy-based approximate Bayesian computation via Rademacher complexity4
How do noise tails impact on deep ReLU networks?4
On cross-validated Lasso in high dimensions4
Precise error rates for computationally efficient testing4
Total positivity in exponential families with application to binary variables4
Isotonic regression with unknown permutations: Statistics, computation and adaptation4
Optimal policy evaluation using kernel-based temporal difference methods4
Evidence factors from multiple, possibly invalid, instrumental variables3
On blockwise and reference panel-based estimators for genetic data prediction in high dimensions3
Approximate kernel PCA: Computational versus statistical trade-off3
Stochastic continuum-armed bandits with additive models: Minimax regrets and adaptive algorithm3
Optimal change-point estimation in time series3
Propriety of the reference posterior distribution in Gaussian process modeling3
Unified algorithms for RL with Decision-Estimation Coefficients: PAC, reward-free, preference-based learning and beyond3
Uniform consistency in nonparametric mixture models3
On statistical learning of simplices: Unmixing problem revisited3
Empirical tail copulas for functional data3
Testing equivalence of clustering3
Semiparametric optimal estimation with nonignorable nonresponse data3
Covariance estimation under one-bit quantization3
Variable selection, monotone likelihood ratio and group sparsity3
Functional sufficient dimension reduction through average Fréchet derivatives3
On the statistical complexity of sample amplification3
Higher-order coverage errors of batching methods via Edgeworth expansions on t-statistics3
Set structured global empirical risk minimizers are rate optimal in general dimensions3
Powerful knockoffs via minimizing reconstructability3
Testing for practically significant dependencies in high dimensions via bootstrapping maxima of U-statistics3
Optimal heteroskedasticity testing in nonparametric regression3
Fundamental limits of low-rank matrix estimation with diverging aspect ratios3
Asymptotic normality and optimality in nonsmooth stochastic approximation3
Adaptive learning rates for support vector machines working on data with low intrinsic dimension3
Statistical-computational trade-offs in tensor PCA and related problems via communication complexity3
Inference for extremal regression with dependent heavy-tailed data3
Simplex quantile regression without crossing3
Increasing dimension asymptotics for two-way crossed mixed effect models3
Robust sub-Gaussian estimation of a mean vector in nearly linear time3
Projective, sparse and learnable latent position network models3
Tensor factor model estimation by iterative projection3
Exact minimax risk for linear least squares, and the lower tail of sample covariance matrices3
Sparse anomaly detection across referentials: A rank-based higher criticism approach3
Dimension-free mixing times of Gibbs samplers for Bayesian hierarchical models3
High-dimensional asymptotics of likelihood ratio tests in the Gaussian sequence model under convex constraints3
StarTrek: Combinatorial variable selection with false discovery rate control3
Gromov–Wasserstein distances: Entropic regularization, duality and sample complexity3
Quantile processes and their applications in finite populations3
Correction note: “Statistical inference for the mean outcome under a possibly nonunique optimal treatment rule”3
Multiscale Bayesian survival analysis3
Extreme conditional expectile estimation in heavy-tailed heteroscedastic regression models3
Distributed statistical inference for massive data2
Optimal estimation of high-dimensional Gaussian location mixtures2
Local Whittle estimation of high-dimensional long-run variance and precision matrices2
Single index Fréchet regression2
Erratum: Asymptotic genealogies of interacting particle systems with an application to sequential Monte Carlo2
Conditional sequential Monte Carlo in high dimensions2
The impacts of unobserved covariates on covariate-adaptive randomized experiments2
Reconciling the Gaussian and Whittle likelihood with an application to estimation in the frequency domain2
On the robustness of minimum norm interpolators and regularized empirical risk minimizers2
Necessary and sufficient conditions for asymptotically optimal linear prediction of random fields on compact metric spaces2
0.066154956817627