SIAM Journal on Mathematics of Data Science

Papers
(The median citation count of SIAM Journal on Mathematics of Data Science is 2. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-05-01 to 2026-05-01.)
ArticleCitations
Adaptivity of Stochastic Gradient Methods for Nonconvex Optimization45
Spectral Barron Space for Deep Neural Network Approximation35
A Simple and Optimal Algorithm for Strict Circular Seriation34
Taming Neural Networks with TUSLA: Nonconvex Learning via Adaptive Stochastic Gradient Langevin Algorithms33
A Note on the Regularity of Images Generated by Convolutional Neural Networks28
New Equivalences between Interpolation and SVMs: Kernels and Structured Features22
Quantitative Approximation Results for Complex-Valued Neural Networks20
Learning Functions Varying along a Central Subspace19
Deep Block Proximal Linearized Minimization Algorithm for Nonconvex Inverse Problems17
Block Majorization Minimization with Extrapolation and Application to \({\beta }\)-NMF17
Randomized Nyström Approximation of Non-negative Self-Adjoint Operators14
Poisson Reweighted Laplacian Uncertainty Sampling for Graph-Based Active Learning14
On the Inconsistency of Kernel Ridgeless Regression in Fixed Dimensions14
Efficient Algorithms for Regularized Nonnegative Scale-Invariant Low-Rank Approximation Models14
Resolving the Mixing Time of the Langevin Algorithm to Its Stationary Distribution for Log-Concave Sampling13
Nonlinear Tomographic Reconstruction via Nonsmooth Optimization12
Online Machine Teaching under Learner Uncertainty: Gradient Descent Learners of a Quadratic Loss12
Nonbacktracking Spectral Clustering of Nonuniform Hypergraphs11
Persistent Laplacians: Properties, Algorithms and Implications11
Wassmap: Wasserstein Isometric Mapping for Image Manifold Learning11
Safe Rules for the Identification of Zeros in the Solutions of the SLOPE Problem11
Scalable Tensor Methods for Nonuniform Hypergraphs10
Convergence of a Piggyback-Style Method for the Differentiation of Solutions of Standard Saddle-Point Problems10
Stochastic Variance-Reduced Majorization-Minimization Algorithms9
A Notion of Uniqueness for the Adversarial Bayes Classifier9
Nonlinear Meta-learning Can Guarantee Faster Rates9
Asymptotics of the Sketched Pseudoinverse9
Learning Memory Kernels in Generalized Langevin Equations9
CA-PCA: Manifold Dimension Estimation, Adapted for Curvature9
Covariance Alignment: From Maximum Likelihood Estimation to Gromov–Wasserstein9
Function-Space Optimality of Neural Architectures with Multivariate Nonlinearities9
Convergence of Gradient Descent for Recurrent Neural Networks: A Nonasymptotic Analysis8
Inverse Evolution Layers: Physics-Informed Regularizers for Image Segmentation8
Bi-Invariant Dissimilarity Measures for Sample Distributions in Lie Groups8
A Variational Formulation of Accelerated Optimization on Riemannian Manifolds8
Group-Invariant Tensor Train Networks for Supervised Learning8
The GenCol Algorithm for High-Dimensional Optimal Transport: General Formulation and Application to Barycenters and Wasserstein Splines8
The Sample Complexity of Sparse Multireference Alignment and Single-Particle Cryo-Electron Microscopy8
On Neural Network Approximation of Ideal Adversarial Attack and Convergence of Adversarial Training8
Random Multitype Spanning Forests for Synchronization on Sparse Graphs8
The Geometric Median and Applications to Robust Mean Estimation7
Numerical Considerations and a new implementation for invariant coordinate selection7
ABBA Neural Networks: Coping with Positivity, Expressivity, and Robustness7
Finite-Time Analysis of Natural Actor-Critic for POMDPs7
Optimal Dorfman Group Testing for Symmetric Distributions7
Efficient Identification of Butterfly Sparse Matrix Factorizations7
Benefit of Interpolation in Nearest Neighbor Algorithms7
Supervised Gromov–Wasserstein Optimal Transport with Metric-Preserving Constraints7
LASSO Reloaded: A Variational Analysis Perspective with Applications to Compressed Sensing6
Operator Shifting for General Noisy Matrix Systems6
Computing Wasserstein Barycenters via Operator Splitting: The Method of Averaged Marginals6
Post-training Quantization for Neural Networks with Provable Guarantees6
HADES: Fast Singularity Detection with Local Measure Comparison5
Complete and Continuous Invariants of 1-Periodic Sequences in Polynomial Time5
Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning5
Adaptive Joint Distribution Learning5
Memory Capacity of Two Layer Neural Networks with Smooth Activations5
KL Convergence Guarantees for Score Diffusion Models under Minimal Data Assumptions5
Sequential Construction and Dimension Reduction of Gaussian Processes Under Inequality Constraints5
Phase Retrieval with Semialgebraic and ReLU Neural Network Priors5
Multifidelity Covariance Estimation via Regression on the Manifold of Symmetric Positive Definite Matrices4
Fast Kernel Summation in High Dimensions via Slicing and Fourier Transforms4
Spectral Properties of Elementwise-Transformed Spiked Matrices4
Optimality Conditions for Nonsmooth Nonconvex-Nonconcave Min-Max Problems and Generative Adversarial Networks4
Convergence of a Constrained Vector Extrapolation Scheme4
Stochastic Gradient Descent for Streaming Linear and Rectified Linear Systems with Adversarial Corruptions4
Accelerated and Instance-Optimal Policy Evaluation with Linear Function Approximation4
Stability of Sequential Lateration and of Stress Minimization in the Presence of Noise4
Insights into Kernel PCA with Application to Multivariate Extremes3
Nonlinear Weighted Directed Acyclic Graph and A Priori Estimates for Neural Networks3
Simple Alternating Minimization Provably Solves Complete Dictionary Learning3
Optimization on Manifolds via Graph Gaussian Processes3
Approximate Q Learning for Controlled Diffusion Processes and Its Near Optimality3
Approximating Probability Distributions by Using Wasserstein Generative Adversarial Networks3
The Common Intuition to Transfer Learning Can Win or Lose: Case Studies for Linear Regression3
First-Order Conditions for Optimization in the Wasserstein Space3
Ensemble Linear Interpolators: The Role of Ensembling3
Network Online Change Point Localization3
Causal Structural Learning via Local Graphs3
Diffeomorphic Measure Matching with Kernels for Generative Modeling3
Robust Reinforcement Learning with Dynamic Distortion Risk Measures3
Efficiency of ETA Prediction3
Exploring Variance Reduction in Importance Sampling for Efficient DNN Training3
On the Rates of Convergence for Learning with Convolutional Neural Networks3
Entropic Optimal Transport on Random Graphs3
Lipschitz-Regularized Gradient Flows and Generative Particle Algorithms for High-Dimensional Scarce Data3
Approximate Message Passing with Rigorous Guarantees for Pooled Data and Quantitative Group Testing3
$k$-Variance: A Clustered Notion of Variance3
A Priori Estimates for Deep Residual Network in Continuous-Time Reinforcement Learning3
An Adaptively Inexact First-Order Method for Bilevel Optimization with Application to Hyperparameter Learning3
Stochastic Optimal Transport in Banach Spaces for Regularized Estimation of Multivariate Quantiles3
Faster Rates for Compressed Federated Learning with Client-Variance Reduction2
Landmark Alternating Diffusion2
Wasserstein-Based Projections with Applications to Inverse Problems2
Fredholm Integral Equations for Function Approximation and the Training of Neural Networks2
The Positivity of the Neural Tangent Kernel2
Improving the Accuracy-Robustness Trade-Off of Classifiers via Adaptive Smoothing2
Optimally Weighted PCA for High-Dimensional Heteroscedastic Data2
Double Double Descent: On Generalization Errors in Transfer Learning between Linear Regression Tasks2
Estimating a Potential Without the Agony of the Partition Function2
On the Nonconvexity of Push-Forward Constraints and Its Consequences in Machine Learning2
Determinantal Point Processes Implicitly Regularize Semiparametric Regression Problems2
On Design of Polyhedral Estimates in Linear Inverse Problems2
Enforcing Katz and PageRank Centrality Measures in Complex Networks2
Applications of No-Collision Transportation Maps in Manifold Learning2
\({O({k})}\)-Equivariant Dimensionality Reduction on Stiefel Manifolds2
Sharp Analysis of Sketch-and-Project Methods via a Connection to Randomized Singular Value Decomposition2
Online MCMC Thinning with Kernelized Stein Discrepancy2
Randomly Initialized Alternating Least Squares: Fast Convergence for Matrix Sensing2
Energy-Based Sequential Sampling for Low-Rank PSD-Matrix Approximation2
Principles for Initialization and Architecture Selection in Graph Neural Networks with ReLU Activations2
Accelerated Bregman Primal-Dual Methods Applied to Optimal Transport and Wasserstein Barycenter Problems2
0.075029134750366