SIAM Journal on Mathematics of Data Science

Papers
(The median citation count of SIAM Journal on Mathematics of Data Science is 1. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-08-01 to 2025-08-01.)
ArticleCitations
Spectral Barron Space for Deep Neural Network Approximation35
A Simple and Optimal Algorithm for Strict Circular Seriation26
Adaptivity of Stochastic Gradient Methods for Nonconvex Optimization25
Taming Neural Networks with TUSLA: Nonconvex Learning via Adaptive Stochastic Gradient Langevin Algorithms23
On the Inconsistency of Kernel Ridgeless Regression in Fixed Dimensions21
Resolving the Mixing Time of the Langevin Algorithm to Its Stationary Distribution for Log-Concave Sampling19
Efficient Algorithms for Regularized Nonnegative Scale-Invariant Low-Rank Approximation Models19
Learning Functions Varying along a Central Subspace19
A Note on the Regularity of Images Generated by Convolutional Neural Networks15
Randomized Nyström Approximation of Non-negative Self-Adjoint Operators15
New Equivalences between Interpolation and SVMs: Kernels and Structured Features14
Poisson Reweighted Laplacian Uncertainty Sampling for Graph-Based Active Learning13
Safe Rules for the Identification of Zeros in the Solutions of the SLOPE Problem11
Persistent Laplacians: Properties, Algorithms and Implications11
Nonlinear Tomographic Reconstruction via Nonsmooth Optimization11
Quantitative Approximation Results for Complex-Valued Neural Networks11
Nonbacktracking Spectral Clustering of Nonuniform Hypergraphs11
Wassmap: Wasserstein Isometric Mapping for Image Manifold Learning10
Scalable Tensor Methods for Nonuniform Hypergraphs10
Online Machine Teaching under Learner Uncertainty: Gradient Descent Learners of a Quadratic Loss10
CA-PCA: Manifold Dimension Estimation, Adapted for Curvature10
Convergence of a Piggyback-Style Method for the Differentiation of Solutions of Standard Saddle-Point Problems10
Asymptotics of the Sketched Pseudoinverse9
The GenCol Algorithm for High-Dimensional Optimal Transport: General Formulation and Application to Barycenters and Wasserstein Splines8
Stochastic Variance-Reduced Majorization-Minimization Algorithms8
The Sample Complexity of Sparse Multireference Alignment and Single-Particle Cryo-Electron Microscopy8
Function-Space Optimality of Neural Architectures with Multivariate Nonlinearities8
A Variational Formulation of Accelerated Optimization on Riemannian Manifolds7
Finite-Time Analysis of Natural Actor-Critic for POMDPs7
Supervised Gromov–Wasserstein Optimal Transport with Metric-Preserving Constraints7
Optimal Dorfman Group Testing for Symmetric Distributions7
Group-Invariant Tensor Train Networks for Supervised Learning7
Inverse Evolution Layers: Physics-Informed Regularizers for Image Segmentation7
A Nonlinear Matrix Decomposition for Mining the Zeros of Sparse Data7
ABBA Neural Networks: Coping with Positivity, Expressivity, and Robustness6
The Geometric Median and Applications to Robust Mean Estimation6
Convergence of Gradient Descent for Recurrent Neural Networks: A Nonasymptotic Analysis6
Efficient Identification of Butterfly Sparse Matrix Factorizations6
Bi-Invariant Dissimilarity Measures for Sample Distributions in Lie Groups6
Numerical Considerations and a new implementation for invariant coordinate selection6
Computing Wasserstein Barycenters via Operator Splitting: The Method of Averaged Marginals6
Benefit of Interpolation in Nearest Neighbor Algorithms6
Post-training Quantization for Neural Networks with Provable Guarantees5
Randomized Wasserstein Barycenter Computation: Resampling with Statistical Guarantees5
Operator Shifting for General Noisy Matrix Systems5
Memory Capacity of Two Layer Neural Networks with Smooth Activations5
LASSO Reloaded: A Variational Analysis Perspective with Applications to Compressed Sensing5
KL Convergence Guarantees for Score Diffusion Models under Minimal Data Assumptions5
Adaptive Joint Distribution Learning5
Sequential Construction and Dimension Reduction of Gaussian Processes Under Inequality Constraints5
Robust Classification Under $\ell_0$ Attack for the Gaussian Mixture Model5
Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning4
Fast Kernel Summation in High Dimensions via Slicing and Fourier Transforms4
Spectral Properties of Elementwise-Transformed Spiked Matrices4
A Generalized CUR Decomposition for Matrix Pairs4
Sensitivity-Informed Provable Pruning of Neural Networks4
Multifidelity Covariance Estimation via Regression on the Manifold of Symmetric Positive Definite Matrices4
Stability of Sequential Lateration and of Stress Minimization in the Presence of Noise4
Stochastic Gradient Descent for Streaming Linear and Rectified Linear Systems with Adversarial Corruptions4
Accelerated and Instance-Optimal Policy Evaluation with Linear Function Approximation4
A Unifying Generative Model for Graph Learning Algorithms: Label Propagation, Graph Convolutions, and Combinations4
Optimality Conditions for Nonsmooth Nonconvex-Nonconcave Min-Max Problems and Generative Adversarial Networks4
Convergence of a Constrained Vector Extrapolation Scheme4
Insights into Kernel PCA with Application to Multivariate Extremes3
Nonlinear Weighted Directed Acyclic Graph and A Priori Estimates for Neural Networks3
Causal Structural Learning via Local Graphs3
A Priori Estimates for Deep Residual Network in Continuous-Time Reinforcement Learning3
Block Bregman Majorization Minimization with Extrapolation3
Approximating Probability Distributions by Using Wasserstein Generative Adversarial Networks3
Entropic Optimal Transport on Random Graphs3
Ensemble Linear Interpolators: The Role of Ensembling2
Efficiency of ETA Prediction2
Diffeomorphic Measure Matching with Kernels for Generative Modeling2
Sharp Analysis of Sketch-and-Project Methods via a Connection to Randomized Singular Value Decomposition2
$k$-Variance: A Clustered Notion of Variance2
Optimization on Manifolds via Graph Gaussian Processes2
Network Online Change Point Localization2
An Adaptively Inexact First-Order Method for Bilevel Optimization with Application to Hyperparameter Learning2
Binary Classification of Gaussian Mixtures: Abundance of Support Vectors, Benign Overfitting, and Regularization2
The Common Intuition to Transfer Learning Can Win or Lose: Case Studies for Linear Regression2
Estimating a Potential Without the Agony of the Partition Function2
Approximate Message Passing with Rigorous Guarantees for Pooled Data and Quantitative Group Testing2
Lipschitz-Regularized Gradient Flows and Generative Particle Algorithms for High-Dimensional Scarce Data2
First-Order Conditions for Optimization in the Wasserstein Space2
Approximate Q Learning for Controlled Diffusion Processes and Its Near Optimality2
Faster Rates for Compressed Federated Learning with Client-Variance Reduction2
Accelerated Bregman Primal-Dual Methods Applied to Optimal Transport and Wasserstein Barycenter Problems2
Positive Semi-definite Embedding for Dimensionality Reduction and Out-of-Sample Extensions2
Simple Alternating Minimization Provably Solves Complete Dictionary Learning2
\({O({k})}\)-Equivariant Dimensionality Reduction on Stiefel Manifolds1
Applications of No-Collision Transportation Maps in Manifold Learning1
Randomly Initialized Alternating Least Squares: Fast Convergence for Matrix Sensing1
The Positivity of the Neural Tangent Kernel1
Overcomplete Order-3 Tensor Decomposition, Blind Deconvolution, and Gaussian Mixture Models1
When Big Data Actually Are Low-Rank, or Entrywise Approximation of Certain Function-Generated Matrices1
Corrigendum: Post-training Quantization for Neural Networks with Provable Guarantees1
Finding Planted Cliques Using Gradient Descent1
Federated Primal Dual Fixed Point Algorithm1
Fast Cluster Detection in Networks by First Order Optimization1
Energy-Based Sequential Sampling for Low-Rank PSD-Matrix Approximation1
Wasserstein-Based Projections with Applications to Inverse Problems1
Principles for Initialization and Architecture Selection in Graph Neural Networks with ReLU Activations1
Double Double Descent: On Generalization Errors in Transfer Learning between Linear Regression Tasks1
Online MCMC Thinning with Kernelized Stein Discrepancy1
Target Network and Truncation Overcome the Deadly Triad in \(\boldsymbol{Q}\)-Learning1
Core-Periphery Detection in Hypergraphs1
What Kinds of Functions Do Deep Neural Networks Learn? Insights from Variational Spline Theory1
Two Steps at a Time---Taking GAN Training in Stride with Tseng's Method1
Intrinsic Dimension Adaptive Partitioning for Kernel Methods1
Nonparametric Finite Mixture Models with Possible Shape Constraints: A Cubic Newton Approach1
Identifying 3D Genome Organization in Diploid Organisms via Euclidean Distance Geometry1
Improving the Accuracy-Robustness Trade-Off of Classifiers via Adaptive Smoothing1
Landmark Alternating Diffusion1
Optimally Weighted PCA for High-Dimensional Heteroscedastic Data1
On the Nonconvexity of Push-Forward Constraints and Its Consequences in Machine Learning1
Fredholm Integral Equations for Function Approximation and the Training of Neural Networks1
Gradient Descent in the Absence of Global Lipschitz Continuity of the Gradients1
High-Dimensional Analysis of Double Descent for Linear Regression with Random Projections1
A Universal Trade-off Between the Model Size, Test Loss, and Training Loss of Linear Predictors1
Structural Balance and Random Walks on Complex Networks with Complex Weights1
Approximation Bounds for Sparse Programs1
On Design of Polyhedral Estimates in Linear Inverse Problems1
Determinantal Point Processes Implicitly Regularize Semiparametric Regression Problems1
0.028502941131592