IEEE Transactions on Pattern Analysis and Machine Intelligence

Papers
(The median citation count of IEEE Transactions on Pattern Analysis and Machine Intelligence is 6. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-04-01 to 2025-04-01.)
ArticleCitations
[Front inside cover]2531
IEEE Transactions on Pattern Analysis and Machine Intelligence Publication Information1942
Cover 21248
Cover 4 [Table of Contents, back cover]1175
Front Cover1163
Towards Robust Probabilistic Modeling on SO(3) via Rotation Laplace Distribution1121
Semi-supervised Counting via Pixel-by-pixel Density Distribution Modelling1086
IEEE Computer Society Information1080
Front Cover1005
Cover 3999
Towards Expressive Spectral-Temporal Graph Neural Networks for Time Series Forecasting994
Gait Recognition in the Wild: A Large-scale Benchmark and NAS-based Baseline784
Diffusion Models in Low-Level Vision: A Survey651
Table of Contents633
[Back inside cover]603
Cover590
Editorial Board573
Towards Accurate Post-Training Quantization of Vision Transformers via Error Reduction548
Learning the Optimal Discriminant SVM With Feature Extraction526
Instruction-Guided Scene Text Recognition473
Visible-Thermal Tiny Object Detection: A Benchmark Dataset and Baselines455
Quantum Gated Recurrent Neural Networks413
One-for-All: Towards Universal Domain Translation With a Single StyleGAN400
Glissando-Net: Deep Single View Category Level Pose Estimation and 3D Reconstruction394
DHVT: Dynamic Hybrid Vision Transformer for Small Dataset Recognition388
Behind Every Domain There is a Shift: Adapting Distortion-Aware Vision Transformers for Panoramic Semantic Segmentation387
AD-VAT+: An Asymmetric Dueling Mechanism for Learning and Understanding Visual Active Tracking387
Video Coding for Machines: Compact Visual Representation Compression for Intelligent Collaborative Analytics356
Old Photo Restoration via Deep Latent Space Translation348
Decoding Brain Representations by Multimodal Learning of Neural Activity and Visual Features340
Exploiting Wavelength Diversity for High Resolution Time-of-Flight 3D Imaging337
A Lightweight Optical Flow CNN —Revisiting Data Fidelity and Regularization336
SIFT Matching by Context Exposed330
A Survey on Deep Learning Techniques for Stereo-Based Depth Estimation324
Learning Invariance From Generated Variance for Unsupervised Person Re-Identification319
Sparse R-CNN: An End-to-End Framework for Object Detection313
Guaranteed Tensor Recovery Fused Low-rankness and Smoothness305
Consistent 3D Hand Reconstruction in Video via Self-Supervised Learning301
Normalization Techniques in Training DNNs: Methodology, Analysis and Application299
Deep Long-Tailed Learning: A Survey296
An Integrated Fast Hough Transform for Multidimensional Data292
Lazily Aggregated Quantized Gradient Innovation for Communication-Efficient Federated Learning291
Joint Feature Synthesis and Embedding: Adversarial Cross-Modal Retrieval Revisited278
On the Optimality of Sufficient Statistics-based Quantizers275
IEEE Computer Society Information274
A Causal Adjustment Module for Debiasing Scene Graph Generation273
Quasi-Metric Learning for Bilateral Person-Job Fit270
Global Model Selection via Solution Paths for Robust Support Vector Machine268
Remembering What is Important: A Factorised Multi-Head Retrieval and Auxiliary Memory Stabilisation Scheme for Human Motion Prediction267
IEEE Transactions on Pattern Analysis and Machine Intelligence Publication Information266
DPODv2: Dense Correspondence-Based 6 DoF Pose Estimation262
Multiview Unsupervised Shapelet Learning for Multivariate Time Series Clustering261
DaisyRec 2.0: Benchmarking Recommendation for Rigorous Evaluation260
PyMAF-X: Towards Well-Aligned Full-Body Model Regression From Monocular Images252
Learning an Invariant and Equivariant Network for Weakly Supervised Object Detection247
Continual Image Deraining With Hypergraph Convolutional Networks244
Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering239
A Unified Visual Information Preservation Framework for Self-supervised Pre-training in Medical Image Analysis238
Instance Shadow Detection with A Single-Stage Detector231
On the Decision Boundaries of Neural Networks: A Tropical Geometry Perspective218
Adaptive Region-Specific Loss for Improved Medical Image Segmentation214
Transformer-Based Visual Segmentation: A Survey209
OPAL: Occlusion Pattern Aware Loss for Unsupervised Light Field Disparity Estimation209
UniMiSS+: Universal Medical Self-Supervised Learning From Cross-Dimensional Unpaired Data206
Robust Multi-View Clustering With Incomplete Information202
PMP-Net++: Point Cloud Completion by Transformer-Enhanced Multi-Step Point Moving Paths196
Wavelet Approximation-Aware Residual Network for Single Image Deraining194
CAS(ME)<sup>3</sup>: A Third Generation Facial Spontaneous Micro-Expression Database with Depth Information and High Ecological Validity192
Unsupervised Grouped Axial Data Modeling via Hierarchical Bayesian Nonparametric Models With Watson Distributions187
Deep Learning Methods for Calibrated Photometric Stereo and Beyond184
BadLabel: A Robust Perspective on Evaluating and Enhancing Label-Noise Learning184
Fully Sparse Fusion for 3D Object Detection182
DVIS++: Improved Decoupled Framework for Universal Video Segmentation178
Systematic Bias of Machine Learning Regression Models and Correction177
Impact of Noisy Supervision in Foundation Model Learning174
S-NeRF++: Autonomous Driving Simulation via Neural Reconstruction and Generation174
Publications Seek 2023 Editors in Chief172
[Back cover - Table of contents, continued]172
Cover169
[Back cover]167
Cover166
Cover165
Table of Contents163
Front Cover161
IEEE Quantum Week161
Cover 4 [Table of Contents]160
Cover 2159
Seed the Views: Hierarchical Semantic Alignment for Contrastive Representation Learning158
Learning to Follow and Generate Instructions for Language-Capable Navigation157
Locating and Counting Heads in Crowds With a Depth Prior157
Fast and Informative Model Selection Using Learning Curve Cross-Validation156
Detecting Line Segments in Motion-Blurred Images With Events155
From NeRFLiX to NeRFLiX++: A General NeRF-Agnostic Restorer Paradigm154
Extraction of an Explanatory Graph to Interpret a CNN154
Learning Signed Hyper Surfaces for Oriented Point Cloud Normal Estimation153
Continuous-Time Object Segmentation Using High Temporal Resolution Event Camera152
Probing Synergistic High-Order Interaction for Multi-Modal Image Fusion151
Learning Without Forgetting for Vision-Language Models151
Intelligent Bionic Polarization Orientation Method Using Biological Neuron Model for Harsh Conditions151
Equivariant Diffusion Model with A5-Group Neurons for Joint Pose Estimation and Shape Reconstruction151
MB-RACS: Measurement-Bounds-based Rate-Adaptive Image Compressed Sensing Network147
BEVHeight++: Toward Robust Visual Centric 3D Object Detection146
Active Supervised Cross-Modal Retrieval145
Fourier-Based and Rational Graph Filters for Spectral Processing141
SimSwap++: Towards Faster and High-Quality Identity Swapping140
Domain Adaptive and Generalizable Network Architectures and Training Strategies for Semantic Image Segmentation140
Towards Accurate and Compact Architectures via Neural Architecture Transformer139
RayMVSNet++: Learning Ray-Based 1D Implicit Fields for Accurate Multi-View Stereo138
Hunter: Exploring High-Order Consistency for Point Cloud Registration With Severe Outliers137
Multi-Task Head Pose Estimation in-the-Wild136
Context Disentangling and Prototype Inheriting for Robust Visual Grounding131
DeepEMD: Differentiable Earth Mover's Distance for Few-Shot Learning130
One-Hot Graph Encoder Embedding130
SibNet: Sibling Convolutional Encoder for Video Captioning128
Incomplete Label Multiple Instance Multiple Label Learning128
Towards Age-Invariant Face Recognition127
DSGN++: Exploiting Visual-Spatial Relation for Stereo-Based 3D Detectors126
PLMP – Point-Line Minimal Problems in Complete Multi-View Visibility125
Booster: A Benchmark for Depth From Images of Specular and Transparent Surfaces122
Simple Primitives With Feasibility- and Contextuality-Dependence for Open-World Compositional Zero-Shot Learning122
On Exploring Multiplicity of Primitives and Attributes for Texture Recognition in the Wild122
Boosting Photon-Efficient Image Reconstruction with A Unified Deep Neural Network121
A Deterministic Approximation to Neural SDEs120
An Effective Motion-Centric Paradigm for 3D Single Object Tracking in Point Clouds120
BiFuse++: Self-Supervised and Efficient Bi-Projection Fusion for $360^{\circ }$ Depth Estimation119
Node-Oriented Spectral Filtering for Graph Neural Networks118
Deep Image Matting With Sparse User Interactions116
MPS-NeRF: Generalizable 3D Human Rendering From Multiview Images113
Importance Weighted Structure Learning for Scene Graph Generation113
Parameterized Hamiltonian Learning With Quantum Circuit111
Learning Optical Flow and Scene Flow With Bidirectional Camera-LiDAR Fusion111
Searching a High Performance Feature Extractor for Text Recognition Network110
Revisiting Computer-Aided Tuberculosis Diagnosis109
PFENet++: Boosting Few-Shot Semantic Segmentation With the Noise-Filtered Context-Aware Prior Mask109
Point Cloud Attacks in Graph Spectral Domain: When 3D Geometry Meets Graph Signal Processing108
Self-Training Boosted Multi-Factor Matching Network for Composed Image Retrieval108
Learning Dynamic Scene-Conditioned 3D Object Detectors108
X2-VLM: All-in-One Pre-Trained Model for Vision-Language Tasks108
ZJUT-EIFD: A Synchronously Collected External and Internal Fingerprint Database108
Fast Graph Generation via Spectral Diffusion107
Understanding and Accelerating Neural Architecture Search With Training-Free and Theory-Grounded Metrics107
Sheared Epipolar Focus Spectrum for Dense Light Field Reconstruction107
Realize Generative Yet Complete Latent Representation for Incomplete Multi-View Learning106
Adaptive Cross-Modal Transferable Adversarial Attacks From Images to Videos105
AnyFace++: A Unified Framework for Free-style Text-to-Face Synthesis and Manipulation105
Semi-Infinitely Constrained Markov Decision Processes and Provably Efficient Reinforcement Learning104
Learning Bilateral Cost Volume for Rolling Shutter Temporal Super-Resolution102
Incomplete Gamma Kernels: Generalizing Locally Optimal Projection Operators102
Graph Transformer GANs With Graph Masked Modeling for Architectural Layout Generation101
Face Generation and Editing With StyleGAN: A Survey101
Stereo Image Restoration via Attention-Guided Correspondence Learning100
B-Cos Alignment for Inherently Interpretable CNNs and Vision Transformers99
ASP: Learn a Universal Neural Solver!99
Unsupervised Test-Time Adaptation Learning for Effective Hyperspectral Image Super-Resolution With Unknown Degeneration98
Evidential Multi-Source-Free Unsupervised Domain Adaptation98
Adaptive Perturbation for Adversarial Attack97
Deep Variational Network Toward Blind Image Restoration97
Separable Spatial-Temporal Residual Graph for Cloth-Changing Group Re-Identification96
Sequential Point Clouds: A Survey95
CrossHomo: Cross-Modality and Cross-Resolution Homography Estimation95
A Semantic and Motion-Aware Spatiotemporal Transformer Network for Action Detection95
Global Instance Tracking: Locating Target More Like Humans94
Single Day Outdoor Photometric Stereo94
Recent Advances in Optimal Transport for Machine Learning94
Few-Shot Partial Multi-View Learning94
Erratum to “Deep Back-Projection Networks for Single Image Super-Resolution”93
NVDS+: Towards Efficient and Versatile Neural Stabilizer for Video Depth Estimation92
Sensing Diversity and Sparsity Models for Event Generation and Video Reconstruction from Events92
Relationship-Embedded Representation Learning for Grounding Referring Expressions92
Perceptual Texture Similarity Estimation: An Evaluation of Computational Features92
Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection91
Estimating Information Theoretic Measures via Multidimensional Gaussianization91
Generalized Face Liveness Detection via De-Fake Face Generator90
PWLU: Learning Specialized Activation Functions With the Piecewise Linear Unit90
Bilinear Scoring Function Search for Knowledge Graph Learning90
Cyclic Differentiable Architecture Search89
Saliency as Pseudo-Pixel Supervision for Weakly and Semi-Supervised Semantic Segmentation89
Few-Shot Multi-Agent Perception With Ranking-Based Feature Learning88
PATNAS: A Path-Based Training-Free Neural Architecture Search88
Rainbow UDA: Combining Domain Adaptive Models for Semantic Segmentation Tasks86
NAS-PED: Neural Architecture Search for Pedestrian Detection85
Model Study of Transient Imaging With Multi-Frequency Time-of-Flight Sensors84
Task-Oriented Channel Attention for Fine-Grained Few-Shot Classification84
Text-Driven Video Acceleration: A Weakly-Supervised Reinforcement Learning Method84
PointGLR: Unsupervised Structural Representation Learning of 3D Point Clouds83
IBCS: Learning Information Bottleneck-Constrained Denoised Causal Subgraph for Graph Classification83
Unsupervised Global and Local Homography Estimation With Coplanarity-Aware GAN83
Self-Supervised Arbitrary-Scale Implicit Point Clouds Upsampling83
Feature Re-Representation and Reliable Pseudo Label Retraining for Cross-Domain Semantic Segmentation82
Multiview Feature Selection for Single-View Classification81
A Review of Deep Learning for Video Captioning80
Towards a Complete 3D Morphable Model of the Human Head80
Heterogeneous Few-Shot Model Rectification With Semantic Mapping80
Minimizing Negative Transfer of Knowledge in Multivariate Gaussian Processes: A Scalable and Regularized Approach80
Recovering 3D Human Mesh From Monocular Images: A Survey79
Eigendecomposition-Free Training of Deep Networks for Linear Least-Square Problems79
Diagnosing and Preventing Instabilities in Recurrent Video Processing79
Progressive Cross-Stream Cooperation in Spatial and Temporal Domain for Action Localization79
Learning Symbolic Model-Agnostic Loss Functions via Meta-Learning78
The Group Loss++: A Deeper Look Into Group Loss for Deep Metric Learning78
Multiscale Dynamic Graph Representation for Biometric Recognition With Occlusions78
Cross-Lingual Universal Dependency Parsing Only From One Monolingual Treebank78
Object Affinity Learning: Towards Annotation-Free Instance Segmentation77
MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation Segmentation77
Toyota Smarthome Untrimmed: Real-World Untrimmed Videos for Activity Detection76
Maximum Block Energy Guided Robust Subspace Clustering76
Emotional Attention: From Eye Tracking to Computational Modeling76
Bridging the Gap Between Computational Photography and Visual Recognition76
Modeling Noisy Annotations for Point-Wise Supervision75
Bag of Tricks for Training Deeper Graph Neural Networks: A Comprehensive Benchmark Study75
Digging Into Uncertainty-Based Pseudo-Label for Robust Stereo Matching75
You Only Train Once: Learning General and Distinctive 3D Local Descriptors74
Exploring Simple and Transferable Recognition-Aware Image Processing73
Deep Generative Mixture Model for Robust Imbalance Classification73
End-to-End One-Shot Human Parsing73
Extended : Learning with Mixed Closed-set and Open-set Noisy Labels72
An Asynchronous Linear Filter Architecture for Hybrid Event-Frame Cameras72
Second-Order Unsupervised Feature Selection via Knowledge Contrastive Distillation72
Logarithmic Schatten-p Norm Minimization for Tensorial Multi-view Subspace Clustering72
Superadditivity and Convex Optimization for Globally Optimal Cell Segmentation Using Deformable Shape Models72
Circular Silhouette and a Fast Algorithm72
Discrete and Balanced Spectral Clustering With Scalability71
Compositional Semantic Mix for Domain Adaptation in Point Cloud Segmentation71
Interactive NeRF Geometry Editing With Shape Priors70
Contextualizing Meta-Learning via Learning to Decompose70
Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models70
Generative Text Convolutional Neural Network for Hierarchical Document Representation Learning69
Light Field Neural Rendering69
Towards a Deeper Understanding of Global Covariance Pooling in Deep Learning: An Optimization Perspective69
Deep Scene Flow Learning: From 2D Images to 3D Point Clouds69
Large-Scale Object Detection in the Wild With Imbalanced Data Distribution, and Multi-Labels68
Debiased Scene Graph Generation for Dual Imbalance Learning68
Adaptive Subgraph Neural Network with Reinforced Critical Structure Mining68
EM-Driven Unsupervised Learning for Efficient Motion Segmentation68
Content-Aware Warping for View Synthesis68
DebSDF: Delving Into the Details and Bias of Neural Indoor Scene Reconstruction68
How to Query an Oracle? Efficient Strategies to Label Data67
Rebuttal to “Comments on ‘Decoding Brain Representations by Multimodal Learning of Neural Activity and Visual Features’ ”67
Markov Progressive Framework, a Universal Paradigm for Modeling Long Videos67
A Visual Approach to Measure Cloth-Body and Cloth-Cloth Friction67
Attention-Guided Low-Rank Tensor Completion67
Fast Learning of Signed Distance Functions From Noisy Point Clouds via Noise to Noise Mapping67
Inductive State-Relabeling Adversarial Active Learning With Heuristic Clique Rescaling66
U-Match: Exploring Hierarchy-Aware Local Context for Two-View Correspondence Learning66
Learn to Predict Sets Using Feed-Forward Neural Networks66
GAN Compression: Efficient Architectures for Interactive Conditional GANs66
Variational Nested Dropout66
GCP: Graph Encoder With Content-Planning for Sentence Generation From Knowledge Bases66
CO-Net++: A Cohesive Network for Multiple Point Cloud Tasks at Once With Two-Stage Feature Rectification66
Attention in Reasoning: Dataset, Analysis, and Modeling65
0.071851015090942