IEEE Transactions on Pattern Analysis and Machine Intelligence

Papers
(The median citation count of IEEE Transactions on Pattern Analysis and Machine Intelligence is 7. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-07-01 to 2025-07-01.)
ArticleCitations
Learn to Predict Sets Using Feed-Forward Neural Networks2777
DAQE: Enhancing the Quality of Compressed Images by Exploiting the Inherent Characteristic of Defocus2327
Task-Oriented Channel Attention for Fine-Grained Few-Shot Classification1344
MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation Segmentation1312
Modeling Noisy Annotations for Point-Wise Supervision1268
Invariant Policy Learning: A Causal Perspective1218
Cover1195
Editorial Board1171
[Back cover - Table of contents, continued]1154
Front Cover870
Adaptive Surface Normal Constraint for Geometric Estimation From Monocular Images775
A Generative Model for Generic Light Field Reconstruction739
Motion-Aware Dynamic Graph Neural Network for Video Compressive Sensing627
Vertical Layering of Quantized Neural Networks for Heterogeneous Inference627
Principal Uncertainty Quantification With Spatial Correlation for Image Restoration Problems618
Weakly Supervised Semantic Segmentation via Box-Driven Masking and Filling Rate Shifting560
Ensemble-Enhanced Semi-Supervised Learning With Optimized Graph Construction for High-Dimensional Data539
Digging Into Uncertainty-Based Pseudo-Label for Robust Stereo Matching519
One-for-All: Towards Universal Domain Translation With a Single StyleGAN473
Deep Non-Rigid Structure From Motion With Missing Data449
Learning Signed Hyper Surfaces for Oriented Point Cloud Normal Estimation444
Towards Unified Deep Image Deraining: A Survey and a New Benchmark429
Enhancing Representations Through Heterogeneous Self-Supervised Learning417
DVIS++: Improved Decoupled Framework for Universal Video Segmentation405
Probing Synergistic High-Order Interaction for Multi-Modal Image Fusion399
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting393
Learning to Guide a Saturation-Based Theorem Prover383
Detection-Friendly Dehazing: Object Detection in Real-World Hazy Scenes378
Centerless Clustering367
Omni-Training: Bridging Pre-Training and Meta-Training for Few-Shot Learning363
A Clustering Validity Index with Multi-Granularity Fusion for Multiple Fuzzy Clustering Algorithms357
Structure-Preserving Image Super-Resolution353
Symbolic Visual Reinforcement Learning: A Scalable Framework With Object-Level Abstraction and Differentiable Expression Search350
Quadratic Matrix Factorization With Applications to Manifold Learning341
A Hybrid Stochastic-Deterministic Minibatch Proximal Gradient Method for Efficient Optimization and Generalization341
A Comprehensive and Modularized Statistical Framework for Gradient Norm Equality in Deep Neural Networks338
Interactive NeRF Geometry Editing With Shape Priors331
Towards Accurate and Compact Architectures via Neural Architecture Transformer321
ResNet-LDDMM: Advancing the LDDMM Framework using Deep Residual Networks321
Graph Convolutional Module for Temporal Action Localization in Videos310
Towards Expressive Spectral-Temporal Graph Neural Networks for Time Series Forecasting301
S-NeRF++: Autonomous Driving Simulation via Neural Reconstruction and Generation296
Point-to-Pixel Prompting for Point Cloud Analysis With Pre-Trained Image Models294
Implicit Annealing in Kernel Spaces: A Strongly Consistent Clustering Approach290
Instance Shadow Detection with A Single-Stage Detector283
Physics-Informed Guided Disentanglement in Generative Networks283
VATr++: Choose Your Words Wisely for Handwritten Text Generation281
Multi-Task Head Pose Estimation in-the-Wild270
Interaction-Based Inductive Bias in Graph Neural Networks: Enhancing Protein-Ligand Binding Affinity Predictions From 3D Structures270
Locating and Counting Heads in Crowds With a Depth Prior265
Simplicial Complex Neural Networks264
Guaranteed Tensor Recovery Fused Low-rankness and Smoothness257
Optimization-Based Post-Training Quantization With Bit-Split and Stitching255
On the Trade-off between Flatness and Optimization in Distributed Learning249
Learning With Style: Continual Semantic Segmentation Across Tasks and Domains244
Are Graph Convolutional Networks With Random Weights Feasible?235
Active Supervised Cross-Modal Retrieval234
Separable Spatial-Temporal Residual Graph for Cloth-Changing Group Re-Identification223
OPAL: Occlusion Pattern Aware Loss for Unsupervised Light Field Disparity Estimation223
Transformer-Based Visual Segmentation: A Survey222
Asymmetric Convolution: An Efficient and Generalized Method to Fuse Feature Maps in Multiple Vision Tasks219
Prior Image Guided Snapshot Compressive Spectral Imaging217
Affective Image Content Analysis: Two Decades Review and New Perspectives216
Metrics for Dataset Demographic Bias: A Case Study on Facial Expression Recognition215
Multiple Video Frame Interpolation via Enhanced Deformable Separable Convolution204
Deep Long-Tailed Learning: A Survey204
Face Forgery Detection by 3D Decomposition and Composition Search203
Sparse-to-Dense Matching Network for Large-Scale LiDAR Point Cloud Registration203
Analysis of Video Quality Datasets via Design of Minimalistic Video Quality Models200
Learning Graph Convolutional Networks for Multi-Label Recognition and Applications197
A Survey on Deep Learning Techniques for Stereo-Based Depth Estimation197
Multi-Dataset, Multitask Learning of Egocentric Vision Tasks186
Inferring Point Cloud Quality via Graph Similarity185
Multiview Clustering: A Scalable and Parameter-Free Bipartite Graph Fusion Method185
Face Generation and Editing With StyleGAN: A Survey182
Fast Component Tree Computation for Images of Limited Levels181
Unsupervised Domain Adaptation via Discriminative Manifold Propagation181
Cover179
Human-Centric Transformer for Domain Adaptive Action Recognition179
Cover 2178
Cover177
Cover176
Table of Contents175
IEEE Computer Society Has You Covered!175
TPAMI Information for Authors174
Point Set Registration for 3D Range Scans Using Fuzzy Cluster-Based Metric and Efficient Global Optimization171
Towards Deviation-Robust Agent Navigation via Perturbation-Aware Contrastive Learning171
Evaluation for Weakly Supervised Object Localization: Protocol, Metrics, and Datasets169
A Variational EM Acceleration for Efficient Clustering at Very Large Scales169
Compositional Scene Representation Learning via Reconstruction: A Survey167
Scalable Optimal Transport Methods in Machine Learning: A Contemporary Survey165
MRA-Net: Improving VQA Via Multi-Modal Relation Attention Network164
Image Lens Flare Removal Using Adversarial Curve Learning163
On the Robustness of Average Losses for Partial-Label Learning161
RF-Next: Efficient Receptive Field Search for Convolutional Neural Networks159
Learning to See Through With Events158
PathNet: Path-Selective Point Cloud Denoising157
MoIL: Momentum Imitation Learning for Efficient Vision-Language Adaptation157
Out-of-Domain Generalization From a Single Source: An Uncertainty Quantification Approach157
Self-Supervised Multimodal Learning: A Survey156
LMP-GAN: Out-of-Distribution Detection for Non-Control Data Malware Attacks156
Image-to-Image Translation With Disentangled Latent Vectors for Face Editing155
Interpretable Optimization-Inspired Unfolding Network for Low-Light Image Enhancement151
SVGDreamer++: Advancing Editability and Diversity in Text-Guided SVG Generation149
Rate-Distortion Theory in Coding for Machines and Its Applications149
Towards Pointsets Representation Learning via Self-Supervised Learning and Set Augmentation147
Discriminant Feature Extraction by Generalized Difference Subspace146
Knowledge-Based Embodied Question Answering144
Universal Image Segmentation with Efficiency140
Ensemble Multi-Quantiles: Adaptively Flexible Distribution Prediction for Uncertainty Quantification139
Matrix Completion via Non-Convex Relaxation and Adaptive Correlation Learning138
SPLiT: Single Portrait Lighting Estimation via a Tetrad of Face Intrinsics134
Dynamic Self-Supervised Teacher-Student Network Learning133
Bridging Actions: Generate 3D Poses and Shapes In-Between Photos132
Learning Graph Attentions via Replicator Dynamics132
GradMDM: Adversarial Attack on Dynamic Networks130
Influence Function Based Second-Order Channel Pruning: Evaluating True Loss Changes for Pruning is Possible Without Retraining130
Human Interaction Understanding With Consistency-Aware Learning129
Orientational Distribution Learning with Hierarchical Spatial Attention for Open Set Recognition127
Meta Invariance Defense Towards Generalizable Robustness to Unknown Adversarial Attacks127
Differential Viewpoints for Ground Terrain Material Recognition126
Robust Multimodal Learning With Missing Modalities via Parameter-Efficient Adaptation125
Asymmetric Loss Functions for Noise-Tolerant Learning: Theory and Applications123
Weakly Supervised Tracklet Association Learning With Video Labels for Person Re-Identification123
Unbiased Scene Graph Generation via Two-Stage Causal Modeling123
New Dataset and Methods for Fine-Grained Compositional Referring Expression Comprehension via Specialist-MLLM Collaboration122
Reduced-Rank Tensor-on-Tensor Regression and Tensor-Variate Analysis of Variance121
On Positive-Unlabeled Classification from Corrupted Data in GANs119
Enhancing Photorealism Enhancement118
Adaptive Transfer Kernel Learning for Transfer Gaussian Process Regression118
Self-Scalable Tanh (Stan): Multi-Scale Solutions for Physics-Informed Neural Networks117
AutoNovel: Automatically Discovering and Learning Novel Visual Categories117
Learning From Partially Labeled Data for Multi-Organ and Tumor Segmentation117
Revisiting Nonlocal Self-Similarity from Continuous Representation116
Correcting Optical Aberration via Depth-Aware Point Spread Functions115
Accurate and Efficient Stereo Matching via Attention Concatenation Volume115
HiGCIN: Hierarchical Graph-Based Cross Inference Network for Group Activity Recognition115
Random Permutation Set Reasoning114
Domain Generalization: A Survey114
Reconstruction Guided Meta-Learning for Few Shot Open Set Recognition113
Flare7K++: Mixing Synthetic and Real Datasets for Nighttime Flare Removal and Beyond113
Hypergraph-Based Multi-View Action Recognition Using Event Cameras113
Curriculum-Based Asymmetric Multi-Task Reinforcement Learning113
Differentially Private Graph Neural Networks for Whole-Graph Classification111
QDTrack: Quasi-Dense Similarity Learning for Appearance-Only Multiple Object Tracking110
VNVC: A Versatile Neural Video Coding Framework for Efficient Human-Machine Vision107
BNET: Batch Normalization With Enhanced Linear Transformation107
Fear-Neuro-Inspired Reinforcement Learning for Safe Autonomous Driving106
P2T: Pyramid Pooling Transformer for Scene Understanding106
Dataset Security for Machine Learning: Data Poisoning, Backdoor Attacks, and Defenses106
Spatial-Temporal Transformer for Video Snapshot Compressive Imaging105
Power Normalizations in Fine-Grained Image, Few-Shot Image and Graph Classification105
A Fully Automated Method for 3D Individual Tooth Identification and Segmentation in Dental CBCT103
MHF-Net: An Interpretable Deep Network for Multispectral and Hyperspectral Image Fusion103
Inter-Intra Hypergraph Computation for Survival Prediction on Whole Slide Images102
MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning100
A New Brain Network Construction Paradigm for Brain Disorder via Diffusion-Based Graph Contrastive Learning100
Dawn of the Transformer Era in Speech Emotion Recognition: Closing the Valence Gap100
Deep Learning for Face Anti-Spoofing: A Survey99
Variational Data-Free Knowledge Distillation for Continual Learning99
Deep Gait Recognition: A Survey99
GCoNet+: A Stronger Group Collaborative Co-Salient Object Detector99
From Simple to Complex Scenes: Learning Robust Feature Representations for Accurate Human Parsing99
ComputingEdge ad98
Advances and Challenges in Meta-Learning: A Technical Review98
A Style-Based Generator Architecture for Generative Adversarial Networks98
ONNXPruner: ONNX-Based General Model Pruning Adapter97
JointFormer: A Unified Framework With Joint Modeling for Video Object Segmentation97
Support Vector Machine Classifier via Soft-Margin Loss97
Human as Points: Explicit Point-Based 3D Human Reconstruction From Single-View RGB Images97
LCBM: A Multi-View Probabilistic Model for Multi-Label Classification97
luvHarris: A Practical Corner Detector for Event-Cameras97
Low-Shot Video Object Segmentation95
Multi-Modality Multi-Attribute Contrastive Pre-Training for Image Aesthetics Computing94
RGB-T Tracking With Template-Bridged Search Interaction and Target-Preserved Template Updating93
Dynamic Differential Image Circle Diameter Measurement Precision Assessment: Application to Burning Droplets93
Learning With Constraint Learning: New Perspective, Solution Strategy and Various Applications93
DeepMesh: Differentiable Iso-Surface Extraction93
Stimulative Training++: Go Beyond The Performance Limits of Residual Networks92
Semi-Supervised Learning for FGVC With Out-of-Category Data92
ModeRNN: Harnessing Spatiotemporal Mode Collapse in Unsupervised Predictive Learning92
FNA++: Fast Network Adaptation via Parameter Remapping and Architecture Search92
TN-ZSTAD: Transferable Network for Zero-Shot Temporal Activity Detection91
Homeomorphism Prior for False Positive and Negative Problem in Medical Image Dense Contrastive Representation Learning91
Relationship Quantification of Image Degradations91
Continual Unsupervised Generative Modeling91
Joint Framework for Single Image Reconstruction and Super-Resolution With an Event Camera91
Improving Machine Vision Using Human Perceptual Representations: The Case of Planar Reflection Symmetry for Object Classification91
Cover 389
Distributionally Location-Aware Transferable Adversarial Patches for Facial Images89
The Cluster Structure Function87
Video DataFlywheel: Resolving the Impossible Data Trinity in Video-Language Understanding87
Editorial: Special Section on Egocentric Perception87
An Energy-Based Prior for Generative Saliency86
Reusable Architecture Growth for Continual Stereo Matching86
CC4S: Encouraging Certainty and Consistency in Scribble-Supervised Semantic Segmentation86
Scale Propagation Network for Generalizable Depth Completion86
Test-time Training for Hyperspectral Image Super-resolution85
3D Visual Saliency: An Independent Perceptual Measure or A Derivative of 2D Image Saliency?85
Generalized Task-Driven Medical Image Quality Enhancement With Gradient Promotion85
Supervision by Denoising85
Heterogeneous Feature Re-Sampling for Balanced Pedestrian Attribute Recognition85
Deep Learning on Object-Centric 3D Neural Fields84
TE141K: Artistic Text Benchmark for Text Effect Transfer84
Privacy-Preserving Deep Action Recognition: An Adversarial Learning Framework and A New Dataset84
A Thorough Benchmark and a New Model for Light Field Saliency Detection84
On the Universal Approximation Properties of Deep Neural Networks Using MAM Neurons84
GhostingNet: A Novel Approach for Glass Surface Detection With Ghosting Cues83
FreeFusion: Infrared and Visible Image Fusion via Cross Reconstruction Learning83
Modeling the Label Distributions for Weakly-Supervised Semantic Segmentation83
MoBluRF: Motion Deblurring Neural Radiance Fields for Blurry Monocular Video83
Equivalent Classification Mapping for Weakly Supervised Temporal Action Localization82
Rolling Shutter Homography and its Applications82
CycMuNet+: Cycle-Projected Mutual Learning for Spatial-Temporal Video Super-Resolution80
Compositional Physical Reasoning of Objects and Events from Videos80
Learning to Super-Resolve Blurry Images With Events80
Reframing Neural Networks: Deep Structure in Overcomplete Representations80
Reinforcing Generated Images via Meta-Learning for One-Shot Fine-Grained Visual Recognition79
RED++ : Data-Free Pruning of Deep Neural Networks via Input Splitting and Output Merging79
Recurrent Neural Networks for Snapshot Compressive Imaging79
AutoEval: Are Labels Always Necessary for Classifier Accuracy Evaluation?79
Noisy Label Learning With Provable Consistency for a Wider Family of Losses79
Pair Then Relation: Pair-Net for Panoptic Scene Graph Generation79
$\mathcal {X}$-Metric: An N-Dimensional Information-Theoretic Framework for Groupwise Registration and Deep Combined Computing79
S $^{2}$ O: Enhancing Adversarial Training with Second-Order Statistics of Weights78
Adversarially Robust Neural Architectures78
Cascaded Dynamic Memory Refinement and Semantic Alignment for Exo-to-Ego Cross-View Video Generation78
SS-TBN: A Semi-Supervised Tri-Branch Network for COVID-19 Screening and Lesion Segmentation78
Disentangled Representation Learning77
Deciphering the Feature Representation of Deep Neural Networks for High-Performance AI77
Interpolated Joint Space Adversarial Training for Robust and Generalizable Defenses76
Any Fashion Attribute Editing: Dataset and Pretrained Models75
Progressive Instance-Aware Feature Learning for Compositional Action Recognition75
Conformal Prediction for Time Series75
Map-Guided Curriculum Domain Adaptation and Uncertainty-Aware Evaluation for Semantic Nighttime Image Segmentation74
The Bayesian Cut74
STAR-FC: Structure-Aware Face Clustering on Ultra-Large-Scale Graphs74
Analysis of the Hands in Egocentric Vision: A Survey74
Revealing the Dark Side of Non-Local Attention in Single Image Super-Resolution74
Adaptive Perspective Distillation for Semantic Segmentation73
Point Spatio-Temporal Transformer Networks for Point Cloud Video Modeling73
MB-TaylorFormer V2: Improved Multi-Branch Linear Transformer Expanded by Taylor Formula for Image Restoration73
Optimizing Regularized Cholesky Score for Order-Based Learning of Bayesian Networks73
Semantic Object Accuracy for Generative Text-to-Image Synthesis73
Unified Adversarial Patch for Visible-Infrared Cross-Modal Attacks in the Physical World73
IEEE Computer Society Has You Covered!72
Pixel Distillation: Cost-Flexible Distillation Across Image Sizes and Heterogeneous Networks72
Cover 372
Single Image Deraining: From Model-Based to Data-Driven and Beyond72
Table of Contents72
0.048265933990479