IEEE Transactions on Pattern Analysis and Machine Intelligence

Papers
(The median citation count of IEEE Transactions on Pattern Analysis and Machine Intelligence is 7. The table below lists the papers above that threshold, based on CrossRef citation counts and capped at 250 papers. It covers papers published in the past four years, i.e., from 2021-10-01 to 2025-10-01.)
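The selection rule described above (publication window, median threshold, cap of 250) can be sketched as follows. This is a minimal illustration, not the site's actual implementation: the function name `select_papers`, the tuple layout, and the toy records are all hypothetical, and it assumes the median is taken over the papers inside the publication window, which the page does not state.

```python
from datetime import date
from statistics import median


def select_papers(papers, start, end, max_papers=250):
    """Return papers above the median citation count, most cited first.

    papers: list of (title, citations, published) tuples (hypothetical layout).
    Assumption: the median is computed over papers published in [start, end].
    """
    # Keep only papers published inside the window.
    in_window = [p for p in papers if start <= p[2] <= end]
    if not in_window:
        return []
    # Threshold is the median citation count of the windowed papers.
    threshold = median(p[1] for p in in_window)
    # Strictly above the threshold, sorted by citations in descending order.
    above = sorted(
        (p for p in in_window if p[1] > threshold),
        key=lambda p: p[1],
        reverse=True,
    )
    # Cap the list length.
    return above[:max_papers]


# Toy records for illustration only; these are not real papers or counts.
toy = [
    ("Paper A", 1, date(2022, 1, 1)),
    ("Paper B", 7, date(2023, 1, 1)),
    ("Paper C", 20, date(2024, 1, 1)),
    ("Paper D", 50, date(2020, 1, 1)),  # outside the window
    ("Paper E", 9, date(2025, 1, 1)),
]
selected = select_papers(toy, date(2021, 10, 1), date(2025, 10, 1))
```

With the toy records, only the entries published inside the window contribute to the median, so a highly cited paper from before the window does not appear in the result.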
Article | Citations
Learn to Predict Sets Using Feed-Forward Neural Networks | 3038
Task-Oriented Channel Attention for Fine-Grained Few-Shot Classification | 2731
Modeling Noisy Annotations for Point-Wise Supervision | 1527
[Back cover - Table of contents, continued] | 1503
Front Cover | 1400
Vertical Layering of Quantized Neural Networks for Heterogeneous Inference | 1392
Self-Supervised Skeleton Representation Learning via Actionlet Contrast and Reconstruct | 1390
A Generative Model for Generic Light Field Reconstruction | 1265
Separable Spatial-Temporal Residual Graph for Cloth-Changing Group Re-Identification | 1263
MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation Segmentation | 1059
Centerless Clustering | 1007
Video Demoireing using Focused-Defocused Dual-Camera System | 892
Motion-Aware Dynamic Graph Neural Network for Video Compressive Sensing | 722
One-for-All: Towards Universal Domain Translation With a Single StyleGAN | 690
Deep Non-Rigid Structure From Motion With Missing Data | 649
Affective Image Content Analysis: Two Decades Review and New Perspectives | 611
Simplicial Complex Neural Networks | 583
Symbolic Visual Reinforcement Learning: A Scalable Framework With Object-Level Abstraction and Differentiable Expression Search | 537
Learning Graph Convolutional Networks for Multi-Label Recognition and Applications | 523
A Survey on Deep Learning Techniques for Stereo-Based Depth Estimation | 509
Principal Uncertainty Quantification With Spatial Correlation for Image Restoration Problems | 506
Locating and Counting Heads in Crowds With a Depth Prior | 506
Invariant Policy Learning: A Causal Perspective | 470
Quadratic Matrix Factorization With Applications to Manifold Learning | 453
Enhancing Representations Through Heterogeneous Self-Supervised Learning | 445
A Hybrid Stochastic-Deterministic Minibatch Proximal Gradient Method for Efficient Optimization and Generalization | 445
Unsupervised Domain Adaptation via Discriminative Manifold Propagation | 441
Ensemble-Enhanced Semi-Supervised Learning With Optimized Graph Construction for High-Dimensional Data | 439
BiBBDM: Bidirectional Image Translation with Brownian Bridge Diffusion Models | 427
Physics-Informed Guided Disentanglement in Generative Networks | 411
Adaptive Surface Normal Constraint for Geometric Estimation From Monocular Images | 410
Sparse-to-Dense Matching Network for Large-Scale LiDAR Point Cloud Registration | 402
Learning With Style: Continual Semantic Segmentation Across Tasks and Domains | 400
Prior Image Guided Snapshot Compressive Spectral Imaging | 385
Weakly Supervised Semantic Segmentation via Box-Driven Masking and Filling Rate Shifting | 376
Detection-Friendly Dehazing: Object Detection in Real-World Hazy Scenes | 376
OPAL: Occlusion Pattern Aware Loss for Unsupervised Light Field Disparity Estimation | 374
Learning Signed Hyper Surfaces for Oriented Point Cloud Normal Estimation | 372
Asymmetric Convolution: An Efficient and Generalized Method to Fuse Feature Maps in Multiple Vision Tasks | 367
Point-to-Pixel Prompting for Point Cloud Analysis With Pre-Trained Image Models | 357
Towards Accurate and Compact Architectures via Neural Architecture Transformer | 339
Interactive NeRF Geometry Editing With Shape Priors | 336
Graph Convolutional Module for Temporal Action Localization in Videos | 335
ResNet-LDDMM: Advancing the LDDMM Framework using Deep Residual Networks | 332
Active Supervised Cross-Modal Retrieval | 326
Implicit Annealing in Kernel Spaces: A Strongly Consistent Clustering Approach | 324
Instance Shadow Detection with A Single-Stage Detector | 320
Towards Unified Deep Image Deraining: A Survey and a New Benchmark | 320
On the Trade-Off Between Flatness and Optimization in Distributed Learning | 318
Event-Based Photometric Bundle Adjustment | 313
Analysis of Video Quality Datasets via Design of Minimalistic Video Quality Models | 312
VATr++: Choose Your Words Wisely for Handwritten Text Generation | 303
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting | 297
Learning to Guide a Saturation-Based Theorem Prover | 295
Interaction-Based Inductive Bias in Graph Neural Networks: Enhancing Protein-Ligand Binding Affinity Predictions From 3D Structures | 295
Metrics for Dataset Demographic Bias: A Case Study on Facial Expression Recognition | 292
Optimization-Based Post-Training Quantization With Bit-Split and Stitching | 290
DAQE: Enhancing the Quality of Compressed Images by Exploiting the Inherent Characteristic of Defocus | 286
Omni-Training: Bridging Pre-Training and Meta-Training for Few-Shot Learning | 277
Multi-Dataset, Multitask Learning of Egocentric Vision Tasks | 264
A Comprehensive and Modularized Statistical Framework for Gradient Norm Equality in Deep Neural Networks | 258
S-NeRF++: Autonomous Driving Simulation via Neural Reconstruction and Generation | 249
A Clustering Validity Index With Multi-Granularity Fusion for Multiple Fuzzy Clustering Algorithms | 247
Structure-Preserving Image Super-Resolution | 247
Multiple Video Frame Interpolation via Enhanced Deformable Separable Convolution | 245
Multiview Clustering: A Scalable and Parameter-Free Bipartite Graph Fusion Method | 242
Probing Synergistic High-Order Interaction for Multi-Modal Image Fusion | 241
DVIS++: Improved Decoupled Framework for Universal Video Segmentation | 240
Digging Into Uncertainty-Based Pseudo-Label for Robust Stereo Matching | 231
Are Graph Convolutional Networks With Random Weights Feasible? | 231
Deep Long-Tailed Learning: A Survey | 225
Guaranteed Tensor Recovery Fused Low-rankness and Smoothness | 224
Transformer-Based Visual Segmentation: A Survey | 221
Face Forgery Detection by 3D Decomposition and Composition Search | 218
Inferring Point Cloud Quality via Graph Similarity | 213
Face Generation and Editing With StyleGAN: A Survey | 212
Towards Expressive Spectral-Temporal Graph Neural Networks for Time Series Forecasting | 211
Fast Component Tree Computation for Images of Limited Levels | 202
Human-Centric Transformer for Domain Adaptive Action Recognition | 200
Cover 2 | 198
Cover | 197
Cover | 197
Table of Contents | 196
IEEE Computer Society Has You Covered! | 196
TPAMI Information for Authors | 195
A Variational EM Acceleration for Efficient Clustering at Very Large Scales | 195
MoIL: Momentum Imitation Learning for Efficient Vision-Language Adaptation | 192
Discriminant Feature Extraction by Generalized Difference Subspace | 192
Towards Pointsets Representation Learning via Self-Supervised Learning and Set Augmentation | 190
LMP-GAN: Out-of-Distribution Detection for Non-Control Data Malware Attacks | 190
SVGDreamer++: Advancing Editability and Diversity in Text-Guided SVG Generation | 190
On Positive-Unlabeled Classification From Corrupted Data in GANs | 189
RF-Next: Efficient Receptive Field Search for Convolutional Neural Networks | 187
Rate-Distortion Theory in Coding for Machines and Its Applications | 186
Influence Function Based Second-Order Channel Pruning: Evaluating True Loss Changes for Pruning is Possible Without Retraining | 186
InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis with Semantic Graph Prior | 182
Ensemble Multi-Quantiles: Adaptively Flexible Distribution Prediction for Uncertainty Quantification | 180
Curriculum-Based Asymmetric Multi-Task Reinforcement Learning | 179
Power Normalizations in Fine-Grained Image, Few-Shot Image and Graph Classification | 178
SPARE: Symmetrized Point-to-Plane Distance for Robust Non-Rigid 3D Registration | 177
Towards Deviation-Robust Agent Navigation via Perturbation-Aware Contrastive Learning | 174
Reduced-Rank Tensor-on-Tensor Regression and Tensor-Variate Analysis of Variance | 173
Revisiting Transferable Adversarial Images: Systemization, Evaluation, and New Insights | 171
Reconstruction Guided Meta-Learning for Few Shot Open Set Recognition | 171
AutoNovel: Automatically Discovering and Learning Novel Visual Categories | 167
Orientational Distribution Learning with Hierarchical Spatial Attention for Open Set Recognition | 165
Graph-oriented Instruction Tuning of Large Language Models for Generic Graph Mining | 161
Inter-Intra Hypergraph Computation for Survival Prediction on Whole Slide Images | 161
Scalable Optimal Transport Methods in Machine Learning: A Contemporary Survey | 161
Out-of-Domain Generalization From a Single Source: An Uncertainty Quantification Approach | 159
HiGCIN: Hierarchical Graph-Based Cross Inference Network for Group Activity Recognition | 155
M3D: a Multimodal, Multilingual and Multitask Dataset for Grounded Document-level Information Extraction | 154
Universal Image Segmentation With Efficiency | 153
Temporal Feature Matters: A Framework for Diffusion Model Quantization | 152
New Dataset and Methods for Fine-Grained Compositional Referring Expression Comprehension via Specialist-MLLM Collaboration | 152
Robust Multimodal Learning With Missing Modalities via Parameter-Efficient Adaptation | 152
BNET: Batch Normalization With Enhanced Linear Transformation | 149
On the Robustness of Average Losses for Partial-Label Learning | 146
Dynamic Self-Supervised Teacher-Student Network Learning | 146
SPLiT: Single Portrait Lighting Estimation via a Tetrad of Face Intrinsics | 144
Bridging Actions: Generate 3D Poses and Shapes In-Between Photos | 144
Deep Learning-Based Point Cloud Compression: An In-Depth Survey and Benchmark | 142
Self-Scalable Tanh (Stan): Multi-Scale Solutions for Physics-Informed Neural Networks | 142
Compositional Scene Representation Learning via Reconstruction: A Survey | 142
Image Lens Flare Removal Using Adversarial Curve Learning | 141
Meta Invariance Defense Towards Generalizable Robustness to Unknown Adversarial Attacks | 141
GenPoly: Learning Generalized and Tessellated Shape Priors via 3D Polymorphic Evolving | 141
Correcting Optical Aberration via Depth-Aware Point Spread Functions | 140
Learning Graph Attentions via Replicator Dynamics | 138
Human Interaction Understanding With Consistency-Aware Learning | 138
VNVC: A Versatile Neural Video Coding Framework for Efficient Human-Machine Vision | 138
GCoNet+: A Stronger Group Collaborative Co-Salient Object Detector | 136
GradMDM: Adversarial Attack on Dynamic Networks | 134
Learning to See Through With Events | 133
Adaptive Transfer Kernel Learning for Transfer Gaussian Process Regression | 133
Spatial-Temporal Transformer for Video Snapshot Compressive Imaging | 133
Flare7K++: Mixing Synthetic and Real Datasets for Nighttime Flare Removal and Beyond | 131
Differential Viewpoints for Ground Terrain Material Recognition | 131
Evaluation for Weakly Supervised Object Localization: Protocol, Metrics, and Datasets | 130
Enhancing Photorealism Enhancement | 130
Learning Efficient Meshflow and Optical Flow from Event Cameras | 130
Matrix Completion via Non-Convex Relaxation and Adaptive Correlation Learning | 129
Interpretable Optimization-Inspired Unfolding Network for Low-Light Image Enhancement | 127
Learning From Partially Labeled Data for Multi-Organ and Tumor Segmentation | 126
Hypergraph-Based Multi-View Action Recognition Using Event Cameras | 126
Differentially Private Graph Neural Networks for Whole-Graph Classification | 124
Asymmetric Loss Functions for Noise-Tolerant Learning: Theory and Applications | 122
PathNet: Path-Selective Point Cloud Denoising | 122
Self-Supervised Multimodal Learning: A Survey | 120
A New Brain Network Construction Paradigm for Brain Disorder via Diffusion-Based Graph Contrastive Learning | 119
Fear-Neuro-Inspired Reinforcement Learning for Safe Autonomous Driving | 117
Variational Data-Free Knowledge Distillation for Continual Learning | 117
Unbiased Scene Graph Generation via Two-Stage Causal Modeling | 117
Deep Gait Recognition: A Survey | 116
Deep Learning for Face Anti-Spoofing: A Survey | 116
Random Permutation Set Reasoning | 116
Revisiting Nonlocal Self-Similarity from Continuous Representation | 115
MHF-Net: An Interpretable Deep Network for Multispectral and Hyperspectral Image Fusion | 115
Image-to-Image Translation With Disentangled Latent Vectors for Face Editing | 113
Dataset Security for Machine Learning: Data Poisoning, Backdoor Attacks, and Defenses | 112
MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning | 111
Weakly Supervised Tracklet Association Learning With Video Labels for Person Re-Identification | 111
Knowledge-Based Embodied Question Answering | 111
Domain Generalization: A Survey | 111
Dawn of the Transformer Era in Speech Emotion Recognition: Closing the Valence Gap | 110
A Style-Based Generator Architecture for Generative Adversarial Networks | 110
To Fold or Not to Fold: Graph Regularized Tensor Train for Visual Data Completion | 109
QDTrack: Quasi-Dense Similarity Learning for Appearance-Only Multiple Object Tracking | 108
Accurate and Efficient Stereo Matching via Attention Concatenation Volume | 107
Advances and Challenges in Meta-Learning: A Technical Review | 107
MRA-Net: Improving VQA Via Multi-Modal Relation Attention Network | 107
From Simple to Complex Scenes: Learning Robust Feature Representations for Accurate Human Parsing | 106
A Fully Automated Method for 3D Individual Tooth Identification and Segmentation in Dental CBCT | 106
ComputingEdge ad | 106
P2T: Pyramid Pooling Transformer for Scene Understanding | 106
STAR-FC: Structure-Aware Face Clustering on Ultra-Large-Scale Graphs | 105
Relationship Quantification of Image Degradations | 104
The Bayesian Cut | 104
Stimulative Training++: Go Beyond the Performance Limits of Residual Networks | 103
Continual Unsupervised Generative Modeling | 103
Deciphering the Feature Representation of Deep Neural Networks for High-Performance AI | 102
Towards Reliable and Faithful Explanations: A Disentanglement-Augmented Approach for Selective Rationalization | 102
Learning With Constraint Learning: New Perspective, Solution Strategy and Various Applications | 102
Probabilistic Directed Distance Fields for Ray-Based Shape Representations | 102
Homeomorphism Prior for False Positive and Negative Problem in Medical Image Dense Contrastive Representation Learning | 101
Semi-Supervised Learning for FGVC With Out-of-Category Data | 101
Low-Shot Video Object Segmentation | 100
JointFormer: A Unified Framework With Joint Modeling for Video Object Segmentation | 99
Dynamic Differential Image Circle Diameter Measurement Precision Assessment: Application to Burning Droplets | 99
Cover 3 | 99
Multi-Modality Multi-Attribute Contrastive Pre-Training for Image Aesthetics Computing | 99
The Cluster Structure Function | 98
An Energy-Based Prior for Generative Saliency | 98
Video DataFlywheel: Resolving the Impossible Data Trinity in Video-Language Understanding | 98
Editorial: Special Section on Egocentric Perception | 98
Pixel Distillation: Cost-Flexible Distillation Across Image Sizes and Heterogeneous Networks | 97
Reusable Architecture Growth for Continual Stereo Matching | 97
Scale Propagation Network for Generalizable Depth Completion | 97
Compositional Physical Reasoning of Objects and Events From Videos | 96
Deep Learning on Object-Centric 3D Neural Fields | 95
TE141K: Artistic Text Benchmark for Text Effect Transfer | 95
Generalized Task-Driven Medical Image Quality Enhancement With Gradient Promotion | 95
GhostingNet: A Novel Approach for Glass Surface Detection With Ghosting Cues | 95
Learning to Super-Resolve Blurry Images With Events | 95
AutoEval: Are Labels Always Necessary for Classifier Accuracy Evaluation? | 94
MoBluRF: Motion Deblurring Neural Radiance Fields for Blurry Monocular Video | 94
Modeling the Label Distributions for Weakly-Supervised Semantic Segmentation | 94
FreeFusion: Infrared and Visible Image Fusion via Cross Reconstruction Learning | 94
On the Universal Approximation Properties of Deep Neural Networks Using MAM Neurons | 93
Point Spatio-Temporal Transformer Networks for Point Cloud Video Modeling | 93
Test-Time Training for Hyperspectral Image Super-Resolution | 93
Cascaded Dynamic Memory Refinement and Semantic Alignment for Exo-to-Ego Cross-View Video Generation | 93
GLC++: Source-Free Universal Domain Adaptation through Global-Local Clustering and Contrastive Affinity Learning | 93
Compositional Generative Model of Unbounded 4D Cities | 92
ONNXPruner: ONNX-Based General Model Pruning Adapter | 91
Optimizing Regularized Cholesky Score for Order-Based Learning of Bayesian Networks | 90
Hypergraph-Based High-Order Correlation Analysis for Large-Scale Long-Tailed Data Classification | 90
Distributionally Location-Aware Transferable Adversarial Patches for Facial Images | 90
Heterogeneous Feature Re-Sampling for Balanced Pedestrian Attribute Recognition | 89
Adaptive Perspective Distillation for Semantic Segmentation | 89
Improving Machine Vision Using Human Perceptual Representations: The Case of Planar Reflection Symmetry for Object Classification | 89
SKDF: A Simple Knowledge Distillation Framework for Distilling Open-Vocabulary Knowledge to Open-world Object Detector | 89
PRANCE: Joint Token-Optimization and Structural Channel-Pruning for Adaptive ViT Inference | 88
PMGT-VR: a Decentralized Proximal-gradient Algorithmic Framework with Variance Reduction | 88
WildVideo: Benchmarking LMMs for Understanding Video-Language Interaction | 87
Reinforcing Generated Images via Meta-Learning for One-Shot Fine-Grained Visual Recognition | 87
Any Fashion Attribute Editing: Dataset and Pretrained Models | 87
Adversarially Robust Neural Architectures | 87
Semantic Object Accuracy for Generative Text-to-Image Synthesis | 86
CC4S: Encouraging Certainty and Consistency in Scribble-Supervised Semantic Segmentation | 86
TN-ZSTAD: Transferable Network for Zero-Shot Temporal Activity Detection | 86
Unified Modality Separation: A Vision-Language Framework for Unsupervised Domain Adaptation | 86
3D Visual Saliency: An Independent Perceptual Measure or a Derivative of 2D Image Saliency? | 86
luvHarris: A Practical Corner Detector for Event-Cameras | 86
Self-Guidance: Boosting Flow and Diffusion Generation on Their Own | 85
SS-NeRF: Physically Based Sparse Spectral Rendering with Neural Radiance Field | 85
Human as Points: Explicit Point-Based 3D Human Reconstruction From Single-View RGB Images | 85
Pair Then Relation: Pair-Net for Panoptic Scene Graph Generation | 85
RGB-T Tracking With Template-Bridged Search Interaction and Target-Preserved Template Updating | 85
Equivalent Classification Mapping for Weakly Supervised Temporal Action Localization | 84
Recurrent Neural Networks for Snapshot Compressive Imaging | 83
DeepMesh: Differentiable Iso-Surface Extraction | 83
Support Vector Machine Classifier via Soft-Margin Loss | 83
Conformal Prediction for Time Series | 82
Privacy-Preserving Deep Action Recognition: An Adversarial Learning Framework and A New Dataset | 82
ModeRNN: Harnessing Spatiotemporal Mode Collapse in Unsupervised Predictive Learning | 82
Noisy Label Learning With Provable Consistency for a Wider Family of Losses | 82
Reframing Neural Networks: Deep Structure in Overcomplete Representations | 82
Joint Framework for Single Image Reconstruction and Super-Resolution With an Event Camera | 82
RED++: Data-Free Pruning of Deep Neural Networks via Input Splitting and Output Merging | 81