IEEE Transactions on Pattern Analysis and Machine Intelligence

Papers
(The TQCC of IEEE Transactions on Pattern Analysis and Machine Intelligence is 24. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-11-01 to 2025-11-01.)
ArticleCitations
[Back cover - Table of contents, continued]2904
Front Cover1584
Vertical Layering of Quantized Neural Networks for Heterogeneous Inference1535
Learn to Predict Sets Using Feed-Forward Neural Networks1471
Task-Oriented Channel Attention for Fine-Grained Few-Shot Classification1449
Modeling Noisy Annotations for Point-Wise Supervision1436
Video Demoireing using Focused-Defocused Dual-Camera System1283
One-for-All: Towards Universal Domain Translation With a Single StyleGAN1095
Principal Uncertainty Quantification With Spatial Correlation for Image Restoration Problems1078
Symbolic Visual Reinforcement Learning: A Scalable Framework With Object-Level Abstraction and Differentiable Expression Search931
Probing Synergistic High-Order Interaction for Multi-Modal Image Fusion748
S-NeRF++: Autonomous Driving Simulation via Neural Reconstruction and Generation715
Physics-Informed Guided Disentanglement in Generative Networks686
Multi-Dataset, Multitask Learning of Egocentric Vision Tasks603
Towards Accurate and Compact Architectures via Neural Architecture Transformer555
A Generative Model for Generic Light Field Reconstruction541
Face Generation and Editing With StyleGAN: A Survey541
Asymmetric Convolution: An Efficient and Generalized Method to Fuse Feature Maps in Multiple Vision Tasks536
MECD+ : Unlocking Event-Level Causal Graph Discovery for Video Reasoning519
Metrics for Dataset Demographic Bias: A Case Study on Facial Expression Recognition485
Sparse-to-Dense Matching Network for Large-Scale LiDAR Point Cloud Registration475
Structure-Preserving Image Super-Resolution471
Face Forgery Detection by 3D Decomposition and Composition Search460
Transformer-Based Visual Segmentation: A Survey446
Inferring Point Cloud Quality via Graph Similarity446
Detection-Friendly Dehazing: Object Detection in Real-World Hazy Scenes442
Affective Image Content Analysis: Two Decades Review and New Perspectives432
A Survey on Deep Learning Techniques for Stereo-Based Depth Estimation430
A Clustering Validity Index With Multi-Granularity Fusion for Multiple Fuzzy Clustering Algorithms425
ResNet-LDDMM: Advancing the LDDMM Framework using Deep Residual Networks411
Active Supervised Cross-Modal Retrieval398
DAQE: Enhancing the Quality of Compressed Images by Exploiting the Inherent Characteristic of Defocus391
Event-Based Photometric Bundle Adjustment387
On the Trade-Off Between Flatness and Optimization in Distributed Learning382
Instance Shadow Detection with A Single-Stage Detector380
Omni-Training: Bridging Pre-Training and Meta-Training for Few-Shot Learning360
VATr++: Choose Your Words Wisely for Handwritten Text Generation355
Implicit Annealing in Kernel Spaces: A Strongly Consistent Clustering Approach354
Interactive NeRF Geometry Editing With Shape Priors352
A Hybrid Stochastic-Deterministic Minibatch Proximal Gradient Method for Efficient Optimization and Generalization350
Learning Signed Hyper Surfaces for Oriented Point Cloud Normal Estimation350
Enhancing Representations Through Heterogeneous Self-Supervised Learning349
Quadratic Matrix Factorization With Applications to Manifold Learning343
Invariant Policy Learning: A Causal Perspective340
Learning With Style: Continual Semantic Segmentation Across Tasks and Domains339
Centerless Clustering332
OPAL: Occlusion Pattern Aware Loss for Unsupervised Light Field Disparity Estimation324
Deep Non-Rigid Structure From Motion With Missing Data322
Ensemble-Enhanced Semi-Supervised Learning With Optimized Graph Construction for High-Dimensional Data316
Adaptive Surface Normal Constraint for Geometric Estimation From Monocular Images309
A Comprehensive and Modularized Statistical Framework for Gradient Norm Equality in Deep Neural Networks307
Unsupervised Domain Adaptation via Discriminative Manifold Propagation301
Simplicial Complex Neural Networks300
Graph Convolutional Module for Temporal Action Localization in Videos300
Learning Graph Convolutional Networks for Multi-Label Recognition and Applications299
Editorial: Introduction to the Special Section on Best of CVPR'2022272
Towards Unified Deep Image Deraining: A Survey and a New Benchmark272
Self-Supervised Skeleton Representation Learning Via Actionlet Contrast and Reconstruct266
BiBBDM: Bidirectional Image Translation With Brownian Bridge Diffusion Models259
Locating and Counting Heads in Crowds With a Depth Prior255
Point-to-Pixel Prompting for Point Cloud Analysis With Pre-Trained Image Models250
Learning to Guide a Saturation-Based Theorem Prover248
Are Graph Convolutional Networks With Random Weights Feasible?248
Interaction-Based Inductive Bias in Graph Neural Networks: Enhancing Protein-Ligand Binding Affinity Predictions From 3D Structures245
Motion-Aware Dynamic Graph Neural Network for Video Compressive Sensing245
DVIS++: Improved Decoupled Framework for Universal Video Segmentation237
MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation Segmentation234
Digging Into Uncertainty-Based Pseudo-Label for Robust Stereo Matching232
Separable Spatial-Temporal Residual Graph for Cloth-Changing Group Re-Identification231
Prior Image Guided Snapshot Compressive Spectral Imaging227
Multiple Video Frame Interpolation via Enhanced Deformable Separable Convolution222
Deep Long-Tailed Learning: A Survey220
Revisiting Transformation Invariant Geometric Deep Learning: An Initial Representation Perspective219
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting218
Optimization-Based Post-Training Quantization With Bit-Split and Stitching212
Multiview Clustering: A Scalable and Parameter-Free Bipartite Graph Fusion Method210
Analysis of Video Quality Datasets via Design of Minimalistic Video Quality Models207
Weakly Supervised Semantic Segmentation via Box-Driven Masking and Filling Rate Shifting206
Towards Expressive Spectral-Temporal Graph Neural Networks for Time Series Forecasting206
Guaranteed Tensor Recovery Fused Low-rankness and Smoothness206
IEEE Computer Society Has You Covered!204
Cover203
Table of Contents202
Cover202
Cover 2201
Fast Component Tree Computation for Images of Limited Levels200
BNET: Batch Normalization With Enhanced Linear Transformation197
Human-Centric Transformer for Domain Adaptive Action Recognition197
Universal Image Segmentation With Efficiency196
New Dataset and Methods for Fine-Grained Compositional Referring Expression Comprehension via Specialist-MLLM Collaboration196
M3D: a Multimodal, Multilingual and Multitask Dataset for Grounded Document-level Information Extraction194
Temporal Feature Matters: A Framework for Diffusion Model Quantization194
Graph-oriented Instruction Tuning of Large Language Models for Generic Graph Mining191
Spatial-Temporal Transformer for Video Snapshot Compressive Imaging188
Learning to See Through With Events186
Revisiting Transferable Adversarial Images: Systemization, Evaluation, and New Insights184
SPARE: Symmetrized Point-to-Plane Distance for Robust Non-Rigid 3D Registration183
InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis with Semantic Graph Prior179
Rate-Distortion Theory in Coding for Machines and Its Applications178
Towards Deviation-Robust Agent Navigation via Perturbation-Aware Contrastive Learning174
MoIL: Momentum Imitation Learning for Efficient Vision-Language Adaptation172
Towards Pointsets Representation Learning via Self-Supervised Learning and Set Augmentation172
LMP-GAN: Out-of-Distribution Detection for Non-Control Data Malware Attacks170
On Positive-Unlabeled Classification From Corrupted Data in GANs169
Learning Graph Attentions via Replicator Dynamics168
Image Lens Flare Removal Using Adversarial Curve Learning163
Meta Invariance Defense Towards Generalizable Robustness to Unknown Adversarial Attacks163
Human Interaction Understanding With Consistency-Aware Learning163
GradMDM: Adversarial Attack on Dynamic Networks161
Bridging Actions: Generate 3D Poses and Shapes In-Between Photos160
Revisiting Nonlocal Self-Similarity from Continuous Representation159
Evaluation for Weakly Supervised Object Localization: Protocol, Metrics, and Datasets158
GenPoly: Learning Generalized and Tessellated Shape Priors via 3D Polymorphic Evolving156
Enhancing Photorealism Enhancement156
Knowledge-Based Embodied Question Answering155
To Fold or Not to Fold: Graph Regularized Tensor Train for Visual Data Completion154
Ensemble Multi-Quantiles: Adaptively Flexible Distribution Prediction for Uncertainty Quantification151
GCoNet+: A Stronger Group Collaborative Co-Salient Object Detector149
A New Brain Network Construction Paradigm for Brain Disorder via Diffusion-Based Graph Contrastive Learning148
Learning Efficient Meshflow and Optical Flow from Event Cameras146
Out-of-Domain Generalization From a Single Source: An Uncertainty Quantification Approach146
PathNet: Path-Selective Point Cloud Denoising145
Adaptive Transfer Kernel Learning for Transfer Gaussian Process Regression143
Differential Viewpoints for Ground Terrain Material Recognition143
Self-Scalable Tanh (Stan): Multi-Scale Solutions for Physics-Informed Neural Networks142
SPLiT: Single Portrait Lighting Estimation via a Tetrad of Face Intrinsics141
Deep Learning-Based Point Cloud Compression: An In-Depth Survey and Benchmark140
Reduced-Rank Tensor-on-Tensor Regression and Tensor-Variate Analysis of Variance140
Matrix Completion via Non-Convex Relaxation and Adaptive Correlation Learning138
Power Normalizations in Fine-Grained Image, Few-Shot Image and Graph Classification137
A Variational EM Acceleration for Efficient Clustering at Very Large Scales136
Weakly Supervised Tracklet Association Learning With Video Labels for Person Re-Identification136
Correcting Optical Aberration via Depth-Aware Point Spread Functions133
Accurate and Efficient Stereo Matching via Attention Concatenation Volume133
Orientational Distribution Learning with Hierarchical Spatial Attention for Open Set Recognition133
On the Robustness of Average Losses for Partial-Label Learning133
RF-Next: Efficient Receptive Field Search for Convolutional Neural Networks132
Differentially Private Graph Neural Networks for Whole-Graph Classification132
Compositional Scene Representation Learning via Reconstruction: A Survey128
Image-to-Image Translation With Disentangled Latent Vectors for Face Editing127
VNVC: A Versatile Neural Video Coding Framework for Efficient Human-Machine Vision126
Dawn of the Transformer Era in Speech Emotion Recognition: Closing the Valence Gap126
Scalable Optimal Transport Methods in Machine Learning: A Contemporary Survey125
Hypergraph-Based Multi-View Action Recognition Using Event Cameras124
Flare7K++: Mixing Synthetic and Real Datasets for Nighttime Flare Removal and Beyond123
Curriculum-Based Asymmetric Multi-Task Reinforcement Learning122
Robust Multimodal Learning With Missing Modalities via Parameter-Efficient Adaptation122
Dynamic Self-Supervised Teacher-Student Network Learning121
Asymmetric Loss Functions for Noise-Tolerant Learning: Theory and Applications120
Unbiased Scene Graph Generation via Two-Stage Causal Modeling120
Reconstruction Guided Meta-Learning for Few Shot Open Set Recognition119
Random Permutation Set Reasoning119
Fear-Neuro-Inspired Reinforcement Learning for Safe Autonomous Driving116
Self-Supervised Multimodal Learning: A Survey116
Discriminant Feature Extraction by Generalized Difference Subspace115
AutoNovel: Automatically Discovering and Learning Novel Visual Categories115
Interpretable Optimization-Inspired Unfolding Network for Low-Light Image Enhancement115
MHF-Net: An Interpretable Deep Network for Multispectral and Hyperspectral Image Fusion115
Deep Gait Recognition: A Survey114
From Simple to Complex Scenes: Learning Robust Feature Representations for Accurate Human Parsing113
Advances and Challenges in Meta-Learning: A Technical Review113
SVGDreamer++: Advancing Editability and Diversity in Text-Guided SVG Generation113
Dataset Security for Machine Learning: Data Poisoning, Backdoor Attacks, and Defenses112
Learning From Partially Labeled Data for Multi-Organ and Tumor Segmentation112
MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning112
Domain Generalization: A Survey111
P2T: Pyramid Pooling Transformer for Scene Understanding110
A Style-Based Generator Architecture for Generative Adversarial Networks109
Influence Function Based Second-Order Channel Pruning: Evaluating True Loss Changes for Pruning is Possible Without Retraining109
MRA-Net: Improving VQA Via Multi-Modal Relation Attention Network109
Inter-Intra Hypergraph Computation for Survival Prediction on Whole Slide Images109
Variational Data-Free Knowledge Distillation for Continual Learning108
Deep Learning for Face Anti-Spoofing: A Survey108
QDTrack: Quasi-Dense Similarity Learning for Appearance-Only Multiple Object Tracking108
Dynamic Differential Image Circle Diameter Measurement Precision Assessment: Application to Burning Droplets107
A Fully Automated Method for 3D Individual Tooth Identification and Segmentation in Dental CBCT107
HiGCIN: Hierarchical Graph-Based Cross Inference Network for Group Activity Recognition107
JointFormer: A Unified Framework With Joint Modeling for Video Object Segmentation106
Semi-Supervised Learning for FGVC With Out-of-Category Data106
Low-Shot Video Object Segmentation105
Any Fashion Attribute Editing: Dataset and Pretrained Models105
Homeomorphism Prior for False Positive and Negative Problem in Medical Image Dense Contrastive Representation Learning105
ComputingEdge ad105
WildVideo: Benchmarking LMMs for Understanding Video-Language Interaction105
Cover 3104
Pair Then Relation: Pair-Net for Panoptic Scene Graph Generation104
MoBluRF: Motion Deblurring Neural Radiance Fields for Blurry Monocular Video103
Hypergraph-Based High-Order Correlation Analysis for Large-Scale Long-Tailed Data Classification103
Reframing Neural Networks: Deep Structure in Overcomplete Representations102
CC4S: Encouraging Certainty and Consistency in Scribble-Supervised Semantic Segmentation102
On the Universal Approximation Properties of Deep Neural Networks Using MAM Neurons102
SS-NeRF: Physically Based Sparse Spectral Rendering with Neural Radiance Field102
Test-Time Training for Hyperspectral Image Super-Resolution102
3D Visual Saliency: An Independent Perceptual Measure or a Derivative of 2D Image Saliency?102
DeepMesh: Differentiable Iso-Surface Extraction101
STAR-FC: Structure-Aware Face Clustering on Ultra-Large-Scale Graphs100
S$^{2}$ 2O: Enhancing Adversarial Training With Second-Order Statistics of Weights100
Self-Guidance: Boosting Flow and Diffusion Generation on Their Own100
Progressive Instance-Aware Feature Learning for Compositional Action Recognition99
The Bayesian Cut99
Interpolated Joint Space Adversarial Training for Robust and Generalizable Defenses98
Supervision by Denoising98
PMGT-VR: a Decentralized Proximal-gradient Algorithmic Framework with Variance Reduction98
Noisy Label Learning With Provable Consistency for a Wider Family of Losses97
Unified Modality Separation: A Vision-Language Framework for Unsupervised Domain Adaptation97
Heterogeneous Feature Re-Sampling for Balanced Pedestrian Attribute Recognition97
Probabilistic Directed Distance Fields for Ray-Based Shape Representations97
Towards Reliable and Faithful Explanations: A Disentanglement-Augmented Approach for Selective Rationalization97
RED++ : Data-Free Pruning of Deep Neural Networks via Input Splitting and Output Merging96
AutoEval: Are Labels Always Necessary for Classifier Accuracy Evaluation?96
Reinforcing Generated Images via Meta-Learning for One-Shot Fine-Grained Visual Recognition96
Scale Propagation Network for Generalizable Depth Completion95
ONNXPruner: ONNX-Based General Model Pruning Adapter95
An Energy-Based Prior for Generative Saliency95
Video DataFlywheel: Resolving the Impossible Data Trinity in Video-Language Understanding95
Editorial: Special Section on Egocentric Perception94
The Cluster Structure Function94
Reusable Architecture Growth for Continual Stereo Matching94
PRANCE: Joint Token-Optimization and Structural Channel-Pruning for Adaptive ViT Inference93
Adversarially Robust Neural Architectures93
Pixel Distillation: Cost-Flexible Distillation Across Image Sizes and Heterogeneous Networks93
GhostingNet: A Novel Approach for Glass Surface Detection With Ghosting Cues92
Revealing the Dark Side of Non-Local Attention in Single Image Super-Resolution92
SKDF: A Simple Knowledge Distillation Framework for Distilling Open-Vocabulary Knowledge to Open-world Object Detector91
Continual Unsupervised Generative Modeling91
Compositional Physical Reasoning of Objects and Events From Videos91
Improving Machine Vision Using Human Perceptual Representations: The Case of Planar Reflection Symmetry for Object Classification91
Generalized Task-Driven Medical Image Quality Enhancement With Gradient Promotion91
Compositional Generative Model of Unbounded 4D Cities91
TN-ZSTAD: Transferable Network for Zero-Shot Temporal Activity Detection89
Joint Framework for Single Image Reconstruction and Super-Resolution With an Event Camera89
Unified Adversarial Patch for Visible-Infrared Cross-Modal Attacks in the Physical World89
Adaptive Perspective Distillation for Semantic Segmentation89
luvHarris: A Practical Corner Detector for Event-Cameras88
Semantic Object Accuracy for Generative Text-to-Image Synthesis88
ModeRNN: Harnessing Spatiotemporal Mode Collapse in Unsupervised Predictive Learning88
$\mathcal {X}$-Metric: An N-Dimensional Information-Theoretic Framework for Groupwise Registration and Deep Combined Computing88
GLC++: Source-Free Universal Domain Adaptation Through Global-Local Clustering and Contrastive Affinity Learning88
Orthogonal Decoupling Contrastive Regularization: Towards Uncorrelated Feature Decoupling for Unpaired Image Restoration88
Analysis of the Hands in Egocentric Vision: A Survey87
Learning to Super-Resolve Blurry Images With Events86
RGB-T Tracking With Template-Bridged Search Interaction and Target-Preserved Template Updating86
Deep Learning on Object-Centric 3D Neural Fields86
A Thorough Benchmark and a New Model for Light Field Saliency Detection86
Privacy-Preserving Deep Action Recognition: An Adversarial Learning Framework and A New Dataset86
FreeFusion: Infrared and Visible Image Fusion via Cross Reconstruction Learning86
Multi-Modality Multi-Attribute Contrastive Pre-Training for Image Aesthetics Computing86
Learning With Constraint Learning: New Perspective, Solution Strategy and Various Applications86
Conformal Prediction for Time Series86
Stimulative Training++: Go Beyond the Performance Limits of Residual Networks85
0.080156803131104