IEEE Transactions on Pattern Analysis and Machine Intelligence

Papers
(The median citation count of IEEE Transactions on Pattern Analysis and Machine Intelligence is 7. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-09-01 to 2025-09-01.)
ArticleCitations
Learn to Predict Sets Using Feed-Forward Neural Networks2949
DAQE: Enhancing the Quality of Compressed Images by Exploiting the Inherent Characteristic of Defocus2593
Task-Oriented Channel Attention for Fine-Grained Few-Shot Classification1445
Modeling Noisy Annotations for Point-Wise Supervision1435
Cover1358
[Back cover - Table of contents, continued]1347
Front Cover1319
Vertical Layering of Quantized Neural Networks for Heterogeneous Inference1237
Principal Uncertainty Quantification With Spatial Correlation for Image Restoration Problems1234
One-for-All: Towards Universal Domain Translation With a Single StyleGAN1008
Deep Non-Rigid Structure From Motion With Missing Data936
Enhancing Representations Through Heterogeneous Self-Supervised Learning864
Quadratic Matrix Factorization With Applications to Manifold Learning693
A Hybrid Stochastic-Deterministic Minibatch Proximal Gradient Method for Efficient Optimization and Generalization673
Interactive NeRF Geometry Editing With Shape Priors617
S-NeRF++: Autonomous Driving Simulation via Neural Reconstruction and Generation588
Learning Graph Convolutional Networks for Multi-Label Recognition and Applications563
A Clustering Validity Index with Multi-Granularity Fusion for Multiple Fuzzy Clustering Algorithms514
Instance Shadow Detection with A Single-Stage Detector503
ResNet-LDDMM: Advancing the LDDMM Framework using Deep Residual Networks490
Event-based Photometric Bundle Adjustment489
Weakly Supervised Semantic Segmentation via Box-Driven Masking and Filling Rate Shifting460
Invariant Policy Learning: A Causal Perspective452
Analysis of Video Quality Datasets via Design of Minimalistic Video Quality Models441
VATr++: Choose Your Words Wisely for Handwritten Text Generation435
Learning With Style: Continual Semantic Segmentation Across Tasks and Domains423
Implicit Annealing in Kernel Spaces: A Strongly Consistent Clustering Approach411
Active Supervised Cross-Modal Retrieval410
On the Trade-off between Flatness and Optimization in Distributed Learning398
OPAL: Occlusion Pattern Aware Loss for Unsupervised Light Field Disparity Estimation397
Multi-Dataset, Multitask Learning of Egocentric Vision Tasks394
DVIS++: Improved Decoupled Framework for Universal Video Segmentation387
Sparse-to-Dense Matching Network for Large-Scale LiDAR Point Cloud Registration380
Towards Accurate and Compact Architectures via Neural Architecture Transformer377
MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation Segmentation373
Centerless Clustering368
Separable Spatial-Temporal Residual Graph for Cloth-Changing Group Re-Identification365
A Comprehensive and Modularized Statistical Framework for Gradient Norm Equality in Deep Neural Networks356
Learning to Guide a Saturation-Based Theorem Prover347
Face Generation and Editing With StyleGAN: A Survey328
Interaction-Based Inductive Bias in Graph Neural Networks: Enhancing Protein-Ligand Binding Affinity Predictions From 3D Structures324
Video Demoireing using Focused-Defocused Dual-Camera System313
BiBBDM: Bidirectional Image Translation with Brownian Bridge Diffusion Models313
Optimization-Based Post-Training Quantization With Bit-Split and Stitching312
A Survey on Deep Learning Techniques for Stereo-Based Depth Estimation312
A Generative Model for Generic Light Field Reconstruction312
Digging Into Uncertainty-Based Pseudo-Label for Robust Stereo Matching307
Locating and Counting Heads in Crowds With a Depth Prior305
Learning Signed Hyper Surfaces for Oriented Point Cloud Normal Estimation302
Towards Unified Deep Image Deraining: A Survey and a New Benchmark297
Multiple Video Frame Interpolation via Enhanced Deformable Separable Convolution292
Self-Supervised Skeleton Representation Learning via Actionlet Contrast and Reconstruct292
Point-to-Pixel Prompting for Point Cloud Analysis With Pre-Trained Image Models280
Simplicial Complex Neural Networks273
Prior Image Guided Snapshot Compressive Spectral Imaging273
Adaptive Surface Normal Constraint for Geometric Estimation From Monocular Images269
Physics-Informed Guided Disentanglement in Generative Networks263
Are Graph Convolutional Networks With Random Weights Feasible?261
Unsupervised Domain Adaptation via Discriminative Manifold Propagation253
Motion-Aware Dynamic Graph Neural Network for Video Compressive Sensing249
Probing Synergistic High-Order Interaction for Multi-Modal Image Fusion248
Omni-Training: Bridging Pre-Training and Meta-Training for Few-Shot Learning240
Symbolic Visual Reinforcement Learning: A Scalable Framework With Object-Level Abstraction and Differentiable Expression Search240
Face Forgery Detection by 3D Decomposition and Composition Search238
Transformer-Based Visual Segmentation: A Survey237
Detection-Friendly Dehazing: Object Detection in Real-World Hazy Scenes235
Graph Convolutional Module for Temporal Action Localization in Videos235
Ensemble-Enhanced Semi-Supervised Learning With Optimized Graph Construction for High-Dimensional Data221
Structure-Preserving Image Super-Resolution221
Deep Long-Tailed Learning: A Survey219
Inferring Point Cloud Quality via Graph Similarity219
Guaranteed Tensor Recovery Fused Low-rankness and Smoothness215
Towards Expressive Spectral-Temporal Graph Neural Networks for Time Series Forecasting214
Multiview Clustering: A Scalable and Parameter-Free Bipartite Graph Fusion Method207
Metrics for Dataset Demographic Bias: A Case Study on Facial Expression Recognition206
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting201
Asymmetric Convolution: An Efficient and Generalized Method to Fuse Feature Maps in Multiple Vision Tasks200
Affective Image Content Analysis: Two Decades Review and New Perspectives196
Fast Component Tree Computation for Images of Limited Levels192
Human-Centric Transformer for Domain Adaptive Action Recognition191
Cover 2187
Cover187
Cover186
Table of Contents186
Cover186
IEEE Computer Society Has You Covered!185
TPAMI Information for Authors185
Point Set Registration for 3D Range Scans Using Fuzzy Cluster-Based Metric and Efficient Global Optimization184
Evaluation for Weakly Supervised Object Localization: Protocol, Metrics, and Datasets184
A Variational EM Acceleration for Efficient Clustering at Very Large Scales182
MRA-Net: Improving VQA Via Multi-Modal Relation Attention Network181
MoIL: Momentum Imitation Learning for Efficient Vision-Language Adaptation181
Discriminant Feature Extraction by Generalized Difference Subspace181
Towards Pointsets Representation Learning via Self-Supervised Learning and Set Augmentation181
SVGDreamer++: Advancing Editability and Diversity in Text-Guided SVG Generation178
LMP-GAN: Out-of-Distribution Detection for Non-Control Data Malware Attacks177
Universal Image Segmentation with Efficiency176
On Positive-Unlabeled Classification From Corrupted Data in GANs172
RF-Next: Efficient Receptive Field Search for Convolutional Neural Networks171
Temporal Feature Matters: A Framework for Diffusion Model Quantization170
Influence Function Based Second-Order Channel Pruning: Evaluating True Loss Changes for Pruning is Possible Without Retraining168
Knowledge-Based Embodied Question Answering168
Random Permutation Set Reasoning166
Rate-Distortion Theory in Coding for Machines and Its Applications163
Image-to-Image Translation With Disentangled Latent Vectors for Face Editing158
Accurate and Efficient Stereo Matching via Attention Concatenation Volume158
Deep Learning for Face Anti-Spoofing: A Survey157
PathNet: Path-Selective Point Cloud Denoising156
InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis with Semantic Graph Prior155
From Simple to Complex Scenes: Learning Robust Feature Representations for Accurate Human Parsing153
Out-of-Domain Generalization From a Single Source: An Uncertainty Quantification Approach152
Robust Multimodal Learning With Missing Modalities via Parameter-Efficient Adaptation150
Hypergraph-Based Multi-View Action Recognition Using Event Cameras148
Ensemble Multi-Quantiles: Adaptively Flexible Distribution Prediction for Uncertainty Quantification145
Flare7K++: Mixing Synthetic and Real Datasets for Nighttime Flare Removal and Beyond144
Variational Data-Free Knowledge Distillation for Continual Learning142
Towards Deviation-Robust Agent Navigation via Perturbation-Aware Contrastive Learning142
Curriculum-Based Asymmetric Multi-Task Reinforcement Learning141
SPARE: Symmetrized Point-to-Plane Distance for Robust Non-Rigid 3D Registration137
Power Normalizations in Fine-Grained Image, Few-Shot Image and Graph Classification137
BNET: Batch Normalization With Enhanced Linear Transformation137
Reduced-Rank Tensor-on-Tensor Regression and Tensor-Variate Analysis of Variance136
Scalable Optimal Transport Methods in Machine Learning: A Contemporary Survey136
GCoNet+: A Stronger Group Collaborative Co-Salient Object Detector135
Correcting Optical Aberration via Depth-Aware Point Spread Functions135
Learning to See Through With Events133
Learning From Partially Labeled Data for Multi-Organ and Tumor Segmentation132
VNVC: A Versatile Neural Video Coding Framework for Efficient Human-Machine Vision132
AutoNovel: Automatically Discovering and Learning Novel Visual Categories132
HiGCIN: Hierarchical Graph-Based Cross Inference Network for Group Activity Recognition131
Learning Graph Attentions via Replicator Dynamics129
Bridging Actions: Generate 3D Poses and Shapes In-Between Photos128
Dynamic Self-Supervised Teacher-Student Network Learning128
New Dataset and Methods for Fine-Grained Compositional Referring Expression Comprehension via Specialist-MLLM Collaboration127
Spatial-Temporal Transformer for Video Snapshot Compressive Imaging126
Human Interaction Understanding With Consistency-Aware Learning126
Differential Viewpoints for Ground Terrain Material Recognition126
SPLiT: Single Portrait Lighting Estimation via a Tetrad of Face Intrinsics125
Self-Scalable Tanh (Stan): Multi-Scale Solutions for Physics-Informed Neural Networks124
Compositional Scene Representation Learning via Reconstruction: A Survey124
Orientational Distribution Learning with Hierarchical Spatial Attention for Open Set Recognition123
Self-Supervised Multimodal Learning: A Survey122
Deep Learning-Based Point Cloud Compression: An In-Depth Survey and Benchmark121
Enhancing Photorealism Enhancement119
Meta Invariance Defense Towards Generalizable Robustness to Unknown Adversarial Attacks119
GenPoly: Learning Generalized and Tessellated Shape Priors via 3D Polymorphic Evolving119
Adaptive Transfer Kernel Learning for Transfer Gaussian Process Regression118
Image Lens Flare Removal Using Adversarial Curve Learning118
Inter-Intra Hypergraph Computation for Survival Prediction on Whole Slide Images118
On the Robustness of Average Losses for Partial-Label Learning115
Revisiting Nonlocal Self-Similarity from Continuous Representation115
Differentially Private Graph Neural Networks for Whole-Graph Classification115
GradMDM: Adversarial Attack on Dynamic Networks113
Matrix Completion via Non-Convex Relaxation and Adaptive Correlation Learning113
Graph-oriented Instruction Tuning of Large Language Models for Generic Graph Mining113
Unbiased Scene Graph Generation via Two-Stage Causal Modeling111
A New Brain Network Construction Paradigm for Brain Disorder via Diffusion-Based Graph Contrastive Learning111
Dawn of the Transformer Era in Speech Emotion Recognition: Closing the Valence Gap111
P2T: Pyramid Pooling Transformer for Scene Understanding110
MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning110
Weakly Supervised Tracklet Association Learning With Video Labels for Person Re-Identification109
Fear-Neuro-Inspired Reinforcement Learning for Safe Autonomous Driving108
MHF-Net: An Interpretable Deep Network for Multispectral and Hyperspectral Image Fusion108
Reconstruction Guided Meta-Learning for Few Shot Open Set Recognition108
Dataset Security for Machine Learning: Data Poisoning, Backdoor Attacks, and Defenses108
A Fully Automated Method for 3D Individual Tooth Identification and Segmentation in Dental CBCT106
Domain Generalization: A Survey106
Asymmetric Loss Functions for Noise-Tolerant Learning: Theory and Applications106
A Style-Based Generator Architecture for Generative Adversarial Networks106
Advances and Challenges in Meta-Learning: A Technical Review105
Interpretable Optimization-Inspired Unfolding Network for Low-Light Image Enhancement104
QDTrack: Quasi-Dense Similarity Learning for Appearance-Only Multiple Object Tracking104
ComputingEdge ad103
Deep Gait Recognition: A Survey103
luvHarris: A Practical Corner Detector for Event-Cameras102
Human as Points: Explicit Point-Based 3D Human Reconstruction From Single-View RGB Images102
JointFormer: A Unified Framework With Joint Modeling for Video Object Segmentation102
Multi-Modality Multi-Attribute Contrastive Pre-Training for Image Aesthetics Computing101
Dynamic Differential Image Circle Diameter Measurement Precision Assessment: Application to Burning Droplets101
Low-Shot Video Object Segmentation101
ModeRNN: Harnessing Spatiotemporal Mode Collapse in Unsupervised Predictive Learning101
Homeomorphism Prior for False Positive and Negative Problem in Medical Image Dense Contrastive Representation Learning100
Semi-Supervised Learning for FGVC With Out-of-Category Data100
Recurrent Neural Networks for Snapshot Compressive Imaging98
Probabilistic Directed Distance Fields for Ray-Based Shape Representations98
TN-ZSTAD: Transferable Network for Zero-Shot Temporal Activity Detection98
Cover 397
Learning With Constraint Learning: New Perspective, Solution Strategy and Various Applications97
Video DataFlywheel: Resolving the Impossible Data Trinity in Video-Language Understanding96
The Cluster Structure Function95
Editorial: Special Section on Egocentric Perception95
An Energy-Based Prior for Generative Saliency95
Heterogeneous Feature Re-Sampling for Balanced Pedestrian Attribute Recognition94
Reusable Architecture Growth for Continual Stereo Matching94
Generalized Task-Driven Medical Image Quality Enhancement With Gradient Promotion94
Scale Propagation Network for Generalizable Depth Completion94
Deciphering the Feature Representation of Deep Neural Networks for High-Performance AI93
STAR-FC: Structure-Aware Face Clustering on Ultra-Large-Scale Graphs93
Stimulative Training++: Go Beyond the Performance Limits of Residual Networks93
The Bayesian Cut93
Relationship Quantification of Image Degradations93
Unified Adversarial Patch for Visible-Infrared Cross-Modal Attacks in the Physical World93
Deep Learning on Object-Centric 3D Neural Fields92
GhostingNet: A Novel Approach for Glass Surface Detection With Ghosting Cues92
TE141K: Artistic Text Benchmark for Text Effect Transfer92
Continual Unsupervised Generative Modeling92
Equivalent Classification Mapping for Weakly Supervised Temporal Action Localization92
AutoEval: Are Labels Always Necessary for Classifier Accuracy Evaluation?90
Learning to Super-Resolve Blurry Images With Events90
Adversarially Robust Neural Architectures89
SS-TBN: A Semi-Supervised Tri-Branch Network for COVID-19 Screening and Lesion Segmentation88
DeepMesh: Differentiable Iso-Surface Extraction88
Reframing Neural Networks: Deep Structure in Overcomplete Representations88
S $^{2}$ O: Enhancing Adversarial Training with Second-Order Statistics of Weights88
Point Spatio-Temporal Transformer Networks for Point Cloud Video Modeling88
FNA++: Fast Network Adaptation via Parameter Remapping and Architecture Search88
Supervision by Denoising87
CC4S: Encouraging Certainty and Consistency in Scribble-Supervised Semantic Segmentation87
A Thorough Benchmark and a New Model for Light Field Saliency Detection87
Semantic Object Accuracy for Generative Text-to-Image Synthesis86
Improving Machine Vision Using Human Perceptual Representations: The Case of Planar Reflection Symmetry for Object Classification85
Unified Modality Separation: A Vision-Language Framework for Unsupervised Domain Adaptation84
Compositional Generative Model of Unbounded 4D Cities84
Noisy Label Learning With Provable Consistency for a Wider Family of Losses84
SKDF: A Simple Knowledge Distillation Framework for Distilling Open-Vocabulary Knowledge to Open-world Object Detector83
Reinforcing Generated Images via Meta-Learning for One-Shot Fine-Grained Visual Recognition83
WildVideo: Benchmarking LMMs for Understanding Video-Language Interaction83
MB-TaylorFormer V2: Improved Multi-Branch Linear Transformer Expanded by Taylor Formula for Image Restoration83
ONNXPruner: ONNX-Based General Model Pruning Adapter83
Joint Framework for Single Image Reconstruction and Super-Resolution With an Event Camera83
$\mathcal {X}$-Metric: An N-Dimensional Information-Theoretic Framework for Groupwise Registration and Deep Combined Computing83
Optimizing Regularized Cholesky Score for Order-Based Learning of Bayesian Networks83
3D Visual Saliency: An Independent Perceptual Measure or a Derivative of 2D Image Saliency?82
RGB-T Tracking With Template-Bridged Search Interaction and Target-Preserved Template Updating82
Single Image Deraining: From Model-Based to Data-Driven and Beyond81
Adaptive Perspective Distillation for Semantic Segmentation81
Pixel Distillation: Cost-Flexible Distillation Across Image Sizes and Heterogeneous Networks81
Distributionally Location-Aware Transferable Adversarial Patches for Facial Images80
Map-Guided Curriculum Domain Adaptation and Uncertainty-Aware Evaluation for Semantic Nighttime Image Segmentation80
Progressive Instance-Aware Feature Learning for Compositional Action Recognition80
Pair Then Relation: Pair-Net for Panoptic Scene Graph Generation79
Any Fashion Attribute Editing: Dataset and Pretrained Models79
Towards Reliable and Faithful Explanations: A Disentanglement-Augmented Approach for Selective Rationalization79
CycMuNet+: Cycle-Projected Mutual Learning for Spatial-Temporal Video Super-Resolution79
Revealing the Dark Side of Non-Local Attention in Single Image Super-Resolution79
Conformal Prediction for Time Series79
RED++ : Data-Free Pruning of Deep Neural Networks via Input Splitting and Output Merging79
Interpolated Joint Space Adversarial Training for Robust and Generalizable Defenses78
Modeling the Label Distributions for Weakly-Supervised Semantic Segmentation78
MoBluRF: Motion Deblurring Neural Radiance Fields for Blurry Monocular Video77
0.15038895606995