IEEE Transactions on Pattern Analysis and Machine Intelligence

Papers
(The H4-Index of IEEE Transactions on Pattern Analysis and Machine Intelligence is 134. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-01-01 to 2026-01-01.)
ArticleCitations
[Back cover - Table of contents, continued]3182
Front Cover1690
One-for-All: Towards Universal Domain Translation With a Single StyleGAN1610
Editorial: Introduction to the Special Section on Best of CVPR'20221549
Self-Supervised Skeleton Representation Learning Via Actionlet Contrast and Reconstruct1515
BiBBDM: Bidirectional Image Translation With Brownian Bridge Diffusion Models1221
Principal Uncertainty Quantification With Spatial Correlation for Image Restoration Problems1171
S-NeRF++: Autonomous Driving Simulation via Neural Reconstruction and Generation982
MECD+ : Unlocking Event-Level Causal Graph Discovery for Video Reasoning789
ResNet-LDDMM: Advancing the LDDMM Framework using Deep Residual Networks765
Active Supervised Cross-Modal Retrieval611
DAQE: Enhancing the Quality of Compressed Images by Exploiting the Inherent Characteristic of Defocus597
Event-Based Photometric Bundle Adjustment577
On the Trade-Off Between Flatness and Optimization in Distributed Learning576
VATr++: Choose Your Words Wisely for Handwritten Text Generation546
Revisiting Transformation Invariant Geometric Deep Learning: An Initial Representation Perspective534
Learning to Guide a Saturation-Based Theorem Prover523
Towards Accurate and Compact Architectures via Neural Architecture Transformer518
Sparse-to-Dense Matching Network for Large-Scale LiDAR Point Cloud Registration496
Separable Spatial-Temporal Residual Graph for Cloth-Changing Group Re-Identification490
A Hybrid Stochastic-Deterministic Minibatch Proximal Gradient Method for Efficient Optimization and Generalization488
Enhancing Representations Through Heterogeneous Self-Supervised Learning483
Quadratic Matrix Factorization With Applications to Manifold Learning475
Learning With Style: Continual Semantic Segmentation Across Tasks and Domains474
Adaptive Surface Normal Constraint for Geometric Estimation From Monocular Images471
Task-Oriented Channel Attention for Fine-Grained Few-Shot Classification456
SNI-SLAM++: Tightly-Coupled Semantic Neural Implicit SLAM455
Implicit Annealing in Kernel Spaces: A Strongly Consistent Clustering Approach455
Video Demoireing Using Focused-Defocused Dual-Camera System445
Instance Shadow Detection with A Single-Stage Detector421
Rethinking Rotation-Invariant Recognition of Fine-grained Shapes from the Perspective of Contour Points415
MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation Segmentation413
Learn to Predict Sets Using Feed-Forward Neural Networks394
DVIS++: Improved Decoupled Framework for Universal Video Segmentation393
Metrics for Dataset Demographic Bias: A Case Study on Facial Expression Recognition376
Test-time Correction: An Online 3D Detection System via Visual Prompting371
Optimization-Based Post-Training Quantization With Bit-Split and Stitching370
Invariant Policy Learning: A Causal Perspective368
Face Forgery Detection by 3D Decomposition and Composition Search360
Locating and Counting Heads in Crowds With a Depth Prior359
Physics-Informed Guided Disentanglement in Generative Networks353
A Generative Model for Generic Light Field Reconstruction353
Weakly Supervised Semantic Segmentation via Box-Driven Masking and Filling Rate Shifting342
Towards Expressive Spectral-Temporal Graph Neural Networks for Time Series Forecasting341
A Clustering Validity Index With Multi-Granularity Fusion for Multiple Fuzzy Clustering Algorithms338
OPAL: Occlusion Pattern Aware Loss for Unsupervised Light Field Disparity Estimation334
Symbolic Visual Reinforcement Learning: A Scalable Framework With Object-Level Abstraction and Differentiable Expression Search332
Detection-Friendly Dehazing: Object Detection in Real-World Hazy Scenes330
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting319
Point-to-Pixel Prompting for Point Cloud Analysis With Pre-Trained Image Models319
Graph Convolutional Module for Temporal Action Localization in Videos315
Towards Unified Deep Image Deraining: A Survey and a New Benchmark300
Multiview Clustering: A Scalable and Parameter-Free Bipartite Graph Fusion Method298
Omni-Training: Bridging Pre-Training and Meta-Training for Few-Shot Learning283
Prior Image Guided Snapshot Compressive Spectral Imaging270
Ensemble-Enhanced Semi-Supervised Learning With Optimized Graph Construction for High-Dimensional Data269
Digging Into Uncertainty-Based Pseudo-Label for Robust Stereo Matching268
Learning Graph Convolutional Networks for Multi-Label Recognition and Applications265
Interactive NeRF Geometry Editing With Shape Priors264
Motion-Aware Dynamic Graph Neural Network for Video Compressive Sensing263
Structure-Preserving Image Super-Resolution263
Multiple Video Frame Interpolation via Enhanced Deformable Separable Convolution262
Modeling Noisy Annotations for Point-Wise Supervision254
Probing Synergistic High-Order Interaction for Multi-Modal Image Fusion253
Inferring Point Cloud Quality via Graph Similarity251
Simplicial Complex Neural Networks247
Interaction-Based Inductive Bias in Graph Neural Networks: Enhancing Protein-Ligand Binding Affinity Predictions From 3D Structures247
Are Graph Convolutional Networks With Random Weights Feasible?246
Centerless Clustering240
Unsupervised Domain Adaptation via Discriminative Manifold Propagation238
Multi-Dataset, Multitask Learning of Egocentric Vision Tasks236
Vertical Layering of Quantized Neural Networks for Heterogeneous Inference235
Learning Signed Hyper Surfaces for Oriented Point Cloud Normal Estimation234
Face Generation and Editing With StyleGAN: A Survey231
Asymmetric Convolution: An Efficient and Generalized Method to Fuse Feature Maps in Multiple Vision Tasks230
Deep Long-Tailed Learning: A Survey229
Affective Image Content Analysis: Two Decades Review and New Perspectives228
Transformer-Based Visual Segmentation: A Survey225
Guaranteed Tensor Recovery Fused Low-rankness and Smoothness221
A Comprehensive and Modularized Statistical Framework for Gradient Norm Equality in Deep Neural Networks220
Analysis of Video Quality Datasets via Design of Minimalistic Video Quality Models220
A Survey on Deep Learning Techniques for Stereo-Based Depth Estimation218
IEEE Computer Society Has You Covered!215
Cover 2212
Fast Component Tree Computation for Images of Limited Levels212
BNET: Batch Normalization With Enhanced Linear Transformation209
Universal Image Segmentation With Efficiency208
New Dataset and Methods for Fine-Grained Compositional Referring Expression Comprehension via Specialist-MLLM Collaboration206
Temporal Feature Matters: A Framework for Diffusion Model Quantization202
Spatial-Temporal Transformer for Video Snapshot Compressive Imaging201
Learning to See Through With Events198
Differential Viewpoints for Ground Terrain Material Recognition197
Revisiting Transferable Adversarial Images: Systemization, Evaluation, and New Insights194
Adaptive Transfer Kernel Learning for Transfer Gaussian Process Regression191
Orientational Distribution Learning with Hierarchical Spatial Attention for Open Set Recognition188
Graph-Oriented Instruction Tuning of Large Language Models for Generic Graph Mining186
M3D: A Multimodal, Multilingual and Multitask Dataset for Grounded Document-Level Information Extraction183
Flare7K++: Mixing Synthetic and Real Datasets for Nighttime Flare Removal and Beyond182
RF-Next: Efficient Receptive Field Search for Convolutional Neural Networks181
Meta Invariance Defense Towards Generalizable Robustness to Unknown Adversarial Attacks180
Random Permutation Set Reasoning180
Learning From Partially Labeled Data for Multi-Organ and Tumor Segmentation177
Power Normalizations in Fine-Grained Image, Few-Shot Image and Graph Classification177
Towards Pointsets Representation Learning via Self-Supervised Learning and Set Augmentation175
Image-to-Image Translation With Disentangled Latent Vectors for Face Editing173
Influence Function Based Second-Order Channel Pruning: Evaluating True Loss Changes for Pruning is Possible Without Retraining172
Curriculum-Based Asymmetric Multi-Task Reinforcement Learning171
Deep Learning for Face Anti-Spoofing: A Survey170
Reduced-Rank Tensor-on-Tensor Regression and Tensor-Variate Analysis of Variance169
Human-Centric Transformer for Domain Adaptive Action Recognition164
AutoNovel: Automatically Discovering and Learning Novel Visual Categories162
Inter-Intra Hypergraph Computation for Survival Prediction on Whole Slide Images162
Out-of-Domain Generalization From a Single Source: An Uncertainty Quantification Approach160
MHF-Net: An Interpretable Deep Network for Multispectral and Hyperspectral Image Fusion159
HiGCIN: Hierarchical Graph-Based Cross Inference Network for Group Activity Recognition158
On the Robustness of Average Losses for Partial-Label Learning157
MoIL: Momentum Imitation Learning for Efficient Vision-Language Adaptation152
Towards Deviation-Robust Agent Navigation via Perturbation-Aware Contrastive Learning152
On Positive-Unlabeled Classification From Corrupted Data in GANs151
Human Interaction Understanding With Consistency-Aware Learning150
GradMDM: Adversarial Attack on Dynamic Networks150
InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis With Semantic Graph Prior149
SPARE: Symmetrized Point-to-Plane Distance for Robust Non-Rigid 3D Registration149
Learning Graph Attentions via Replicator Dynamics148
Learning Efficient Meshflow and Optical Flow from Event Cameras148
Image Lens Flare Removal Using Adversarial Curve Learning147
To Fold or Not to Fold: Graph Regularized Tensor Train for Visual Data Completion146
Bridging Actions: Generate 3D Poses and Shapes In-Between Photos146
Matrix Completion via Non-Convex Relaxation and Adaptive Correlation Learning142
Deep Learning-Based Point Cloud Compression: An In-Depth Survey and Benchmark141
GenPoly: Learning Generalized and Tessellated Shape Priors via 3D Polymorphic Evolving139
SPLiT: Single Portrait Lighting Estimation via a Tetrad of Face Intrinsics139
Dynamic Self-Supervised Teacher-Student Network Learning139
Evaluation for Weakly Supervised Object Localization: Protocol, Metrics, and Datasets139
0.81181406974792