IEEE Transactions on Pattern Analysis and Machine Intelligence

Papers
(The H4-Index of IEEE Transactions on Pattern Analysis and Machine Intelligence is 103. The table below lists the papers at or above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2020-05-01 to 2024-05-01.)
| Article | Citations |
| --- | --- |
| Squeeze-and-Excitation Networks | 3253 |
| OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields | 2061 |
| Deep High-Resolution Representation Learning for Visual Recognition | 1671 |
| Res2Net: A New Multi-Scale Backbone Architecture | 1455 |
| Image Segmentation Using Deep Learning: A Survey | 1118 |
| Deep Learning for 3D Point Clouds: A Survey | 878 |
| Self-Supervised Visual Feature Learning With Deep Neural Networks: A Survey | 830 |
| A Survey on Vision Transformer | 821 |
| Deep Learning for Image Super-Resolution: A Survey | 788 |
| Deep Learning for Person Re-Identification: A Survey and Outlook | 734 |
| NTU RGB+D 120: A Large-Scale Benchmark for 3D Human Activity Understanding | 719 |
| Event-Based Vision: A Survey | 701 |
| GOT-10k: A Large High-Diversity Benchmark for Generic Object Tracking in the Wild | 691 |
| Cascade R-CNN: High Quality Object Detection and Instance Segmentation | 654 |
| U2Fusion: A Unified Unsupervised Image Fusion Network | 628 |
| ProtTrans: Toward Understanding the Language of Life Through Self-Supervised Learning | 478 |
| Residual Dense Network for Image Restoration | 451 |
| Gliding Vertex on the Horizontal Bounding Box for Multi-Oriented Object Detection | 440 |
| Meta-Learning in Neural Networks: A Survey | 420 |
| Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-Shot Cross-Dataset Transfer | 404 |
| A continual learning survey: Defying forgetting in classification tasks | 396 |
| Normalizing Flows: An Introduction and Review of Current Methods | 357 |
| Recent Advances in Open Set Recognition: A Survey | 352 |
| Hierarchical Deep Click Feature Prediction for Fine-Grained Image Recognition | 349 |
| Plug-and-Play Image Restoration With Deep Denoiser Prior | 311 |
| Image Super-Resolution Via Iterative Refinement | 290 |
| A Style-Based Generator Architecture for Generative Adversarial Networks | 282 |
| Salient Object Detection in the Deep Learning Era: An In-Depth Survey | 275 |
| Deep Multi-View Enhancement Hashing for Image Retrieval | 274 |
| A Review of Domain Adaptation without Target Labels | 274 |
| The ApolloScape Open Dataset for Autonomous Driving and Its Application | 264 |
| Imbalance Problems in Object Detection: A Review | 257 |
| Convolutional Networks with Dense Connectivity | 246 |
| Multi-Task Learning for Dense Prediction Tasks: A Survey | 245 |
| Joint Rain Detection and Removal from a Single Image with Contextualized Deep Networks | 236 |
| Knowledge Distillation and Student-Teacher Learning for Visual Intelligence: A Review and New Outlooks | 228 |
| Detection and Tracking Meet Drones Challenge | 220 |
| NWPU-Crowd: A Large-Scale Benchmark for Crowd Counting and Localization | 220 |
| Image-Based 3D Object Reconstruction: State-of-the-Art and Trends in the Deep Learning Era | 213 |
| YOLACT++ Better Real-Time Instance Segmentation | 208 |
| Contextual Transformer Networks for Visual Recognition | 204 |
| Prior Guided Feature Enrichment Network for Few-Shot Segmentation | 202 |
| Deep Audio-Visual Speech Recognition | 200 |
| High Speed and High Dynamic Range Video with an Event Camera | 193 |
| Deep Imbalanced Learning for Face Recognition and Attribute Prediction | 191 |
| FakeCatcher: Detection of Synthetic Portrait Videos using Biological Signals | 185 |
| Dynamic Neural Networks: A Survey | 185 |
| Semi-Supervised Semantic Segmentation With High- and Low-Level Consistency | 181 |
| MEMC-Net: Motion Estimation and Motion Compensation Driven Neural Network for Video Interpolation and Enhancement | 178 |
| Low-Light Image and Video Enhancement Using Deep Learning: A Survey | 176 |
| InterFaceGAN: Interpreting the Disentangled Face Representation Learned by GANs | 175 |
| Concealed Object Detection | 174 |
| CCNet: Criss-Cross Attention for Semantic Segmentation | 172 |
| Domain Generalization: A Survey | 171 |
| Revisiting Video Saliency Prediction in the Deep Learning Era | 169 |
| Maximum Density Divergence for Domain Adaptation | 165 |
| Weakly Supervised Learning with Multi-Stream CNN-LSTM-HMMs to Discover Sequential Parallelism in Sign Language Videos | 161 |
| Learning to Enhance Low-Light Image via Zero-Reference Deep Curve Estimation | 157 |
| Beyond Self-Attention: External Attention Using Two Linear Layers for Visual Tasks | 157 |
| ResMLP: Feedforward Networks for Image Classification With Data-Efficient Training | 157 |
| Diffusion Models in Vision: A Survey | 156 |
| Object Detection in Aerial Images: A Large-Scale Benchmark and Challenges | 149 |
| A Comprehensive Analysis of Deep Regression | 143 |
| Multiview Clustering: A Scalable and Parameter-Free Bipartite Graph Fusion Method | 143 |
| Learning Depth with Convolutional Spatial Propagation Network | 143 |
| Every Pixel Counts ++: Joint Learning of Geometry and Motion with 3D Holistic Understanding | 143 |
| Spatiotemporal Co-Attention Recurrent Neural Networks for Human-Skeleton Motion Prediction | 139 |
| Constructing Stronger and Faster Baselines for Skeleton-Based Action Recognition | 138 |
| Human Action Recognition From Various Data Modalities: A Review | 135 |
| Inferring Salient Objects from Human Fixations | 134 |
| Deep Generative Modelling: A Comparative Review of VAEs, GANs, Normalizing Flows, Energy-Based and Autoregressive Models | 134 |
| Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes | 133 |
| Unsupervised Tracklet Person Re-Identification | 130 |
| Effects of Image Degradation and Degradation Removal to CNN-Based Image Classification | 130 |
| Video Anomaly Detection with Sparse Coding Inspired Deep Neural Networks | 130 |
| Dynamical Hyperparameter Optimization via Deep Reinforcement Learning in Tracking | 128 |
| Confidence Propagation through CNNs for Guided Sparse Depth Regression | 128 |
| Single Image Deraining: From Model-Based to Data-Driven and Beyond | 127 |
| Coherence Constrained Graph LSTM for Group Activity Recognition | 126 |
| ArcFace: Additive Angular Margin Loss for Deep Face Recognition | 126 |
| GAN Inversion: A Survey | 125 |
| MFQE 2.0: A New Approach for Multi-Frame Quality Enhancement on Compressed Video | 124 |
| Robust Low-Rank Tensor Recovery with Rectification and Alignment | 122 |
| SCRDet++: Detecting Small, Cluttered and Rotated Objects via Instance-Level Feature Denoising and Rotation Loss Smoothing | 122 |
| Densely Residual Laplacian Super-Resolution | 121 |
| Hiding Images within Images | 120 |
| Neural Image Compression for Gigapixel Histopathology Image Analysis | 118 |
| Weakly Supervised Object Localization and Detection: A Survey | 116 |
| PredRNN: A Recurrent Neural Network for Spatiotemporal Predictive Learning | 116 |
| Real-Time Scene Text Detection With Differentiable Binarization and Adaptive Scale Fusion | 113 |
| Direction-Aware Spatial Context Features for Shadow Detection and Removal | 111 |
| A Survey on Curriculum Learning | 110 |
| Models Matter, So Does Training: An Empirical Study of CNNs for Optical Flow Estimation | 110 |
| A Survey on Deep Learning Techniques for Stereo-Based Depth Estimation | 110 |
| MTFH: A Matrix Tri-Factorization Hashing Framework for Efficient Cross-Modal Retrieval | 109 |
| Self-Correction for Human Parsing | 109 |
| Negation of the Quantum Mass Function for Multisource Quantum Information Fusion With its Application to Pattern Classification | 108 |
| Graph U-Nets | 107 |
| Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking Recipes and Food Images | 107 |
| AbdomenCT-1K: Is Abdominal Organ Segmentation a Solved Problem? | 105 |
| Fine-Grained Image Analysis With Deep Learning: A Survey | 104 |
| MHF-Net: An Interpretable Deep Network for Multispectral and Hyperspectral Image Fusion | 104 |
| Class-Incremental Learning: Survey and Performance Evaluation on Image Classification | 103 |
| Graph Neural Networks with Convolutional ARMA Filters | 103 |
| Skeleton-Based Online Action Prediction Using Scale Selection Network | 103 |
| Small Data Challenges in Big Data Era: A Survey of Recent Progress on Unsupervised and Semi-Supervised Methods | 103 |
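
The H4-Index quoted at the top of this page can be illustrated with a short sketch. It assumes the H4-Index is an ordinary h-index computed only over papers published in the four-year window; the page does not spell out the exact procedure, so this is an interpretation rather than the site's method. The function name `h_index` and the sample counts are illustrative, not taken from the source.

```python
def h_index(citations):
    """Return the largest h such that at least h papers have >= h citations.

    Assumption: `citations` holds one citation count per paper already
    restricted to the publication window of interest (e.g. four years).
    """
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank  # the top `rank` papers all have at least `rank` citations
        else:
            break
    return h


if __name__ == "__main__":
    # Hypothetical counts, chosen only to show the mechanics.
    sample = [3253, 2061, 1671, 5, 3, 1]
    print(h_index(sample))
```

Under this reading, an index of 103 means at least 103 papers in the window have 103 or more citations each, which is why entries with exactly 103 citations still appear in the table.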