IEEE Transactions on Image Processing

Papers
(The H4-Index of IEEE Transactions on Image Processing is 69. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-05-01 to 2026-05-01.)
ArticleCitations
Variational Structured Attention Networks for Deep Visual Representation Learning952
One-Class Classification Using ℓp-Norm Multiple Kernel Fisher Null Approach872
TSFormer: Efficient Ultra-High-Definition Image Restoration via Trusted Min- p794
An Explanation Method Based on Interpretable Linear Model With Four Key Characteristics693
FF-LPD: A Real-Time Frame-by-Frame License Plate Detector With Knowledge Distillation and Feature Propagation689
Density-Guided Incremental Dominant Instance Exploration for Two-View Geometric Model Fitting551
Color Spike Camera Reconstruction via Long Short-Term Temporal Aggregation of Spike Signals317
Focus on Finding Deepfakes: A Robust Proactive Detection Method Based on Orthogonal Moment Watermarking294
Cross-Domain Few-Shot Medical Image Segmentation via Dynamic Semantic Matching291
An Adaptive Multi-Granularity Graph Representation of Image via Granular-ball Computing279
Harnessing Multi-Modal Large Language Models for Measuring and Interpreting Color Differences257
Toward Efficient Test Time Adaptation With Hierarchical Distribution Alignment244
Pose-Appearance Relational Modeling for Video Action Recognition235
Equivariant Local Reference Frames With Optimization for Robust Non-Rigid Point Cloud Correspondence195
Global Modeling Matters: A Fast, Lightweight, and Effective Baseline for Efficient Image Restoration191
Toward Projected Clustering With Aggregated Mapping186
LearnMat: Semantic-Aware Self-Supervision Fine-Grained Visual Recognition179
COME: A Collaborative Optimization Framework With Low-Rank MoE for Indoor 3D Object Detection179
Information-Maximized Soft Variable Discretization for Self-Supervised Image Representation Learning172
Consensus Sparsity: Multi-Context Sparse Image Representation via L -Induced Matrix Variate171
HAda: Hyper-Adaptive Parameter-Efficient Learning for Multi-View ConvNets171
Revisiting Fine-Grained Image Analysis by Semantic-Part Alignment167
Graph Embedding Contrastive Multi-Modal Representation Learning for Clustering164
High-Fidelity Seismic Super-Resolution Using Prior-Informed Deep Learning With 3D Awareness154
Uncertainty-Guided Refinement for Fine-Grained Salient Object Detection153
Zero-Pose-Prior NeRF: Recursive Radiance Field Reconstruction From Unposed and Unordered Images148
SemiRS-COC: Semi-Supervised Classification for Complex Remote Sensing Scenes With Cross-Object Consistency143
TTVFI: Learning Trajectory-Aware Transformer for Video Frame Interpolation143
Bi-Nuclear Tensor Schatten-p Norm Minimization for Multi-View Subspace Clustering139
Advancing Pre-trained Teacher: Towards Robust Feature Discrepancy for Anomaly Detection126
Multi-Granularity Contrastive Cross-Modal Collaborative Generation for End-to-End Long-Term Video Question Answering121
Attention-Guided Neural Networks for Full-Reference and No-Reference Audio-Visual Quality Assessment114
Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments114
Fine-Grained Recognition With Learnable Semantic Data Augmentation106
Star-Shaped Multi-Person Interaction Graph Model for Group Skeleton-Based Action Recognition104
Automatic Quaternion-Domain Color Image Stitching101
Pro2Diff: Proposal Propagation for Multi-Object Tracking via the Diffusion Model101
LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models100
STPNet: Scale-Aware Text Prompt Network for Medical Image Segmentation100
MaCon: A Generic Self-Supervised Framework for Unsupervised Multimodal Change Detection100
AdaAugment: A Tuning-Free and Adaptive Approach to Enhance Data Augmentation98
Spatial Frequency Modulation Network for Efficient Image Dehazing95
Cross-Modal Retrieval With Noisy Correspondence via Consistency Refining and Mining94
Cross-Modality Pyramid Alignment for Visual Intention Understanding93
OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free Environments91
Advances in Predictive RAHT for Geometric Point Cloud Compression87
Fast 3D Room Layout Estimation Based on Compact High-Level Representation87
Inverse Image Frequency for Long-Tailed Image Recognition87
ASDTracker: Adaptively Sparse Detection With Attention-Guided Refinement for Efficient Multi-Object Tracking86
Generalization Beyond Feature Alignment: Concept Activation-Guided Contrastive Learning83
Decoupled Cross-Modal Phrase-Attention Network for Image-Sentence Matching82
Unsupervised Modality-Transferable Video Highlight Detection With Representation Activation Sequence Learning81
Precise Facial Landmark Detection by Reference Heatmap Transformer79
Perceptually Weighted Rate Distortion Optimization for Video-Based Point Cloud Compression78
Fuzzy Sparse Subspace Clustering for Infrared Image Segmentation78
Spatial-Temporal Scene Graph Generation for Open-Vocabulary Multiple Object Tracking78
Weakly Supervised Semantic Segmentation via Alternate Self-Dual Teaching77
Toward Generalizable Forgery Detection and Reasoning77
Rethinking Sampling Strategies for Unsupervised Person Re-Identification76
FD-SCU: Frequency Decomposition-Based Spectrum Collaborative Upsampling for Point Cloud Color Attribute75
Addressing Challenges of Incorporating Appearance Cues Into Heuristic Multi-Object Tracker via a Novel Feature Paradigm74
Multi-Condition Latent Diffusion Network for Scene-Aware Neural Human Motion Prediction74
Vision Enhancing LLMs: Empowering Multimodal Knowledge Storage and Sharing in LLMs73
Boundary-Aware Prototype in Semi-Supervised Medical Image Segmentation73
TSCCD: Temporal Self-Construction Cross-Domain Learning for Unsupervised Hyperspectral Change Detection73
ScaleNet: Scaling up Pretrained Neural Networks With Incremental Parameters73
Optimization-Inspired Learning With Architecture Augmentations and Control Mechanisms for Low-Level Vision71
SRS: Siamese Reconstruction-Segmentation Network Based on Dynamic-Parameter Convolution71
Point-Based Learnable Query Generator for Human–Object Interaction Detection70
Stacked Deconvolutional Network for Semantic Segmentation69
Hyperspectral Meets Optical Flow: Spectral Flow Extraction for Hyperspectral Image Classification69
Distractor-Aware Event-Based Tracking69
0.167160987854