IEEE Transactions on Image Processing

Papers
(The H4-Index of IEEE Transactions on Image Processing is 88. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-01-01 to 2026-01-01.)
ArticleCitations
Consensus Sparsity: Multi-Context Sparse Image Representation via L -Induced Matrix Variate755
Pro2Diff: Proposal Propagation for Multi-Object Tracking via the Diffusion Model731
Dual Alternating Direction Method of Multipliers for Inverse Imaging651
An Explanation Method Based on Interpretable Linear Model With Four Key Characteristics640
Multiframe Joint Enhancement for Early Interlaced Videos529
Toward Efficient Test Time Adaptation With Hierarchical Distribution Alignment483
Cross-Domain Few-Shot Medical Image Segmentation via Dynamic Semantic Matching471
Variational Structured Attention Networks for Deep Visual Representation Learning421
Attentive WaveBlock: Complementarity-Enhanced Mutual Networks for Unsupervised Domain Adaptation in Person Re-Identification and Beyond417
An Adaptive Multi-Granularity Graph Representation of Image via Granular-ball Computing381
GMLight: Lighting Estimation via Geometric Distribution Approximation315
Equivariant Local Reference Frames With Optimization for Robust Non-Rigid Point Cloud Correspondence301
Self-Supervised Matting-Specific Portrait Enhancement and Generation268
Color Spike Camera Reconstruction via Long Short-Term Temporal Aggregation of Spike Signals262
SemiRS-COC: Semi-Supervised Classification for Complex Remote Sensing Scenes With Cross-Object Consistency249
Canonical Correlation Analysis With Low-Rank Learning for Image Representation249
MaCon: A Generic Self-Supervised Framework for Unsupervised Multimodal Change Detection223
HAda: Hyper-Adaptive Parameter-Efficient Learning for Multi-View ConvNets219
AdaAugment: A Tuning-Free and Adaptive Approach to Enhance Data Augmentation214
Discrete Metric Learning for Fast Image Set Classification212
FF-LPD: A Real-Time Frame-by-Frame License Plate Detector With Knowledge Distillation and Feature Propagation211
Automatic Quaternion-Domain Color Image Stitching201
Multimodal Unrolled Robust PCA for Background Foreground Separation200
Graph Embedding Contrastive Multi-Modal Representation Learning for Clustering197
Density-Guided Incremental Dominant Instance Exploration for Two-View Geometric Model Fitting194
Multi-Constraint Adversarial Networks for Unsupervised Image-to-Image Translation193
Graph Convolutional Dictionary Selection With L, Norm for Video Summarization191
Multi-Granularity Contrastive Cross-Modal Collaborative Generation for End-to-End Long-Term Video Question Answering185
Bi-Nuclear Tensor Schatten-p Norm Minimization for Multi-View Subspace Clustering180
Toward Projected Clustering With Aggregated Mapping178
Contrast-Reconstruction Representation Learning for Self-Supervised Skeleton-Based Action Recognition175
Learning Spectral Cues for Multispectral and Panchromatic Image Fusion175
Fine-Grained Recognition With Learnable Semantic Data Augmentation169
Cross-Modal Retrieval With Noisy Correspondence via Consistency Refining and Mining161
A Fast and Efficient Shape Blending by Stable and Analytically Invertible Finite Descriptors160
OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free Environments156
A Low-Rank Tensor Decomposition Model With Factors Prior and Total Variation for Impulsive Noise Removal151
LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models150
TTVFI: Learning Trajectory-Aware Transformer for Video Frame Interpolation141
Differentiable SAR Renderer and Image-Based Target Reconstruction141
Cross-Modality Pyramid Alignment for Visual Intention Understanding140
TSFormer: Efficient Ultra-High-Definition Image Restoration via Trusted Min- p138
STPNet: Scale-Aware Text Prompt Network for Medical Image Segmentation137
Attention-Guided Neural Networks for Full-Reference and No-Reference Audio-Visual Quality Assessment137
Few-Shot Learning With Class-Covariance Metric for Hyperspectral Image Classification137
Pose-Appearance Relational Modeling for Video Action Recognition136
Harnessing Multi-modal Large Language Models for Measuring and Interpreting Color Differences135
Spatial Frequency Modulation Network for Efficient Image Dehazing134
Real Image Denoising With a Locally-Adaptive Bitonic Filter132
One-Class Classification Using ℓp-Norm Multiple Kernel Fisher Null Approach131
Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments131
Uncertainty-Guided Refinement for Fine-Grained Salient Object Detection131
Cross-Domain Diffusion With Progressive Alignment for Efficient Adaptive Retrieval129
Advances in Predictive RAHT for Geometric Point Cloud Compression129
Interactive Face Video Coding: A Generative Compression Framework128
Variational Bayes Image Restoration With Compressive Autoencoders127
Unsupervised Person Re-Identification With Stochastic Training Strategy126
Grammar-Induced Wavelet Network for Human Parsing125
Distractor-Aware Event-Based Tracking120
SharpFormer: Learning Local Feature Preserving Global Representations for Image Deblurring120
Fuzzy Sparse Subspace Clustering for Infrared Image Segmentation118
NeuralDiffuser: Neuroscience-Inspired Diffusion Guidance for fMRI Visual Reconstruction117
RSSFormer: Foreground Saliency Enhancement for Remote Sensing Land-Cover Segmentation115
Multi-Condition Latent Diffusion Network for Scene-Aware Neural Human Motion Prediction112
Unsupervised Modality-Transferable Video Highlight Detection With Representation Activation Sequence Learning112
Video Moment Retrieval With Cross-Modal Neural Architecture Search111
Generalization Beyond Feature Alignment: Concept Activation-Guided Contrastive Learning109
ScaleNet: Scaling up Pretrained Neural Networks With Incremental Parameters108
IMU-Assisted Online Video Background Identification106
Learning Dynamic Prompts for All-in-One Image Restoration106
SRS: Siamese Reconstruction-Segmentation Network Based on Dynamic-Parameter Convolution105
Multi-Exposure Image Fusion via Deformable Self-Attention105
Stacked Deconvolutional Network for Semantic Segmentation102
Fast 3D Room Layout Estimation Based on Compact High-Level Representation100
Cross-Attentional Spatio-Temporal Semantic Graph Networks for Video Question Answering100
Cyclic Self-Training With Proposal Weight Modulation for Cross-Supervised Object Detection99
Unsupervised Foggy Scene Understanding via Self Spatial-Temporal Label Diffusion97
Bidirectional Mapping Coupled GAN for Generalized Zero-Shot Learning97
KSS-ICP: Point Cloud Registration Based on Kendall Shape Space96
FsaNet: Frequency Self-Attention for Semantic Segmentation95
Boundary-Aware Prototype in Semi-Supervised Medical Image Segmentation93
Optimization-Inspired Learning With Architecture Augmentations and Control Mechanisms for Low-Level Vision93
Coarse-to-Fine Contrastive Self-Supervised Feature Learning for Land-Cover Classification in SAR Images With Limited Labeled Data92
Commonality Feature Representation Learning for Unsupervised Multimodal Change Detection92
Inverse Image Frequency for Long-Tailed Image Recognition92
Perceptually Weighted Rate Distortion Optimization for Video-Based Point Cloud Compression92
Decoupled Cross-Modal Phrase-Attention Network for Image-Sentence Matching91
Precise Facial Landmark Detection by Reference Heatmap Transformer89
Efficient Semi-Supervised Multimodal Hashing With Importance Differentiation Regression88
Rethinking Sampling Strategies for Unsupervised Person Re-Identification88
Weakly Supervised Semantic Segmentation via Alternate Self-Dual Teaching88
0.23916411399841