IEEE Transactions on Image Processing

Papers
(The H4-Index of IEEE Transactions on Image Processing is 79. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-08-01 to 2025-08-01.)
ArticleCitations
Consensus Sparsity: Multi-Context Sparse Image Representation via L -Induced Matrix Variate580
HAda: Hyper-Adaptive Parameter-Efficient Learning for Multi-View ConvNets553
Canonical Correlation Analysis With Low-Rank Learning for Image Representation526
One-Class Classification Using ℓp-Norm Multiple Kernel Fisher Null Approach471
Multi-Granularity Contrastive Cross-Modal Collaborative Generation for End-to-End Long-Term Video Question Answering386
A Fast and Efficient Shape Blending by Stable and Analytically Invertible Finite Descriptors376
Pro2Diff: Proposal Propagation for Multi-Object Tracking via the Diffusion Model360
Automatic Quaternion-Domain Color Image Stitching317
Spatial Frequency Modulation Network for Efficient Image Dehazing276
Toward Projected Clustering With Aggregated Mapping274
Self-Supervised Matting-Specific Portrait Enhancement and Generation248
MaCon: A Generic Self-Supervised Framework for Unsupervised Multimodal Change Detection240
Graph Embedding Contrastive Multi-Modal Representation Learning for Clustering233
Real Image Denoising With a Locally-Adaptive Bitonic Filter206
Graph Convolutional Dictionary Selection With L, Norm for Video Summarization202
Equivariant Local Reference Frames With Optimization for Robust Non-Rigid Point Cloud Correspondence179
Pose-Appearance Relational Modeling for Video Action Recognition178
Uncertainty-Guided Refinement for Fine-Grained Salient Object Detection173
Multi-Constraint Adversarial Networks for Unsupervised Image-to-Image Translation172
Learning Spectral Cues for Multispectral and Panchromatic Image Fusion171
Bi-Nuclear Tensor Schatten-p Norm Minimization for Multi-View Subspace Clustering164
Harnessing Multi-modal Large Language Models for Measuring and Interpreting Color Differences160
STPNet: Scale-Aware Text Prompt Network for Medical Image Segmentation159
Density-Guided Incremental Dominant Instance Exploration for Two-View Geometric Model Fitting159
OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free Environments153
FF-LPD: A Real-Time Frame-by-Frame License Plate Detector With Knowledge Distillation and Feature Propagation153
Multiframe Joint Enhancement for Early Interlaced Videos151
Differentiable SAR Renderer and Image-Based Target Reconstruction147
TTVFI: Learning Trajectory-Aware Transformer for Video Frame Interpolation143
Cross-Modality Pyramid Alignment for Visual Intention Understanding141
An Adaptive Multi-Granularity Graph Representation of Image via Granular-ball Computing141
Variational Structured Attention Networks for Deep Visual Representation Learning137
Dual Alternating Direction Method of Multipliers for Inverse Imaging130
Cross-Modal Retrieval With Noisy Correspondence via Consistency Refining and Mining121
Multimodal Unrolled Robust PCA for Background Foreground Separation121
GMLight: Lighting Estimation via Geometric Distribution Approximation121
Attentive WaveBlock: Complementarity-Enhanced Mutual Networks for Unsupervised Domain Adaptation in Person Re-Identification and Beyond119
Discrete Metric Learning for Fast Image Set Classification119
A Low-Rank Tensor Decomposition Model With Factors Prior and Total Variation for Impulsive Noise Removal118
Fine-Grained Recognition With Learnable Semantic Data Augmentation116
Contrast-Reconstruction Representation Learning for Self-Supervised Skeleton-Based Action Recognition115
Few-Shot Learning With Class-Covariance Metric for Hyperspectral Image Classification112
Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments112
Attention-Guided Neural Networks for Full-Reference and No-Reference Audio-Visual Quality Assessment112
AdaAugment: A Tuning-Free and Adaptive Approach to Enhance Data Augmentation111
Multi-Condition Latent Diffusion Network for Scene-Aware Neural Human Motion Prediction110
SemiRS-COC: Semi-Supervised Classification for Complex Remote Sensing Scenes With Cross-Object Consistency110
Generalization Beyond Feature Alignment: Concept Activation-Guided Contrastive Learning109
Addressing Challenges of Incorporating Appearance Cues Into Heuristic Multi-Object Tracker via a Novel Feature Paradigm108
Grammar-Induced Wavelet Network for Human Parsing107
Optimization-Inspired Learning With Architecture Augmentations and Control Mechanisms for Low-Level Vision107
Distractor-Aware Event-Based Tracking106
Stacked Deconvolutional Network for Semantic Segmentation106
Efficient Semi-Supervised Multimodal Hashing With Importance Differentiation Regression104
Weakly Supervised Semantic Segmentation via Alternate Self-Dual Teaching101
Unsupervised Person Re-Identification With Stochastic Training Strategy100
SegHSI: Semantic Segmentation of Hyperspectral Images With Limited Labeled Pixels99
Precise Facial Landmark Detection by Reference Heatmap Transformer99
Cyclic Self-Training With Proposal Weight Modulation for Cross-Supervised Object Detection98
NeuralDiffuser: Neuroscience-Inspired Diffusion Guidance for fMRI Visual Reconstruction96
IMU-Assisted Online Video Background Identification96
Variational Bayes Image Restoration With Compressive Autoencoders94
Perceptually Weighted Rate Distortion Optimization for Video-Based Point Cloud Compression94
Interactive Face Video Coding: A Generative Compression Framework91
Bidirectional Mapping Coupled GAN for Generalized Zero-Shot Learning91
Unsupervised Foggy Scene Understanding via Self Spatial-Temporal Label Diffusion91
Advances in Predictive RAHT for Geometric Point Cloud Compression91
Learning Dynamic Prompts for All-in-One Image Restoration91
Decoupled Cross-Modal Phrase-Attention Network for Image-Sentence Matching90
Toward Video Anomaly Retrieval From Video Anomaly Detection: New Benchmarks and Model89
Fast 3D Room Layout Estimation Based on Compact High-Level Representation88
Multi-Exposure Image Fusion via Deformable Self-Attention87
Fuzzy Sparse Subspace Clustering for Infrared Image Segmentation87
Cross-Attentional Spatio-Temporal Semantic Graph Networks for Video Question Answering83
Commonality Feature Representation Learning for Unsupervised Multimodal Change Detection82
Boundary-Aware Prototype in Semi-Supervised Medical Image Segmentation81
FsaNet: Frequency Self-Attention for Semantic Segmentation81
Cross-Layer Contrastive Learning of Latent Semantics for Facial Expression Recognition81
Rethinking Sampling Strategies for Unsupervised Person Re-Identification80
0.13148522377014