IEEE Transactions on Image Processing

Papers
(The H4-Index of IEEE Transactions on Image Processing is 81. The table below lists the papers that meet or exceed that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2021-09-01 to 2025-09-01.)
Article	Citations
Consensus Sparsity: Multi-Context Sparse Image Representation via L -Induced Matrix Variate	617
HAda: Hyper-Adaptive Parameter-Efficient Learning for Multi-View ConvNets	580
Canonical Correlation Analysis With Low-Rank Learning for Image Representation	540
One-Class Classification Using ℓp-Norm Multiple Kernel Fisher Null Approach	495
Multi-Granularity Contrastive Cross-Modal Collaborative Generation for End-to-End Long-Term Video Question Answering	402
A Fast and Efficient Shape Blending by Stable and Analytically Invertible Finite Descriptors	401
Pro2Diff: Proposal Propagation for Multi-Object Tracking via the Diffusion Model	375
Automatic Quaternion-Domain Color Image Stitching	331
Spatial Frequency Modulation Network for Efficient Image Dehazing	314
Toward Projected Clustering With Aggregated Mapping	308
Self-Supervised Matting-Specific Portrait Enhancement and Generation	262
MaCon: A Generic Self-Supervised Framework for Unsupervised Multimodal Change Detection	254
Graph Convolutional Dictionary Selection With L, Norm for Video Summarization	239
Equivariant Local Reference Frames With Optimization for Robust Non-Rigid Point Cloud Correspondence	216
Multi-Constraint Adversarial Networks for Unsupervised Image-to-Image Translation	210
Bi-Nuclear Tensor Schatten-p Norm Minimization for Multi-View Subspace Clustering	191
Harnessing Multi-modal Large Language Models for Measuring and Interpreting Color Differences	187
Density-Guided Incremental Dominant Instance Exploration for Two-View Geometric Model Fitting	180
STPNet: Scale-Aware Text Prompt Network for Medical Image Segmentation	178
FF-LPD: A Real-Time Frame-by-Frame License Plate Detector With Knowledge Distillation and Feature Propagation	177
OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free Environments	175
Multimodal Unrolled Robust PCA for Background Foreground Separation	175
GMLight: Lighting Estimation via Geometric Distribution Approximation	166
A Low-Rank Tensor Decomposition Model With Factors Prior and Total Variation for Impulsive Noise Removal	165
An Adaptive Multi-Granularity Graph Representation of Image via Granular-ball Computing	164
Variational Structured Attention Networks for Deep Visual Representation Learning	162
Cross-Modal Retrieval With Noisy Correspondence via Consistency Refining and Mining	157
Discrete Metric Learning for Fast Image Set Classification	154
Graph Embedding Contrastive Multi-Modal Representation Learning for Clustering	151
Attentive WaveBlock: Complementarity-Enhanced Mutual Networks for Unsupervised Domain Adaptation in Person Re-Identification and Beyond	150
Multiframe Joint Enhancement for Early Interlaced Videos	150
Dual Alternating Direction Method of Multipliers for Inverse Imaging	146
Learning Spectral Cues for Multispectral and Panchromatic Image Fusion	133
Uncertainty-Guided Refinement for Fine-Grained Salient Object Detection	132
Real Image Denoising With a Locally-Adaptive Bitonic Filter	128
Differentiable SAR Renderer and Image-Based Target Reconstruction	127
AdaAugment: A Tuning-Free and Adaptive Approach to Enhance Data Augmentation	124
Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments	123
SemiRS-COC: Semi-Supervised Classification for Complex Remote Sensing Scenes With Cross-Object Consistency	123
Attention-Guided Neural Networks for Full-Reference and No-Reference Audio-Visual Quality Assessment	121
Color Spike Camera Reconstruction via Long Short-Term Temporal Aggregation of Spike Signals	121
Cross-Modality Pyramid Alignment for Visual Intention Understanding	116
Pose-Appearance Relational Modeling for Video Action Recognition	116
TTVFI: Learning Trajectory-Aware Transformer for Video Frame Interpolation	115
Fine-Grained Recognition With Learnable Semantic Data Augmentation	115
Few-Shot Learning With Class-Covariance Metric for Hyperspectral Image Classification	114
Multi-Condition Latent Diffusion Network for Scene-Aware Neural Human Motion Prediction	113
Contrast-Reconstruction Representation Learning for Self-Supervised Skeleton-Based Action Recognition	113
Stacked Deconvolutional Network for Semantic Segmentation	111
Grammar-Induced Wavelet Network for Human Parsing	111
Generalization Beyond Feature Alignment: Concept Activation-Guided Contrastive Learning	111
NR-MVSNet: Learning Multi-View Stereo Based on Normal Consistency and Depth Refinement	110
Bidirectional Mapping Coupled GAN for Generalized Zero-Shot Learning	110
Cross-Layer Contrastive Learning of Latent Semantics for Facial Expression Recognition	110
SharpFormer: Learning Local Feature Preserving Global Representations for Image Deblurring	110
Cross-Domain Diffusion With Progressive Alignment for Efficient Adaptive Retrieval	109
FsaNet: Frequency Self-Attention for Semantic Segmentation	109
Non-Cascaded and Crosstalk-Free Multi-Image Encryption Based on Optical Scanning Holography Using 2D Orthogonal Compressive Sensing	107
Interactive Face Video Coding: A Generative Compression Framework	107
Toward Video Anomaly Retrieval From Video Anomaly Detection: New Benchmarks and Model	106
DUT: Learning Video Stabilization by Simply Watching Unstable Videos	101
Point-Based Learnable Query Generator for Human–Object Interaction Detection	101
Precise Facial Landmark Detection by Reference Heatmap Transformer	98
Efficient Semi-Supervised Multimodal Hashing With Importance Differentiation Regression	98
Cross-Attentional Spatio-Temporal Semantic Graph Networks for Video Question Answering	97
Optimization-Inspired Learning With Architecture Augmentations and Control Mechanisms for Low-Level Vision	96
Fast 3D Room Layout Estimation Based on Compact High-Level Representation	96
Unsupervised Foggy Scene Understanding via Self Spatial-Temporal Label Diffusion	96
Transition Is a Process: Pair-to-Video Change Detection Networks for Very High Resolution Remote Sensing Images	95
SegHSI: Semantic Segmentation of Hyperspectral Images With Limited Labeled Pixels	95
Boundary-Aware Prototype in Semi-Supervised Medical Image Segmentation	93
Cyclic Self-Training With Proposal Weight Modulation for Cross-Supervised Object Detection	91
NeuralDiffuser: Neuroscience-Inspired Diffusion Guidance for fMRI Visual Reconstruction	90
IMU-Assisted Online Video Background Identification	88
Rethinking Sampling Strategies for Unsupervised Person Re-Identification	87
Unsupervised Modality-Transferable Video Highlight Detection With Representation Activation Sequence Learning	86
Coarse-to-Fine Contrastive Self-Supervised Feature Learning for Land-Cover Classification in SAR Images With Limited Labeled Data	84
Addressing Challenges of Incorporating Appearance Cues Into Heuristic Multi-Object Tracker via a Novel Feature Paradigm	84
Hyperspectral Meets Optical Flow: Spectral Flow Extraction for Hyperspectral Image Classification	83
Inverse Image Frequency for Long-Tailed Image Recognition	83
Variational Bayes Image Restoration With Compressive Autoencoders	81
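
The threshold in the note at the top can be illustrated with a minimal sketch. It assumes the H4-Index is the standard h-index (the largest h such that at least h papers have at least h citations each), computed only over papers from the four-year window. The function name `h_index` and the sample counts are illustrative assumptions, not the site's actual code or data.

```python
def h_index(citations):
    """Return the largest h such that at least h papers have >= h citations.

    Assumption: `citations` already contains only papers from the
    four-year window, so the result corresponds to an H4-style index.
    """
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank  # the top `rank` papers all have at least `rank` citations
        else:
            break     # counts only decrease from here, so h cannot grow
    return h


# Hypothetical citation counts, chosen only to show the calculation.
sample_counts = [10, 8, 5, 4, 3]
print(h_index(sample_counts))
```

On the table above, the same rule is consistent with the stated value: it lists 81 papers and the last one has 81 citations.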