IEEE Transactions on Image Processing

Papers
(The H4-Index of IEEE Transactions on Image Processing is 75. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-06-01 to 2025-06-01.)
ArticleCitations
Consensus Sparsity: Multi-Context Sparse Image Representation via L -Induced Matrix Variate499
Variational Structured Attention Networks for Deep Visual Representation Learning483
SemiRS-COC: Semi-Supervised Classification for Complex Remote Sensing Scenes With Cross-Object Consistency471
HAda: Hyper-Adaptive Parameter-Efficient Learning for Multi-View ConvNets405
Canonical Correlation Analysis With Low-Rank Learning for Image Representation350
Learning Spectral Cues for Multispectral and Panchromatic Image Fusion319
One-Class Classification Using ℓp-Norm Multiple Kernel Fisher Null Approach313
Multi-Granularity Contrastive Cross-Modal Collaborative Generation for End-to-End Long-Term Video Question Answering277
Toward Projected Clustering With Aggregated Mapping241
A Low-Rank Tensor Decomposition Model With Factors Prior and Total Variation for Impulsive Noise Removal231
Multimodal Unrolled Robust PCA for Background Foreground Separation220
Density-Guided Incremental Dominant Instance Exploration for Two-View Geometric Model Fitting214
A Fast and Efficient Shape Blending by Stable and Analytically Invertible Finite Descriptors194
Self-Supervised Matting-Specific Portrait Enhancement and Generation191
Graph Convolutional Dictionary Selection With L, Norm for Video Summarization180
Harnessing Multi-modal Large Language Models for Measuring and Interpreting Color Differences165
MaCon: A Generic Self-Supervised Framework for Unsupervised Multimodal Change Detection165
GMLight: Lighting Estimation via Geometric Distribution Approximation159
Equivariant Local Reference Frames With Optimization for Robust Non-Rigid Point Cloud Correspondence154
Pose-Appearance Relational Modeling for Video Action Recognition152
Uncertainty-Guided Refinement for Fine-Grained Salient Object Detection145
Cross-Modality Pyramid Alignment for Visual Intention Understanding143
Multi-Constraint Adversarial Networks for Unsupervised Image-to-Image Translation141
Pro2Diff: Proposal Propagation for Multi-Object Tracking via the Diffusion Model139
Cross-Modal Retrieval With Noisy Correspondence via Consistency Refining and Mining139
Attentive WaveBlock: Complementarity-Enhanced Mutual Networks for Unsupervised Domain Adaptation in Person Re-Identification and Beyond137
Multiframe Joint Enhancement for Early Interlaced Videos133
Differentiable SAR Renderer and Image-Based Target Reconstruction129
Contrast-Reconstruction Representation Learning for Self-Supervised Skeleton-Based Action Recognition128
TTVFI: Learning Trajectory-Aware Transformer for Video Frame Interpolation126
Bi-Nuclear Tensor Schatten-p Norm Minimization for Multi-View Subspace Clustering124
Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments121
STPNet: Scale-aware Text Prompt Network for Medical Image Segmentation116
Fine-Grained Recognition With Learnable Semantic Data Augmentation115
Discrete Metric Learning for Fast Image Set Classification110
Real Image Denoising With a Locally-Adaptive Bitonic Filter110
Automatic Quaternion-Domain Color Image Stitching109
Dual Alternating Direction Method of Multipliers for Inverse Imaging106
An Adaptive Multi-Granularity Graph Representation of Image via Granular-ball Computing105
OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free Environments104
FF-LPD: A Real-Time Frame-by-Frame License Plate Detector With Knowledge Distillation and Feature Propagation103
Attention-Guided Neural Networks for Full-Reference and No-Reference Audio-Visual Quality Assessment102
Graph Embedding Contrastive Multi-Modal Representation Learning for Clustering101
Few-Shot Learning With Class-Covariance Metric for Hyperspectral Image Classification100
Generalization Beyond Feature Alignment: Concept Activation-Guided Contrastive Learning100
Multi-Condition Latent Diffusion Network for Scene-Aware Neural Human Motion Prediction100
Boundary-Aware Prototype in Semi-Supervised Medical Image Segmentation99
Addressing Challenges of Incorporating Appearance Cues Into Heuristic Multi-Object Tracker via a Novel Feature Paradigm99
Non-Cascaded and Crosstalk-Free Multi-Image Encryption Based on Optical Scanning Holography Using 2D Orthogonal Compressive Sensing98
Optimization-Inspired Learning With Architecture Augmentations and Control Mechanisms for Low-Level Vision97
Grammar-Induced Wavelet Network for Human Parsing97
SegHSI: Semantic Segmentation of Hyperspectral Images With Limited Labeled Pixels97
Decoupled Cross-Modal Phrase-Attention Network for Image-Sentence Matching96
Point-Based Learnable Query Generator for Human–Object Interaction Detection94
Stacked Deconvolutional Network for Semantic Segmentation92
Distractor-Aware Event-Based Tracking92
Unsupervised Modality-Transferable Video Highlight Detection With Representation Activation Sequence Learning90
Coarse-to-Fine Contrastive Self-Supervised Feature Learning for Land-Cover Classification in SAR Images With Limited Labeled Data89
Toward Video Anomaly Retrieval From Video Anomaly Detection: New Benchmarks and Model89
IMU-Assisted Online Video Background Identification89
Multi-Exposure Image Fusion via Deformable Self-Attention88
Learning Dynamic Prompts for All-in-One Image Restoration87
Cyclic Self-Training With Proposal Weight Modulation for Cross-Supervised Object Detection86
NeuralDiffuser: Neuroscience-Inspired Diffusion Guidance for fMRI Visual Reconstruction84
Efficient Semi-Supervised Multimodal Hashing With Importance Differentiation Regression84
Cross-Domain Diffusion With Progressive Alignment for Efficient Adaptive Retrieval84
NR-MVSNet: Learning Multi-View Stereo Based on Normal Consistency and Depth Refinement83
Variational Bayes Image Restoration With Compressive Autoencoders83
Rethinking Sampling Strategies for Unsupervised Person Re-Identification82
Perceptually Weighted Rate Distortion Optimization for Video-Based Point Cloud Compression79
Unsupervised Foggy Scene Understanding via Self Spatial-Temporal Label Diffusion79
Bidirectional Mapping Coupled GAN for Generalized Zero-Shot Learning78
Hyperspectral Meets Optical Flow: Spectral Flow Extraction for Hyperspectral Image Classification77
BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation76
Multi-Source Unsupervised Domain Adaptation via Pseudo Target Domain75
Video Moment Retrieval With Cross-Modal Neural Architecture Search75
Interactive Face Video Coding: A Generative Compression Framework75
0.30979681015015