IEEE Transactions on Image Processing

Papers
(The TQCC (top quartile citation count) of IEEE Transactions on Image Processing is 22. The table below lists the papers above that threshold, based on CrossRef citation counts [max. 250 papers]. Only papers published in the past four years, i.e., from 2021-12-01 to 2025-12-01, are covered.)
Article | Citations
Consensus Sparsity: Multi-Context Sparse Image Representation via L∞-Induced Matrix Variate | 717
Pro2Diff: Proposal Propagation for Multi-Object Tracking via the Diffusion Model | 700
Harnessing Multi-modal Large Language Models for Measuring and Interpreting Color Differences | 619
Dual Alternating Direction Method of Multipliers for Inverse Imaging | 615
An Explanation Method Based on Interpretable Linear Model With Four Key Characteristics | 502
Multiframe Joint Enhancement for Early Interlaced Videos | 456
An Adaptive Multi-Granularity Graph Representation of Image via Granular-ball Computing | 443
Graph Convolutional Dictionary Selection With L2,p Norm for Video Summarization | 409
Toward Efficient Test Time Adaptation With Hierarchical Distribution Alignment | 402
Cross-Domain Few-Shot Medical Image Segmentation via Dynamic Semantic Matching | 376
A Fast and Efficient Shape Blending by Stable and Analytically Invertible Finite Descriptors | 301
Variational Structured Attention Networks for Deep Visual Representation Learning | 293
Equivariant Local Reference Frames With Optimization for Robust Non-Rigid Point Cloud Correspondence | 255
Self-Supervised Matting-Specific Portrait Enhancement and Generation | 246
Color Spike Camera Reconstruction via Long Short-Term Temporal Aggregation of Spike Signals | 244
Canonical Correlation Analysis With Low-Rank Learning for Image Representation | 229
Learning Spectral Cues for Multispectral and Panchromatic Image Fusion | 210
Multi-Constraint Adversarial Networks for Unsupervised Image-to-Image Translation | 209
A Low-Rank Tensor Decomposition Model With Factors Prior and Total Variation for Impulsive Noise Removal | 209
SemiRS-COC: Semi-Supervised Classification for Complex Remote Sensing Scenes With Cross-Object Consistency | 205
Spatial Frequency Modulation Network for Efficient Image Dehazing | 200
Pose-Appearance Relational Modeling for Video Action Recognition | 195
MaCon: A Generic Self-Supervised Framework for Unsupervised Multimodal Change Detection | 194
AdaAugment: A Tuning-Free and Adaptive Approach to Enhance Data Augmentation | 188
HAda: Hyper-Adaptive Parameter-Efficient Learning for Multi-View ConvNets | 188
Discrete Metric Learning for Fast Image Set Classification | 187
One-Class Classification Using ℓp-Norm Multiple Kernel Fisher Null Approach | 186
TTVFI: Learning Trajectory-Aware Transformer for Video Frame Interpolation | 175
FF-LPD: A Real-Time Frame-by-Frame License Plate Detector With Knowledge Distillation and Feature Propagation | 174
OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free Environments | 173
Automatic Quaternion-Domain Color Image Stitching | 169
Fine-Grained Recognition With Learnable Semantic Data Augmentation | 168
Cross-Modality Pyramid Alignment for Visual Intention Understanding | 163
STPNet: Scale-Aware Text Prompt Network for Medical Image Segmentation | 155
Multimodal Unrolled Robust PCA for Background Foreground Separation | 154
Differentiable SAR Renderer and Image-Based Target Reconstruction | 148
Attentive WaveBlock: Complementarity-Enhanced Mutual Networks for Unsupervised Domain Adaptation in Person Re-Identification and Beyond | 147
Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments | 145
Real Image Denoising With a Locally-Adaptive Bitonic Filter | 138
Contrast-Reconstruction Representation Learning for Self-Supervised Skeleton-Based Action Recognition | 137
Uncertainty-Guided Refinement for Fine-Grained Salient Object Detection | 136
Multi-Granularity Contrastive Cross-Modal Collaborative Generation for End-to-End Long-Term Video Question Answering | 134
LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models | 132
Cross-Modal Retrieval With Noisy Correspondence via Consistency Refining and Mining | 132
Density-Guided Incremental Dominant Instance Exploration for Two-View Geometric Model Fitting | 131
Few-Shot Learning With Class-Covariance Metric for Hyperspectral Image Classification | 131
Graph Embedding Contrastive Multi-Modal Representation Learning for Clustering | 129
GMLight: Lighting Estimation via Geometric Distribution Approximation | 129
Toward Projected Clustering With Aggregated Mapping | 128
Attention-Guided Neural Networks for Full-Reference and No-Reference Audio-Visual Quality Assessment | 127
Bi-Nuclear Tensor Schatten-p Norm Minimization for Multi-View Subspace Clustering | 127
Advances in Predictive RAHT for Geometric Point Cloud Compression | 125
Interactive Face Video Coding: A Generative Compression Framework | 124
NeuralDiffuser: Neuroscience-Inspired Diffusion Guidance for fMRI Visual Reconstruction | 124
Generalization Beyond Feature Alignment: Concept Activation-Guided Contrastive Learning | 124
Multi-Condition Latent Diffusion Network for Scene-Aware Neural Human Motion Prediction | 123
Unsupervised Modality-Transferable Video Highlight Detection With Representation Activation Sequence Learning | 122
IMU-Assisted Online Video Background Identification | 120
Distractor-Aware Event-Based Tracking | 118
Bidirectional Mapping Coupled GAN for Generalized Zero-Shot Learning | 117
Motion and Appearance Decoupling Representation for Event Cameras | 117
SRS: Siamese Reconstruction-Segmentation Network Based on Dynamic-Parameter Convolution | 112
Cyclic Self-Training With Proposal Weight Modulation for Cross-Supervised Object Detection | 110
Learning Dynamic Prompts for All-in-One Image Restoration | 109
ScaleNet: Scaling up Pretrained Neural Networks With Incremental Parameters | 108
Coarse-to-Fine Contrastive Self-Supervised Feature Learning for Land-Cover Classification in SAR Images With Limited Labeled Data | 108
Addressing Challenges of Incorporating Appearance Cues Into Heuristic Multi-Object Tracker via a Novel Feature Paradigm | 108
Unsupervised Person Re-Identification With Stochastic Training Strategy | 107
Multi-Source Unsupervised Domain Adaptation via Pseudo Target Domain | 103
Unsupervised Foggy Scene Understanding via Self Spatial-Temporal Label Diffusion | 103
Commonality Feature Representation Learning for Unsupervised Multimodal Change Detection | 102
Hyperspectral Meets Optical Flow: Spectral Flow Extraction for Hyperspectral Image Classification | 101
Rethinking Sampling Strategies for Unsupervised Person Re-Identification | 100
Perceptually Weighted Rate Distortion Optimization for Video-Based Point Cloud Compression | 99
Grammar-Induced Wavelet Network for Human Parsing | 98
DUT: Learning Video Stabilization by Simply Watching Unstable Videos | 95
Boundary-Aware Prototype in Semi-Supervised Medical Image Segmentation | 94
SharpFormer: Learning Local Feature Preserving Global Representations for Image Deblurring | 92
Decoupled Cross-Modal Phrase-Attention Network for Image-Sentence Matching | 92
FsaNet: Frequency Self-Attention for Semantic Segmentation | 91
Multi-Exposure Image Fusion via Deformable Self-Attention | 90
Stacked Deconvolutional Network for Semantic Segmentation | 90
Cross-Attentional Spatio-Temporal Semantic Graph Networks for Video Question Answering | 89
Point-Based Learnable Query Generator for Human–Object Interaction Detection | 89
SegHSI: Semantic Segmentation of Hyperspectral Images With Limited Labeled Pixels | 89
Fast 3D Room Layout Estimation Based on Compact High-Level Representation | 88
Fuzzy Sparse Subspace Clustering for Infrared Image Segmentation | 86
Non-Cascaded and Crosstalk-Free Multi-Image Encryption Based on Optical Scanning Holography Using 2D Orthogonal Compressive Sensing | 85
RSSFormer: Foreground Saliency Enhancement for Remote Sensing Land-Cover Segmentation | 84
Cross-Layer Contrastive Learning of Latent Semantics for Facial Expression Recognition | 84
Weakly Supervised Semantic Segmentation via Alternate Self-Dual Teaching | 84
Precise Facial Landmark Detection by Reference Heatmap Transformer | 83
Optimization-Inspired Learning With Architecture Augmentations and Control Mechanisms for Low-Level Vision | 81
Inverse Image Frequency for Long-Tailed Image Recognition | 81
Variational Bayes Image Restoration With Compressive Autoencoders | 81
KSS-ICP: Point Cloud Registration Based on Kendall Shape Space | 81
Transition Is a Process: Pair-to-Video Change Detection Networks for Very High Resolution Remote Sensing Images | 80
Efficient Semi-Supervised Multimodal Hashing With Importance Differentiation Regression | 80
Video Moment Retrieval With Cross-Modal Neural Architecture Search | 79
NR-MVSNet: Learning Multi-View Stereo Based on Normal Consistency and Depth Refinement | 79
Toward Video Anomaly Retrieval From Video Anomaly Detection: New Benchmarks and Model | 79
Cross-Domain Diffusion With Progressive Alignment for Efficient Adaptive Retrieval | 79
Weighted Feature Fusion of Convolutional Neural Network and Graph Attention Network for Hyperspectral Image Classification | 78
BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation | 78
Joint Local and Nonlocal Progressive Prediction for Versatile Video Coding | 77
HQ2CL: A High-Quality Class Center Learning System for Deep Face Recognition | 77
Multispectral Snapshot Image Registration Using Learned Cross Spectral Disparity Estimation and a Deep Guided Occlusion Reconstruction Network | 77
RobustMat: Neural Diffusion for Street Landmark Patch Matching Under Challenging Environments | 76
Learned Spherical Image Compression With Spherical Convolution-Self-Attention and Transformer Context Model | 76
UVaT: Uncertainty Incorporated View-Aware Transformer for Robust Multi-View Classification | 75
Characteristic Mapping for Ellipse Detection Acceleration | 75
HOPE: Enhanced Position Image Priors via High-Order Implicit Representations | 74
Fuzzy Sparse Deviation Regularized Robust Principal Component Analysis | 74
A Discrete-Mapping-Based Cross-Component Prediction Paradigm for Screen Content Coding | 74
MaskFaceGAN: High-Resolution Face Editing With Masked GAN Latent Code Optimization | 73
Compact Representation and Reliable Classification Learning for Point-Level Weakly-Supervised Action Localization | 73
Graph-Based Depth Denoising & Dequantization for Point Cloud Enhancement | 73
Energy-Based Domain Adaptation Without Intermediate Domain Dataset for Foggy Scene Segmentation | 72
PolarPose: Single-Stage Multi-Person Pose Estimation in Polar Coordinates | 72
Continual Referring Expression Comprehension via Dual Modular Memorization | 70
Double Oracle Neural Architecture Search for Game Theoretic Deep Learning Models | 69
Spatially Consistent Transformer for Colorization in Monochrome-Color Dual-Lens System | 69
RoMo: Robust Unsupervised Multimodal Learning With Noisy Pseudo Labels | 69
Mutually Reinforcing Learning of Decoupled Degradation and Diffusion Enhancement for Unpaired Low-Light Image Lightening | 68
Generalizing to Out-of-Sample Degradations via Model Reprogramming | 68
Causal Inference Hashing for Long-Tailed Image Retrieval | 68
Robust Ellipse Fitting Based on Maximum Correntropy Criterion With Variable Center | 68
Hierarchical Random Walker Segmentation for Large Volumetric Biomedical Images | 67
Implicit-Explicit Integrated Representations for Multi-View Video Compression | 66
Rich Action-Semantic Consistent Knowledge for Early Action Prediction | 66
High-Quality and Diverse Few-Shot Image Generation via Masked Discrimination | 66
Shared Manifold Regularized Joint Feature Selection for Joint Classification and Regression in Alzheimer’s Disease Diagnosis | 66
Enhancing Few-Shot Out-of-Distribution Detection With Pre-Trained Model Features | 66
Toward Robust and Unconstrained Full Range of Rotation Head Pose Estimation | 65
Multi-Scale Fusion and Decomposition Network for Single Image Deraining | 64
Quality-aware Spatio-temporal Transformer Network for RGBT Tracking | 64
Semi-Supervised Domain Adaptive Structure Learning | 64
A Real-Time Memory Updating Strategy for Unsupervised Person Re-Identification | 63
Reduced Biquaternion Dual-Branch Deraining U-Network via Multi-Attention Mechanism | 63
Partition Map Prediction for Fast Block Partitioning in VVC Intra-Frame Coding | 63
Prompt to Restore, Restore to Prompt: Cyclic Prompting for Universal Adverse Weather Removal | 62
AAP-MIT: Attentive Atrous Pyramid Network and Memory Incorporated Transformer for Multisentence Video Description | 62
Hierarchical Superpixel Segmentation by Parallel CRTrees Labeling | 61
Fine-Grained Spatio-Temporal Parsing Network for Action Quality Assessment | 61
Uncertainty Quantification for Semi-Supervised Object Detection in Remote Sensing Images | 61
Exploring the Potential of Pooling Techniques for Universal Image Restoration | 61
NesTD-Net: Deep NESTA-Inspired Unfolding Network With Dual-Path Deblocking Structure for Image Compressive Sensing | 61
Rethinking Object Saliency Ranking: A Novel Whole-Flow Processing Paradigm | 60
Noise Prior Knowledge Informed Bayesian Inference Network for Hyperspectral Super-Resolution | 60
Image Reconstruction for Accelerated MR Scan With Faster Fourier Convolutional Neural Networks | 60
One Sketch for All: One-Shot Personalized Sketch Segmentation | 59
DMRA: Depth-Induced Multi-Scale Recurrent Attention Network for RGB-D Saliency Detection | 59
Semantic Representation and Attention Alignment for Graph Information Bottleneck in Video Summarization | 59
Joint Denoising-Demosaicking Network for Long-Wave Infrared Division-of-Focal-Plane Polarization Images With Mixed Noise Level Estimation | 59
Fast Learning Radiance Fields by Shooting Much Fewer Rays | 58
Cross-Modal Causal Representation Learning for Radiology Report Generation | 58
A New Non-Linear Hyperbolic-Parabolic Coupled PDE Model for Image Despeckling | 57
FOVQA: Blind Foveated Video Quality Assessment | 57
Arbitrary-Scale Texture Generation From Coarse-Grained Control | 57
Reviewer Summary for Transactions on Image Processing | 57
Image-Level Adaptive Adversarial Ranking for Person Re-Identification | 57
UniEmoX: Cross-Modal Semantic-Guided Large-Scale Pretraining for Universal Scene Emotion Perception | 57
Source-Guided Target Feature Reconstruction for Cross-Domain Classification and Detection | 57
Data Augmentation Using Bitplane Information Recombination Model | 57
CartoonLossGAN: Learning Surface and Coloring of Images for Cartoonization | 57
PVPUFormer: Probabilistic Visual Prompt Unified Transformer for Interactive Image Segmentation | 57
U-N2C: A Dual Memory-Guided Disentanglement Framework for Unsupervised System Matrix Denoising in Magnetic Particle Imaging | 57
Dynamic Atomic Column Detection in Transmission Electron Microscopy Videos via Ridge Estimation | 57
Hyperspectral Image Classification via Cascaded Spatial Cross-Attention Network | 56
Bayesian Nonnegative Tensor Completion With Automatic Rank Determination | 56
MBFQuant: A Multiplier-Bitwidth-Fixed, Mixed-Precision Quantization Method for Mobile CNN-Based Applications | 56
Deep Underwater Image Quality Assessment With Explicit Degradation Awareness Embedding | 56
PFONet: A Progressive Feedback Optimization Network for Lightweight Single Image Dehazing | 55
Contrastive Conditional Latent Diffusion for Audio-Visual Segmentation | 55
Toward Scalable and Unified Example-Based Explanation and Outlier Detection | 55
SSL++: Improving Self-Supervised Learning by Mitigating the Proxy Task-Specificity Problem | 55
HyperE2VID: Improving Event-Based Video Reconstruction via Hypernetworks | 55
Learned Image Compression With Gaussian-Laplacian-Logistic Mixture Model and Concatenated Residual Modules | 55
Image Compression Using Stochastic-AFD Based Multisignal Sparse Representation | 54
Rotational Convolution: Rethinking Convolution for Downside Fisheye Images | 54
Restoration of Images Taken Through a Dirty Window Using Optics-Guided Transformer | 54
Enhancing Text-Based Person Retrieval by Combining Fused Representation and Reciprocal Learning With Adaptive Loss Refinement | 54
Neural Scene Designer: Self-Styled Semantic Image Manipulation | 54
SIR: Self-Supervised Image Rectification via Seeing the Same Scene From Multiple Different Lenses | 54
Model-Induced Generalization Error Bound for Information-Theoretic Representation Learning in Source-Data-Free Unsupervised Domain Adaptation | 53
PCE-GAN: A Generative Adversarial Network for Point Cloud Attribute Quality Enhancement Based on Optimal Transport | 53
SDSFusion: A Semantic-Aware Infrared and Visible Image Fusion Network for Degraded Scenes | 53
Attribute and State Guided Structural Embedding Network for Vehicle Re-Identification | 52
Few-Shot Domain Adaptation via Mixup Optimal Transport | 52
Boosting Monocular 3D Human Pose Estimation With Part Aware Attention | 52
Dynamic Slimmable Denoising Network | 52
CKD: Contrastive Knowledge Distillation From a Sample-Wise Perspective | 52
Deep Ranking Exemplar-Based Dynamic Scene Deblurring | 52
DVMark: A Deep Multiscale Framework for Video Watermarking | 51
Multi-Person Pose Tracking With Sparse Key-Point Flow Estimation and Hierarchical Graph Distance Minimization | 51
PointFormer: Keypoint-Guided Transformer for Simultaneous Nuclei Segmentation and Classification in Multi-Tissue Histology Images | 51
Zero-Shot Camouflaged Object Detection | 51
Ingredient-Guided Region Discovery and Relationship Modeling for Food Category-Ingredient Prediction | 51
Perception-Guided Quality Metric of 3D Point Clouds Using Hybrid Strategy | 51
Rethinking the Low-Light Video Enhancement: Benchmark Datasets and Methods | 51
FABNet: Frequency-Aware Binarized Network for Single Image Super-Resolution | 51
View-Wise Versus Cluster-Wise Weight: Which Is Better for Multi-View Clustering? | 50
MA-ST3D: Motion Associated Self-Training for Unsupervised Domain Adaptation on 3D Object Detection | 50
Advancing Video Anomaly Detection: A Bi-Directional Hybrid Framework for Enhanced Single- and Multi-Task Approaches | 50
Multistage Spatio-Temporal Networks for Robust Sketch Recognition | 50
Bi-Directional Pseudo-Three-Dimensional Network for Video Frame Interpolation | 50
Sensitivity Decouple Learning for Image Compression Artifacts Reduction | 49
Learning Transferable Conceptual Prototypes for Interpretable Unsupervised Domain Adaptation | 49
Learning Domain Invariant Representations for Generalizable Person Re-Identification | 49
TTST: A Top-k Token Selective Transformer for Remote Sensing Image Super-Resolution | 48
U-Shape Transformer for Underwater Image Enhancement | 48
Hierarchical Hashing Learning for Image Set Classification | 48
Underwater Image Enhancement With Hyper-Laplacian Reflectance Priors | 48
Cross-Modal Contrastive Learning Network for Few-Shot Action Recognition | 48
Sampling Agnostic Feature Representation for Long-Term Person Re-Identification | 48
DO-Conv: Depthwise Over-Parameterized Convolutional Layer | 48
BPMTrack: Multi-Object Tracking With Detection Box Application Pattern Mining | 48
Multi-Label Auroral Image Classification Based on CNN and Transformer | 48
Degraded Reference Image Quality Assessment | 47
IEEE Transactions on Image Processing publication information | 47
Spatio-Temporal Correlation Guided Geometric Partitioning for Versatile Video Coding | 47
Leveraging Frequency Analysis for Image Denoising Network Pruning | 47
Underwater Image Enhancement via Minimal Color Loss and Locally Adaptive Contrast Enhancement | 47
Improving Transferability of Universal Adversarial Perturbation With Feature Disruption | 47
Explicitly-Decoupled Text Transfer With Minimized Background Reconstruction for Scene Text Editing | 47
Interpretable Neural Networks for Video Separation: Deep Unfolding RPCA With Foreground Masking | 47
MetaAge: Meta-Learning Personalized Age Estimators | 47
Linearly Transformed Color Guide for Low-Bitrate Diffusion-Based Image Compression | 47
Lightweight Deep Neural Networks for Ship Target Detection in SAR Imagery | 46
Pseudo-Labeling Based Practical Semi-Supervised Meta-Training for Few-Shot Learning | 46
Coupled Splines for Sparse Curve Fitting | 46
BVI-VFI: A Video Quality Database for Video Frame Interpolation | 46
Rethinking Generalized Zero-Shot Learning: A Synthesized Per-Instance Attribute Perspective | 46
DisAVR: Disentangled Adaptive Visual Reasoning Network for Diagram Question Answering | 45
Learning to Compare Relation: Semantic Alignment for Few-Shot Learning | 45
Multi-Branch and Progressive Network for Low-Light Image Enhancement | 45
ROOT: Region-word Alignment with Partial Optimal Transport for Open-vocabulary Object Detection | 45
SketchAging: Face Photo-Sketch Synthesis and Aging With Multi-Scale Feature Extraction | 45
Enhancing Multimodal Learning via Hierarchical Fusion Architecture Search With Inconsistency Mitigation | 44
Scale-Aware Crowd Counting Network With Annotation Error Modeling | 44
Zero-Shot Skeleton-Based Action Recognition With Prototype-Guided Feature Alignment | 44
Decoupling Discriminative Attributes for Few-Shot Fine-Grained Recognition | 43
Dual Mixture Model Based CNN for Image Denoising | 43
Siamese-DETR for Generic Multi-Object Tracking | 43
StreakNet-Arch: An Anti-Scattering Network-Based Architecture for Underwater Carrier LiDAR-Radar Imaging | 43
Hyperpixels: Flexible 4D Over-Segmentation for Dense and Sparse Light Fields | 43
Robust Palmprint Recognition via Multi-Stage Noisy Label Selection and Correction | 43
Toward Transparent Deep Image Aesthetics Assessment With Tag-Based Content Descriptors | 43
Advancing Weakly-Supervised Change Detection in Satellite Images via Adversarial Class Prompting | 43
Action Quality Assessment via Hierarchical Pose-Guided Multi-Stage Contrastive Regression | 43