IEEE Transactions on Image Processing

Papers
(The median citation count of IEEE Transactions on Image Processing is 7. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-06-01 to 2025-06-01.)
ArticleCitations
Consensus Sparsity: Multi-Context Sparse Image Representation via L -Induced Matrix Variate499
Variational Structured Attention Networks for Deep Visual Representation Learning483
SemiRS-COC: Semi-Supervised Classification for Complex Remote Sensing Scenes With Cross-Object Consistency471
HAda: Hyper-Adaptive Parameter-Efficient Learning for Multi-View ConvNets405
Canonical Correlation Analysis With Low-Rank Learning for Image Representation350
Learning Spectral Cues for Multispectral and Panchromatic Image Fusion319
One-Class Classification Using ℓp-Norm Multiple Kernel Fisher Null Approach313
Multi-Granularity Contrastive Cross-Modal Collaborative Generation for End-to-End Long-Term Video Question Answering277
Toward Projected Clustering With Aggregated Mapping241
A Low-Rank Tensor Decomposition Model With Factors Prior and Total Variation for Impulsive Noise Removal231
Multimodal Unrolled Robust PCA for Background Foreground Separation220
Density-Guided Incremental Dominant Instance Exploration for Two-View Geometric Model Fitting214
A Fast and Efficient Shape Blending by Stable and Analytically Invertible Finite Descriptors194
Self-Supervised Matting-Specific Portrait Enhancement and Generation191
Graph Convolutional Dictionary Selection With L, Norm for Video Summarization180
Harnessing Multi-modal Large Language Models for Measuring and Interpreting Color Differences165
MaCon: A Generic Self-Supervised Framework for Unsupervised Multimodal Change Detection165
GMLight: Lighting Estimation via Geometric Distribution Approximation159
Equivariant Local Reference Frames With Optimization for Robust Non-Rigid Point Cloud Correspondence154
Pose-Appearance Relational Modeling for Video Action Recognition152
Uncertainty-Guided Refinement for Fine-Grained Salient Object Detection145
Cross-Modality Pyramid Alignment for Visual Intention Understanding143
Multi-Constraint Adversarial Networks for Unsupervised Image-to-Image Translation141
Cross-Modal Retrieval With Noisy Correspondence via Consistency Refining and Mining139
Pro2Diff: Proposal Propagation for Multi-Object Tracking via the Diffusion Model139
Attentive WaveBlock: Complementarity-Enhanced Mutual Networks for Unsupervised Domain Adaptation in Person Re-Identification and Beyond137
Multiframe Joint Enhancement for Early Interlaced Videos133
Differentiable SAR Renderer and Image-Based Target Reconstruction129
Contrast-Reconstruction Representation Learning for Self-Supervised Skeleton-Based Action Recognition128
TTVFI: Learning Trajectory-Aware Transformer for Video Frame Interpolation126
Bi-Nuclear Tensor Schatten-p Norm Minimization for Multi-View Subspace Clustering124
Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments121
STPNet: Scale-aware Text Prompt Network for Medical Image Segmentation116
Fine-Grained Recognition With Learnable Semantic Data Augmentation115
Real Image Denoising With a Locally-Adaptive Bitonic Filter110
Discrete Metric Learning for Fast Image Set Classification110
Automatic Quaternion-Domain Color Image Stitching109
Dual Alternating Direction Method of Multipliers for Inverse Imaging106
An Adaptive Multi-Granularity Graph Representation of Image via Granular-ball Computing105
OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free Environments104
FF-LPD: A Real-Time Frame-by-Frame License Plate Detector With Knowledge Distillation and Feature Propagation103
Attention-Guided Neural Networks for Full-Reference and No-Reference Audio-Visual Quality Assessment102
Graph Embedding Contrastive Multi-Modal Representation Learning for Clustering101
Multi-Condition Latent Diffusion Network for Scene-Aware Neural Human Motion Prediction100
Few-Shot Learning With Class-Covariance Metric for Hyperspectral Image Classification100
Generalization Beyond Feature Alignment: Concept Activation-Guided Contrastive Learning100
Boundary-Aware Prototype in Semi-Supervised Medical Image Segmentation99
Addressing Challenges of Incorporating Appearance Cues Into Heuristic Multi-Object Tracker via a Novel Feature Paradigm99
Non-Cascaded and Crosstalk-Free Multi-Image Encryption Based on Optical Scanning Holography Using 2D Orthogonal Compressive Sensing98
SegHSI: Semantic Segmentation of Hyperspectral Images With Limited Labeled Pixels97
Optimization-Inspired Learning With Architecture Augmentations and Control Mechanisms for Low-Level Vision97
Grammar-Induced Wavelet Network for Human Parsing97
Decoupled Cross-Modal Phrase-Attention Network for Image-Sentence Matching96
Point-Based Learnable Query Generator for Human–Object Interaction Detection94
Distractor-Aware Event-Based Tracking92
Stacked Deconvolutional Network for Semantic Segmentation92
Unsupervised Modality-Transferable Video Highlight Detection With Representation Activation Sequence Learning90
IMU-Assisted Online Video Background Identification89
Coarse-to-Fine Contrastive Self-Supervised Feature Learning for Land-Cover Classification in SAR Images With Limited Labeled Data89
Toward Video Anomaly Retrieval From Video Anomaly Detection: New Benchmarks and Model89
Multi-Exposure Image Fusion via Deformable Self-Attention88
Learning Dynamic Prompts for All-in-One Image Restoration87
Cyclic Self-Training With Proposal Weight Modulation for Cross-Supervised Object Detection86
Cross-Domain Diffusion With Progressive Alignment for Efficient Adaptive Retrieval84
NeuralDiffuser: Neuroscience-Inspired Diffusion Guidance for fMRI Visual Reconstruction84
Efficient Semi-Supervised Multimodal Hashing With Importance Differentiation Regression84
Variational Bayes Image Restoration With Compressive Autoencoders83
NR-MVSNet: Learning Multi-View Stereo Based on Normal Consistency and Depth Refinement83
Rethinking Sampling Strategies for Unsupervised Person Re-Identification82
Unsupervised Foggy Scene Understanding via Self Spatial-Temporal Label Diffusion79
Perceptually Weighted Rate Distortion Optimization for Video-Based Point Cloud Compression79
Bidirectional Mapping Coupled GAN for Generalized Zero-Shot Learning78
Hyperspectral Meets Optical Flow: Spectral Flow Extraction for Hyperspectral Image Classification77
BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation76
Interactive Face Video Coding: A Generative Compression Framework75
Multi-Source Unsupervised Domain Adaptation via Pseudo Target Domain75
Video Moment Retrieval With Cross-Modal Neural Architecture Search75
SharpFormer: Learning Local Feature Preserving Global Representations for Image Deblurring73
KSS-ICP: Point Cloud Registration Based on Kendall Shape Space72
Advances in Predictive RAHT for Geometric Point Cloud Compression72
Unsupervised Person Re-Identification With Stochastic Training Strategy71
Fuzzy Sparse Subspace Clustering for Infrared Image Segmentation71
Cross-Attentional Spatio-Temporal Semantic Graph Networks for Video Question Answering71
Commonality Feature Representation Learning for Unsupervised Multimodal Change Detection69
Weakly Supervised Semantic Segmentation via Alternate Self-Dual Teaching69
RSSFormer: Foreground Saliency Enhancement for Remote Sensing Land-Cover Segmentation68
Transition Is a Process: Pair-to-Video Change Detection Networks for Very High Resolution Remote Sensing Images68
FsaNet: Frequency Self-Attention for Semantic Segmentation66
DUT: Learning Video Stabilization by Simply Watching Unstable Videos66
Precise Facial Landmark Detection by Reference Heatmap Transformer65
Weighted Feature Fusion of Convolutional Neural Network and Graph Attention Network for Hyperspectral Image Classification65
Inverse Image Frequency for Long-Tailed Image Recognition65
RobustMat: Neural Diffusion for Street Landmark Patch Matching Under Challenging Environments64
Cross-Layer Contrastive Learning of Latent Semantics for Facial Expression Recognition64
Semi-Supervised Domain Adaptive Structure Learning63
Robust Ellipse Fitting Based on Maximum Correntropy Criterion With Variable Center63
NesTD-Net: Deep NESTA-Inspired Unfolding Network With Dual-Path Deblocking Structure for Image Compressive Sensing61
Characteristic Mapping for Ellipse Detection Acceleration59
AAP-MIT: Attentive Atrous Pyramid Network and Memory Incorporated Transformer for Multisentence Video Description59
Multispectral Snapshot Image Registration Using Learned Cross Spectral Disparity Estimation and a Deep Guided Occlusion Reconstruction Network59
PolarPose: Single-Stage Multi-Person Pose Estimation in Polar Coordinates58
RoMo: Robust Unsupervised Multimodal Learning With Noisy Pseudo Labels57
Compact Representation and Reliable Classification Learning for Point-Level Weakly-Supervised Action Localization57
Double Oracle Neural Architecture Search for Game Theoretic Deep Learning Models57
Fuzzy Sparse Deviation Regularized Robust Principal Component Analysis57
Generalizing to Out-of-Sample Degradations via Model Reprogramming56
Semantic Representation and Attention Alignment for Graph Information Bottleneck in Video Summarization55
Energy-Based Domain Adaptation Without Intermediate Domain Dataset for Foggy Scene Segmentation55
Image Reconstruction for Accelerated MR Scan With Faster Fourier Convolutional Neural Networks55
Joint Local and Nonlocal Progressive Prediction for Versatile Video Coding55
A Discrete-Mapping-Based Cross-Component Prediction Paradigm for Screen Content Coding54
Hierarchical Random Walker Segmentation for Large Volumetric Biomedical Images54
Spatially Consistent Transformer for Colorization in Monochrome-Color Dual-Lens System54
HQ2CL: A High-Quality Class Center Learning System for Deep Face Recognition54
Graph-Based Depth Denoising & Dequantization for Point Cloud Enhancement54
Partition Map Prediction for Fast Block Partitioning in VVC Intra-Frame Coding53
Rethinking Object Saliency Ranking: A Novel Whole-Flow Processing Paradigm53
Mutually Reinforcing Learning of Decoupled Degradation and Diffusion Enhancement for Unpaired Low-Light Image Lightening53
Continual Referring Expression Comprehension via Dual Modular Memorization53
Rich Action-Semantic Consistent Knowledge for Early Action Prediction53
Multi-Scale Fusion and Decomposition Network for Single Image Deraining53
High-Quality and Diverse Few-Shot Image Generation via Masked Discrimination53
Noise Prior Knowledge Informed Bayesian Inference Network for Hyperspectral Super-Resolution53
Shared Manifold Regularized Joint Feature Selection for Joint Classification and Regression in Alzheimer’s Disease Diagnosis53
Implicit-Explicit Integrated Representations for Multi-View Video Compression52
One Sketch for All: One-Shot Personalized Sketch Segmentation52
Enhancing Few-Shot Out-of-Distribution Detection With Pre-Trained Model Features52
MaskFaceGAN: High-Resolution Face Editing With Masked GAN Latent Code Optimization52
Cross-Modal Causal Representation Learning for Radiology Report Generation52
Fast Learning Radiance Fields by Shooting Much Fewer Rays51
Fine-Grained Spatio-Temporal Parsing Network for Action Quality Assessment50
Joint Denoising-Demosaicking Network for Long-Wave Infrared Division-of-Focal-Plane Polarization Images With Mixed Noise Level Estimation50
DMRA: Depth-Induced Multi-Scale Recurrent Attention Network for RGB-D Saliency Detection50
A Real-Time Memory Updating Strategy for Unsupervised Person Re-Identification50
Hierarchical Superpixel Segmentation by Parallel CRTrees Labeling49
Toward Robust and Unconstrained Full Range of Rotation Head Pose Estimation49
Exploring the Potential of Pooling Techniques for Universal Image Restoration49
UVaT: Uncertainty Incorporated View-Aware Transformer for Robust Multi-View Classification49
Image-Level Adaptive Adversarial Ranking for Person Re-Identification48
Dynamic Atomic Column Detection in Transmission Electron Microscopy Videos via Ridge Estimation48
Perception-Guided Quality Metric of 3D Point Clouds Using Hybrid Strategy48
Reviewer Summary for Transactions on Image Processing48
Source-Guided Target Feature Reconstruction for Cross-Domain Classification and Detection47
Image Compression Using Stochastic-AFD Based Multisignal Sparse Representation47
Attribute and State Guided Structural Embedding Network for Vehicle Re-Identification47
MA-ST3D: Motion Associated Self-Training for Unsupervised Domain Adaptation on 3D Object Detection47
Arbitrary-Scale Texture Generation From Coarse-Grained Control47
Multi-Person Pose Tracking With Sparse Key-Point Flow Estimation and Hierarchical Graph Distance Minimization46
Data Augmentation Using Bitplane Information Recombination Model46
Toward Scalable and Unified Example-Based Explanation and Outlier Detection46
Degraded Reference Image Quality Assessment46
FABNet: Frequency-Aware Binarized Network for Single Image Super-Resolution45
FOVQA: Blind Foveated Video Quality Assessment45
Deep Underwater Image Quality Assessment With Explicit Degradation Awareness Embedding45
Bi-Directional Pseudo-Three-Dimensional Network for Video Frame Interpolation45
Hyperspectral Image Classification via Cascaded Spatial Cross-Attention Network45
Bayesian Nonnegative Tensor Completion With Automatic Rank Determination45
Sampling Agnostic Feature Representation for Long-Term Person Re-Identification45
BPMTrack: Multi-Object Tracking With Detection Box Application Pattern Mining45
PFONet: A Progressive Feedback Optimization Network for Lightweight Single Image Dehazing45
Multi-Label Auroral Image Classification Based on CNN and Transformer45
Rotational Convolution: Rethinking Convolution for Downside Fisheye Images44
Advancing Video Anomaly Detection: A Bi-Directional Hybrid Framework for Enhanced Single- and Multi-Task Approaches44
Learning Transferable Conceptual Prototypes for Interpretable Unsupervised Domain Adaptation44
Zero-Shot Camouflaged Object Detection44
U-N2C: A Dual Memory-Guided Disentanglement Framework for Unsupervised System Matrix Denoising in Magnetic Particle Imaging44
Deep Ranking Exemplar-Based Dynamic Scene Deblurring43
DVMark: A Deep Multiscale Framework for Video Watermarking43
PointFormer: Keypoint-Guided Transformer for Simultaneous Nuclei Segmentation and Classification in Multi-Tissue Histology Images43
MetaAge: Meta-Learning Personalized Age Estimators43
SIR: Self-Supervised Image Rectification via Seeing the Same Scene From Multiple Different Lenses43
Model-Induced Generalization Error Bound for Information-Theoretic Representation Learning in Source-Data-Free Unsupervised Domain Adaptation43
Boosting Monocular 3D Human Pose Estimation With Part Aware Attention43
A New Non-Linear Hyperbolic-Parabolic Coupled PDE Model for Image Despeckling42
SDSFusion: A Semantic-Aware Infrared and Visible Image Fusion Network for Degraded Scenes42
CartoonLossGAN: Learning Surface and Coloring of Images for Cartoonization42
HyperE2VID: Improving Event-Based Video Reconstruction via Hypernetworks42
PVPUFormer: Probabilistic Visual Prompt Unified Transformer for Interactive Image Segmentation42
Learning Domain Invariant Representations for Generalizable Person Re-Identification41
Learned Image Compression With Gaussian-Laplacian-Logistic Mixture Model and Concatenated Residual Modules41
DO-Conv: Depthwise Over-Parameterized Convolutional Layer41
MBFQuant: A Multiplier-Bitwidth-Fixed, Mixed-Precision Quantization Method for Mobile CNN-Based Applications41
Few-Shot Domain Adaptation via Mixup Optimal Transport41
Hierarchical Hashing Learning for Image Set Classification41
Sensitivity Decouple Learning for Image Compression Artifacts Reduction41
Interpretable Neural Networks for Video Separation: Deep Unfolding RPCA With Foreground Masking40
Dynamic Slimmable Denoising Network40
U-Shape Transformer for Underwater Image Enhancement40
Ingredient-Guided Region Discovery and Relationship Modeling for Food Category-Ingredient Prediction40
TTST: A Top-k Token Selective Transformer for Remote Sensing Image Super-Resolution40
Underwater Image Enhancement With Hyper-Laplacian Reflectance Priors40
Multistage Spatio-Temporal Networks for Robust Sketch Recognition40
View-Wise Versus Cluster-Wise Weight: Which Is Better for Multi-View Clustering?40
SSL++: Improving Self-Supervised Learning by Mitigating the Proxy Task-Specificity Problem40
Linearly Transformed Color Guide for Low-Bitrate Diffusion-Based Image Compression39
Underwater Image Enhancement via Minimal Color Loss and Locally Adaptive Contrast Enhancement39
Explicitly-Decoupled Text Transfer With Minimized Background Reconstruction for Scene Text Editing39
Cross-Modal Contrastive Learning Network for Few-Shot Action Recognition39
YOLOH: You Only Look One Hourglass for Real-Time Object Detection39
IEEE Transactions on Image Processing publication information39
Coupled Splines for Sparse Curve Fitting38
Versatile Denoising-Based Approximate Message Passing for Compressive Sensing38
Leveraging Frequency Analysis for Image Denoising Network Pruning38
Spatio-Temporal Correlation Guided Geometric Partitioning for Versatile Video Coding38
Hyperpixels: Flexible 4D Over-Segmentation for Dense and Sparse Light Fields38
Dual Mixture Model Based CNN for Image Denoising37
Accurate 3D Measurement of Complex Texture Objects by Height Compensation Using a Dual-Projector Structure37
Multimodal Composition Example Mining for Composed Query Image Retrieval37
Siamese-DETR for Generic Multi-Object Tracking37
Learning Structure Aware Deep Spectral Embedding37
BVI-VFI: A Video Quality Database for Video Frame Interpolation37
Scale-Aware Crowd Counting Network With Annotation Error Modeling37
DisAVR: Disentangled Adaptive Visual Reasoning Network for Diagram Question Answering37
Conditional Feature Learning Based Transformer for Text-Based Person Search36
An Efficient Transformer Based on Global and Local Self-Attention for Face Photo-Sketch Synthesis36
View-Consistency Learning for Incomplete Multiview Clustering36
ECEA: Extensible Co-Existing Attention for Few-Shot Object Detection36
State-Aware Compositional Learning Toward Unbiased Training for Scene Graph Generation36
Multi-Branch and Progressive Network for Low-Light Image Enhancement35
Toward Transparent Deep Image Aesthetics Assessment With Tag-Based Content Descriptors35
Designing an Illumination-Aware Network for Deep Image Relighting35
Lightweight Deep Neural Networks for Ship Target Detection in SAR Imagery35
Learning to Compare Relation: Semantic Alignment for Few-Shot Learning35
Magi-Net: Meta Negative Network for Early Activity Prediction35
Hierarchical Prior-Based Super Resolution for Point Cloud Geometry Compression35
Improving Transferability of Universal Adversarial Perturbation With Feature Disruption35
Cluster-Guided Asymmetric Contrastive Learning for Unsupervised Person Re-Identification35
Pseudo-Labeling Based Practical Semi-Supervised Meta-Training for Few-Shot Learning35
Neuromorphic Synergy for Video Binarization34
Anomaly Detection for Medical Images Using Heterogeneous Auto-Encoder34
Deepfake Forensics via an Adversarial Game34
Self-Supervised Learning of Perceptually Optimized Block Motion Estimates for Video Compression34
Image Copy-Move Forgery Detection via Deep PatchMatch and Pairwise Ranking Learning34
Multi-Modal Remote Sensing Image Matching Considering Co-Occurrence Filter34
Adaptive Betweenness Clustering for Semi-Supervised Domain Adaptation34
Disparity-Aware Reference Frame Generation Network for Multiview Video Coding34
Interactive Learning of Intrinsic and Extrinsic Properties for All-Day Semantic Segmentation34
DREAM-PCD: Deep Reconstruction and Enhancement of mmWave Radar Pointcloud34
HDR or SDR? A Subjective and Objective Study of Scaled and Compressed Videos33
Error Model and Concise Temporal Network for Indirect Illumination in 3D Reconstruction33
Event-Based Optical Flow via Transforming Into Motion-Dependent View33
Learning Common Semantics via Optimal Transport for Contrastive Multi-View Clustering33
Alignment Relation is What You Need for Diagram Parsing33
Latitude-Redundancy-Aware All-Zero Block Detection for Fast 360-Degree Video Coding33
X-View: Non-Egocentric Multi-View 3D Object Detector32
Joint Under-Sampling Pattern and Dual-Domain Reconstruction for Accelerating Multi-Contrast MRI32
Improving Inconspicuous Attributes Modeling for Person Search by Language32
Diffusion Models as Strong Adversaries32
Nowhere to Disguise: Spot Camouflaged Objects via Saliency Attribute Transfer32
Exploration of Learned Lifting-Based Transform Structures for Fully Scalable and Accessible Wavelet-Like Image Compression32
0.090322971343994