IEEE Transactions on Image Processing

Papers
(The TQCC of IEEE Transactions on Image Processing is 16. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-04-01 to 2025-04-01.)
ArticleCitations
ADStereo: Efficient Stereo Matching With Adaptive Downsampling and Disparity Alignment441
Reserve to Adapt: Mining Inter-Class Relations for Open-Set Domain Adaptation428
Who, What, and Where: Composite-Semantics Instance Search for Story Videos420
MaCon: A Generic Self-Supervised Framework for Unsupervised Multimodal Change Detection354
Optimal Graph Learning-Based Label Propagation for Cross-Domain Image Classification319
Deep Face Leakage: Inverting High-Quality Faces From Gradients Using Residual Optimization285
MaeFuse: Transferring Omni Features With Pretrained Masked Autoencoders for Infrared and Visible Image Fusion via Guided Training250
MoVis: When 3D Object Detection is Like Human Monocular Vision250
Co-Learning Meets Stitch-Up for Noisy Multi-Label Visual Recognition204
Scalable Face Image Coding via StyleGAN Prior: Toward Compression for Human-Machine Collaborative Vision203
A General Dynamic Knowledge Distillation Method for Visual Analytics190
Consensus Sparsity: Multi-Context Sparse Image Representation via L -Induced Matrix Variate190
Deep Hypersphere Feature Regularization for Weakly Supervised RGB-D Salient Object Detection177
Cross-Modality Pyramid Alignment for Visual Intention Understanding155
Attention-Guided Neural Networks for Full-Reference and No-Reference Audio-Visual Quality Assessment148
6D-ViT: Category-Level 6D Object Pose Estimation via Transformer-Based Instance Representation Learning146
Multiple Information Prompt Learning for Cloth-Changing Person Re-Identification145
Exploiting Latent Properties to Optimize Neural Codecs144
Momentum Contrastive Teacher for Semi-Supervised Skeleton Action Recognition143
Constrained Visual Representation Learning With Bisimulation Metrics for Safe Reinforcement Learning134
IEEE Transactions on Image Processing Publication Information133
Variational Structured Attention Networks for Deep Visual Representation Learning133
Concept-Aware Video Captioning: Describing Videos With Effective Prior Information127
Unsupervised Meta Learning With Multiview Constraints for Hyperspectral Image Small Sample set Classification124
Tensor Cascaded-Rank Minimization in Subspace: A Unified Regime for Hyperspectral Image Low-Level Vision124
Wavelet-Guided Promotion-Suppression Transformer for Surface-Defect Detection122
Temporal Fusion: Continuous-Time Light Field Video Factorization118
Adaptive Bit Selection for Scalable Deep Hashing117
Spiking Neural Networks With Adaptive Membrane Time Constant for Event-Based Tracking113
Harnessing Multi-modal Large Language Models for Measuring and Interpreting Color Differences113
Super-Resolution Phase Retrieval Network for Single-Pattern Structured Light 3D Imaging112
Dual-View Curricular Optimal Transport for Cross-Lingual Cross-Modal Retrieval106
Equivariant Local Reference Frames with Optimization for Robust Non-rigid Point Cloud Correspondence105
Single-Image-Based Deep Learning for Segmentation of Early Esophageal Cancer Lesions105
SemiRS-COC: Semi-Supervised Classification for Complex Remote Sensing Scenes With Cross-Object Consistency102
UMCGL: Universal Multi-View Consensus Graph Learning With Consistency and Diversity100
Multi-Label Adversarial Attack With New Measures and Self-Paced Constraint Weighting99
An Embeddable Implicit IUVD Representation for Part-Based 3D Human Surface Reconstruction98
SWFormer: Stochastic Windows Convolutional Transformer for Hybrid Modality Hyperspectral Classification97
CPI-Parser: Integrating Causal Properties Into Multiple Human Parsing97
Rethinking Noise Sampling in Class-Imbalanced Diffusion Models96
Fast and High-Performance Learned Image Compression With Improved Checkerboard Context Model, Deformable Residual Module, and Knowledge Distillation93
To Boost Zero-Shot Generalization for Embodied Reasoning With Vision-Language Pre-Training92
Segmentation-Free Velocity Field Super-Resolution on 4D Flow MRI91
Multi-Granularity Part Sampling Attention for Fine-Grained Visual Classification91
Exploring Multi-Modal Spatial–Temporal Contexts for High-Performance RGB-T Tracking90
Unfolded Proximal Neural Networks for Robust Image Gaussian Denoising89
Explainability Enhanced Object Detection Transformer With Feature Disentanglement89
CWSCNet: Channel-Weighted Skip Connection Network for Underwater Object Detection87
Learning Weak Semantics by Feature Graph for Attribute-Based Person Search87
Learning-Based Rate Control for Video-Based Point Cloud Compression86
GMLight: Lighting Estimation via Geometric Distribution Approximation85
A Novel Hybrid Level Set Model for Non-Rigid Object Contour Tracking85
Frequency Information Disentanglement Network for Video-Based Person Re-Identification84
Dynamic Neural Network for Lossy-to-Lossless Image Coding83
Prototype Adaption and Projection for Few- and Zero-Shot 3D Point Cloud Semantic Segmentation83
Geometry-Aware Deep Video Deblurring via Recurrent Feature Refinement81
Local Orthogonal Moments for Local Features80
Fine-Grained Video Retrieval With Scene Sketches80
Dual Alternating Direction Method of Multipliers for Inverse Imaging79
JigsawGAN: Auxiliary Learning for Solving Jigsaw Puzzles With Generative Adversarial Networks79
Rebalanced Zero-Shot Learning78
Multi-Constraint Adversarial Networks for Unsupervised Image-to-Image Translation77
Multisubject Task-Related fMRI Data Processing via a Two-Stage Generalized Canonical Correlation Analysis77
Revisiting the Regularizers in Blind Image Deblurring With a New One76
User-Guided Deep Human Image Matting Using Arbitrary Trimaps75
Exploring the Robustness of Human Parsers Toward Common Corruptions73
Tree-Structured Data Clustering-Driven Neural Network for Intra Prediction in Video Coding73
CalibNet: Dual-Branch Cross-Modal Calibration for RGB-D Salient Instance Segmentation72
A Low-Rank Tensor Decomposition Model With Factors Prior and Total Variation for Impulsive Noise Removal72
Sparse Coding Inspired LSTM and Self-Attention Integration for Medical Image Segmentation71
Revisiting Domain-Adaptive Semantic Segmentation via Knowledge Distillation69
A New Cross-Space Total Variation Regularization Model for Color Image Restoration With Quaternion Blur Operator69
Portrait Shadow Removal Using Context-Aware Illumination Restoration Network69
TTVFI: Learning Trajectory-Aware Transformer for Video Frame Interpolation68
Learning Frame-Event Fusion for Motion Deblurring68
MWFormer: Multi-Weather Image Restoration Using Degradation-Aware Transformers68
GSSF: Generalized Structural Sparse Function for Deep Cross-Modal Metric Learning68
Efficient 3D Scene Semantic Segmentation via Active Learning on Rendered 2D Images66
Toward Projected Clustering With Aggregated Mapping66
Feature Preserving Non-Rigid Iterative Weighted Closest Point and Semi-Curvature Registration65
Defending Against Multiple and Unforeseen Adversarial Videos65
Interactive Regression and Classification for Dense Object Detector65
Bi-Nuclear Tensor Schatten-p Norm Minimization for Multi-View Subspace Clustering64
Point Cloud Video Super-Resolution via Partial Point Coupling and Graph Smoothness63
MuTrans: Multiple Transformers for Fusing Feature Pyramid on 2D and 3D Object Detection63
VPU: A Video-Based Point Cloud Upsampling Framework63
HGR-Net: Hierarchical Graph Reasoning Network for Arbitrary Shape Scene Text Detection63
Multimodal Unrolled Robust PCA for Background Foreground Separation61
Helping Visually Impaired People Take Better Quality Pictures61
A Dual-Branch Self-Boosting Framework for Self-Supervised 3D Hand Pose Estimation59
A New Language-Independent Deep CNN for Scene Text Detection and Style Transfer in Social Media Images58
Cross-Modal Graph With Meta Concepts for Video Captioning57
Ingredient Prediction via Context Learning Network With Class-Adaptive Asymmetric Loss57
MetaLabelNet: Learning to Generate Soft-Labels From Noisy-Labels57
BooDet: Gradient Boosting Object Detection With Additive Learning-Based Prediction Aggregation56
Semantic-Aware Modular Capsule Routing for Visual Question Answering56
Learning Feature Channel Weighting for Real-Time Visual Tracking55
A Geodesic Translation Model for Spherical Video Compression54
From Global to Local: Multi-Scale Out-of-Distribution Detection53
Real Image Denoising With a Locally-Adaptive Bitonic Filter51
Enhancing Person Re-Identification Performance Through In Vivo Learning51
Pre-Demosaic Graph-Based Light Field Image Compression51
Video Reenactment as Inductive Bias for Content-Motion Disentanglement51
Pro-UIGAN: Progressive Face Hallucination From Occluded Thumbnails51
FlexHDR: Modeling Alignment and Exposure Uncertainties for Flexible HDR Imaging51
Self-Supervised Matting-Specific Portrait Enhancement and Generation51
Broad Spectrum Image Deblurring via an Adaptive Super-Network50
Density-Guided Incremental Dominant Instance Exploration for Two-View Geometric Model Fitting50
R-PointHop: A Green, Accurate, and Unsupervised Point Cloud Registration Method50
Improving Embedding Generalization in Few-Shot Learning With Instance Neighbor Constraints50
Learning a Prototype Discriminator With RBF for Multimodal Image Synthesis50
Guided Filter Network for Semantic Image Segmentation50
BLPSeg: Balance the Label Preference in Scribble-Supervised Semantic Segmentation50
Tolerating Annotation Displacement in Dense Object Counting via Point Annotation Probability Map49
Deep-Based Film Grain Removal and Synthesis49
Pattern-Based Reconstruction of K-Level Images From Cutsets49
Dynamic Frame Interpolation in Wavelet Domain49
Unsupervised Synthetic Acoustic Image Generation for Audio-Visual Scene Understanding49
EGRC-Net: Embedding-Induced Graph Refinement Clustering Network48
Self-Paced Multi-Grained Cross-Modal Interaction Modeling for Referring Expression Comprehension48
Coarse Mask Guided Interactive Object Segmentation48
Deep Unrolled Low-Rank Tensor Completion for High Dynamic Range Imaging48
Learning a Locally Unified 3D Point Cloud for View Synthesis48
Plug-and-Play Priors for Multi-Shot Compressive Hyperspectral Imaging48
EDDMF: An Efficient Deep Discrepancy Measuring Framework for Full-Reference Light Field Image Quality Assessment47
Distance-Aware Occlusion Detection With Focused Attention47
JNMR: Joint Non-Linear Motion Regression for Video Frame Interpolation47
A Fast and Efficient Shape Blending by Stable and Analytically Invertible Finite Descriptors47
Composition and Style Attributes Guided Image Aesthetic Assessment46
Confusing Image Quality Assessment: Toward Better Augmented Reality Experience46
Multi-Level Content-Aware Boundary Detection for Temporal Action Proposal Generation46
Sampling Equivariant Self-Attention Networks for Object Detection in Aerial Images45
Contrastive Proposal Extension With LSTM Network for Weakly Supervised Object Detection45
Exploring Long- and Short-Range Temporal Information for Learned Video Compression45
Data Acquisition and Preparation for Dual-Reference Deep Learning of Image Super-Resolution45
DO-SA&R: Distant Object Augmented Set Abstraction and Regression for Point-Based 3D Object Detection45
Exploring Intrinsic Discrimination and Consistency for Weakly Supervised Object Localization44
Discrete Metric Learning for Fast Image Set Classification44
Uncertainty Modeling for Gaze Estimation44
Learning Spectral Cues for Multispectral and Panchromatic Image Fusion44
Surface-SOS: Self-Supervised Object Segmentation via Neural Surface Representation43
Anycost Network Quantization for Image Super-Resolution43
Fast Scalable Image Restoration Using Total Variation Priors and Expectation Propagation43
TransVQA: Transferable Vector Quantization Alignment for Unsupervised Domain Adaptation43
A Machine Learning Approach to Design of Aperiodic, Clustered-Dot Halftone Screens via Direct Binary Search43
Spatial-Temporal Pyramid Graph Reasoning for Action Recognition43
Cross-Modal Retrieval With Noisy Correspondence via Consistency Refining and Mining43
MFNet: A Novel GNN-Based Multi-Level Feature Network With Superpixel Priors43
Latent Space Semantic Supervision Based on Knowledge Distillation for Cross-Modal Retrieval43
Automatic Quaternion-Domain Color Image Stitching42
Pro2Diff: Proposal Propagation for Multi-Object Tracking via the Diffusion Model41
Analysis and Benchmarking of Extending Blind Face Image Restoration to Videos41
RSB-Pose: Robust Short-Baseline Binocular 3D Human Pose Estimation With Occlusion Handling41
Cost Volume Aggregation in Stereo Matching Revisited: A Disparity Classification Perspective41
Adapting Vision-Language Models via Learning to Inject Knowledge41
Normalizing Batch Normalization for Long-Tailed Recognition41
AnySR: Realizing Image Super-Resolution as Any-Scale, Any-Resource40
HYRE: Hybrid Regressor for 3D Human Pose and Shape Estimation40
CLIP4STR: A Simple Baseline for Scene Text Recognition With Pre-Trained Vision-Language Model40
HAda: Hyper-Adaptive Parameter-Efficient Learning for Multi-View ConvNets40
Learning Lossless Compression for High Bit-Depth Volumetric Medical Image40
Toward Blind Flare Removal Using Knowledge-Driven Flare-Level Estimator40
MISC: Ultra-Low Bitrate Image Semantic Compression Driven by Large Multimodal Model39
Toward Adversarial Robustness in Unlabeled Target Domains39
Graph Embedding Contrastive Multi-Modal Representation Learning for Clustering39
Spectral Clustering Super-Resolution Imaging Based on Multispectral Camera Array39
One-Class Classification Using ℓp-Norm Multiple Kernel Fisher Null Approach39
Event-Aware Video Deraining via Multi-Patch Progressive Learning39
RGB-Guided Depth Map Recovery by Two-Stage Coarse-to-Fine Dense CRF Models38
Ray-Space Motion Compensation for Lenslet Plenoptic Video Coding38
Learning Shadow Removal From Unpaired Samples via Reciprocal Learning38
Motion-Compensated Predictive RAHT for Dynamic Point Clouds38
Additivity Constrained Linearisation of Camera Calibration Data38
Perspectively Equivariant Keypoint Learning for Omnidirectional Images38
Neural Reference Synthesis for Inter Frame Coding37
Inference-Domain Network Evolution: A New Perspective for One-Shot Multi-Object Tracking37
Progressive Transfer Learning37
Polarization Guided HDR Reconstruction via Pixel-Wise Depolarization37
Exploiting Intra-Slice and Inter-Slice Redundancy for Learning-Based Lossless Volumetric Image Compression37
Delving Into Crispness: Guided Label Refinement for Crisp Edge Detection37
Subjective and Objective Audio-Visual Quality Assessment for User Generated Content37
Scale-Consistent Fusion: From Heterogeneous Local Sampling to Global Immersive Rendering36
MERF: A Practical HDR-Like Image Generator via Mutual-Guided Learning Between Multi-Exposure Registration and Fusion36
Multi-Granularity Contrastive Cross-Modal Collaborative Generation for End-to-End Long-Term Video Question Answering36
Weakly-Supervised RGBD Video Object Segmentation36
A Closer Look at the Joint Training of Object Detection and Re-Identification in Multi-Object Tracking36
Canonical Correlation Analysis With Low-Rank Learning for Image Representation36
NIM-Nets: Noise-Aware Incomplete Multi-View Learning Networks36
DRNet: Double Recalibration Network for Few-Shot Semantic Segmentation36
Saliency Guided Deep Neural Network for Color Transfer With Light Optimization36
Bi-Fusion of Structure and Deformation at Multi-Scale for Joint Segmentation and Registration36
INSURE: An Information Theory iNspired diSentanglement and pURification modEl for Domain Generalization35
Graph Embedding Interclass Relation-Aware Adaptive Network for Cross-Scene Classification of Multisource Remote Sensing Data35
Learning a Deep Demosaicing Network for Spike Camera With Color Filter Array35
Learning to Discover Knowledge: A Weakly-Supervised Partial Domain Adaptation Approach35
Learning to Generate Parameters of ConvNets for Unseen Image Data35
Learning Virtual View Selection for 3D Scene Semantic Segmentation35
Gloss Prior Guided Visual Feature Learning for Continuous Sign Language Recognition35
FF-LPD: A Real-Time Frame-by-Frame License Plate Detector With Knowledge Distillation and Feature Propagation35
Local Intensity Order Transformation for Robust Curvilinear Object Segmentation34
Learnable Feature Augmentation Framework for Temporal Action Localization34
Trustworthy Limited Data CT Reconstruction Using Progressive Artifact Image Learning34
Wavelet-Based Texture Reformation Network for Image Super-Resolution34
A Gating Model for Bias Calibration in Generalized Zero-shot Learning34
Semi-Supervised Learning With Heterogeneous Distribution Consistency for Visible Infrared Person Re-Identification33
MM-Net: A MixFormer-Based Multi-Scale Network for Anatomical and Functional Image Fusion33
A no-Reference Stereoscopic Image Quality Assessment Network Based on Binocular Interaction and Fusion Mechanisms33
Angular Isotonic Loss Guided Multi-Layer Integration for Few-Shot Fine-Grained Image Classification32
Multiview Spectral Clustering With Bipartite Graph32
Towards Low Light Enhancement With RAW Images32
ShaTure: Shape and Texture Deformation for Human Pose and Attribute Transfer32
FineAction: A Fine-Grained Video Dataset for Temporal Action Localization32
Graph Convolutional Dictionary Selection With L, Norm for Video Summarization32
Robust Object Detection via Adversarial Novel Style Exploration32
Two-Stage Copy-Move Forgery Detection With Self Deep Matching and Proposal SuperGlue32
Triple-Level Model Inferred Collaborative Network Architecture for Video Deraining32
Exploiting Non-Local Priors via Self-Convolution for Highly-Efficient Image Restoration32
Beyond Appearance: Multi-Frame Spatio-Temporal Context Memory Networks for Efficient and Robust Video Object Segmentation32
Locality-Aware Channel-Wise Dropout for Occluded Face Recognition31
Human Co-Parsing Guided Alignment for Occluded Person Re-Identification31
Content-Aware Scalable Deep Compressed Sensing31
Weakly Supervised Visual Saliency Prediction31
Color Image Recovery Using Low-Rank Quaternion Matrix Completion Algorithm31
Fast Multi-Grid Methods for Minimizing Curvature Energies31
Attention-Guided Progressive Neural Texture Fusion for High Dynamic Range Image Restoration31
BANet: A Blur-Aware Attention Network for Dynamic Scene Deblurring30
Few-Shot Learning With Class-Covariance Metric for Hyperspectral Image Classification30
Exploring Spatial Correlation for Light Field Saliency Detection: Expansion From a Single View30
CBNet: A Composite Backbone Network Architecture for Object Detection30
No-Reference Image Quality Assessment by Hallucinating Pristine Features30
Discrepancy-Aware Meta-Learning for Zero-Shot Face Manipulation Detection30
Self-Parameter Distillation Dehazing30
NCSiam: Reliable Matching via Neighborhood Consensus for Siamese-Based Object Tracking30
DTCM: Joint Optimization of Dark Enhancement and Action Recognition in Videos29
Weakly-Supervised Salient Object Detection on Light Fields29
Transductive Few-Shot Learning With Enhanced Spectral-Spatial Embedding for Hyperspectral Image Classification29
MagConv: Mask-Guided Convolution for Image Inpainting29
Rotation-Invariant Attention Network for Hyperspectral Image Classification29
GaitMPL: Gait Recognition With Memory-Augmented Progressive Learning29
Raformer: Redundancy-Aware Transformer for Video Wire Inpainting29
Fine-Grained Essential Tensor Learning for Robust Multi-View Spectral Clustering29
Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA29
Relationship-Guided Knowledge Transfer for Class-Incremental Facial Expression Recognition29
Facial Prior Guided Micro-Expression Generation29
Deep Multi-Exposure Image Fusion for Dynamic Scenes29
Dynamic Facial Expression Recognition Under Partial Occlusion With Optical Flow Reconstruction28
Plug-and-Play Regulators for Image-Text Matching28
Temporal Phase Unwrapping Based on Unequal Phase-Shifting Code28
Restructuring the Teacher and Student in Self-Distillation28
0.13177919387817