OOIR: Observatory of International Research

Papers

(The TQCC of IEEE Transactions on Image Processing is 21. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-11-01 to 2025-11-01.)

Article	Citations
Consensus Sparsity: Multi-Context Sparse Image Representation via L _∞-Induced Matrix Variate	684
SemiRS-COC: Semi-Supervised Classification for Complex Remote Sensing Scenes With Cross-Object Consistency	654
HAda: Hyper-Adaptive Parameter-Efficient Learning for Multi-View ConvNets	592
Pro2Diff: Proposal Propagation for Multi-Object Tracking via the Diffusion Model	581
Multiframe Joint Enhancement for Early Interlaced Videos	482
Fine-Grained Recognition With Learnable Semantic Data Augmentation	434
Cross-Modality Pyramid Alignment for Visual Intention Understanding	402
OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free Environments	397
MaCon: A Generic Self-Supervised Framework for Unsupervised Multimodal Change Detection	363
An Adaptive Multi-Granularity Graph Representation of Image via Granular-ball Computing	361
Uncertainty-Guided Refinement for Fine-Grained Salient Object Detection	286
Discrete Metric Learning for Fast Image Set Classification	282
Bi-Nuclear Tensor Schatten-p Norm Minimization for Multi-View Subspace Clustering	253
GMLight: Lighting Estimation via Geometric Distribution Approximation	234
Graph Convolutional Dictionary Selection With L₂_, ₚ Norm for Video Summarization	229
Cross-Modal Retrieval With Noisy Correspondence via Consistency Refining and Mining	211
Density-Guided Incremental Dominant Instance Exploration for Two-View Geometric Model Fitting	204
TTVFI: Learning Trajectory-Aware Transformer for Video Frame Interpolation	203
Multi-Granularity Contrastive Cross-Modal Collaborative Generation for End-to-End Long-Term Video Question Answering	199
Contrast-Reconstruction Representation Learning for Self-Supervised Skeleton-Based Action Recognition	197
Multimodal Unrolled Robust PCA for Background Foreground Separation	192
A Fast and Efficient Shape Blending by Stable and Analytically Invertible Finite Descriptors	192
Variational Structured Attention Networks for Deep Visual Representation Learning	189
Equivariant Local Reference Frames With Optimization for Robust Non-Rigid Point Cloud Correspondence	178
Automatic Quaternion-Domain Color Image Stitching	178

A Low-Rank Tensor Decomposition Model With Factors Prior and Total Variation for Impulsive Noise Removal	178
FF-LPD: A Real-Time Frame-by-Frame License Plate Detector With Knowledge Distillation and Feature Propagation	175
STPNet: Scale-Aware Text Prompt Network for Medical Image Segmentation	173
Self-Supervised Matting-Specific Portrait Enhancement and Generation	168
Color Spike Camera Reconstruction via Long Short-Term Temporal Aggregation of Spike Signals	163
AdaAugment: A Tuning-Free and Adaptive Approach to Enhance Data Augmentation	162
Spatial Frequency Modulation Network for Efficient Image Dehazing	160
Canonical Correlation Analysis With Low-Rank Learning for Image Representation	158
Learning Spectral Cues for Multispectral and Panchromatic Image Fusion	145
An Explanation Method Based on Interpretable Linear Model With Four Key Characteristics	143
Real Image Denoising With a Locally-Adaptive Bitonic Filter	140
One-Class Classification Using ℓp-Norm Multiple Kernel Fisher Null Approach	140
Dual Alternating Direction Method of Multipliers for Inverse Imaging	139
Pose-Appearance Relational Modeling for Video Action Recognition	132
Harnessing Multi-modal Large Language Models for Measuring and Interpreting Color Differences	132
Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments	130
Attentive WaveBlock: Complementarity-Enhanced Mutual Networks for Unsupervised Domain Adaptation in Person Re-Identification and Beyond	130
Cross-Domain Few-Shot Medical Image Segmentation via Dynamic Semantic Matching	129
Toward Efficient Test Time Adaptation With Hierarchical Distribution Alignment	129
Multi-Constraint Adversarial Networks for Unsupervised Image-to-Image Translation	129
Toward Projected Clustering With Aggregated Mapping	127
Few-Shot Learning With Class-Covariance Metric for Hyperspectral Image Classification	126
Graph Embedding Contrastive Multi-Modal Representation Learning for Clustering	126
Differentiable SAR Renderer and Image-Based Target Reconstruction	124
Attention-Guided Neural Networks for Full-Reference and No-Reference Audio-Visual Quality Assessment	121
Variational Bayes Image Restoration With Compressive Autoencoders	121
NeuralDiffuser: Neuroscience-Inspired Diffusion Guidance for fMRI Visual Reconstruction	120
Advances in Predictive RAHT for Geometric Point Cloud Compression	120
Non-Cascaded and Crosstalk-Free Multi-Image Encryption Based on Optical Scanning Holography Using 2D Orthogonal Compressive Sensing	120
Interactive Face Video Coding: A Generative Compression Framework	119
Fast 3D Room Layout Estimation Based on Compact High-Level Representation	117
Cross-Domain Diffusion With Progressive Alignment for Efficient Adaptive Retrieval	117
Generalization Beyond Feature Alignment: Concept Activation-Guided Contrastive Learning	116
Grammar-Induced Wavelet Network for Human Parsing	114
Cross-Layer Contrastive Learning of Latent Semantics for Facial Expression Recognition	110
Motion and Appearance Decoupling Representation for Event Cameras	110
Hyperspectral Meets Optical Flow: Spectral Flow Extraction for Hyperspectral Image Classification	106
Unsupervised Modality-Transferable Video Highlight Detection With Representation Activation Sequence Learning	106
Efficient Semi-Supervised Multimodal Hashing With Importance Differentiation Regression	105
IMU-Assisted Online Video Background Identification	105
Transition Is a Process: Pair-to-Video Change Detection Networks for Very High Resolution Remote Sensing Images	104
Optimization-Inspired Learning With Architecture Augmentations and Control Mechanisms for Low-Level Vision	104
Inverse Image Frequency for Long-Tailed Image Recognition	103
Boundary-Aware Prototype in Semi-Supervised Medical Image Segmentation	102
SRS: Siamese Reconstruction-Segmentation Network Based on Dynamic-Parameter Convolution	99
Distractor-Aware Event-Based Tracking	99
Learning Dynamic Prompts for All-in-One Image Restoration	97
Multi-Source Unsupervised Domain Adaptation via Pseudo Target Domain	96
Precise Facial Landmark Detection by Reference Heatmap Transformer	95
KSS-ICP: Point Cloud Registration Based on Kendall Shape Space	93

SharpFormer: Learning Local Feature Preserving Global Representations for Image Deblurring	92
Stacked Deconvolutional Network for Semantic Segmentation	92
SegHSI: Semantic Segmentation of Hyperspectral Images With Limited Labeled Pixels	90
Video Moment Retrieval With Cross-Modal Neural Architecture Search	90
Toward Video Anomaly Retrieval From Video Anomaly Detection: New Benchmarks and Model	89
Addressing Challenges of Incorporating Appearance Cues Into Heuristic Multi-Object Tracker via a Novel Feature Paradigm	89
Decoupled Cross-Modal Phrase-Attention Network for Image-Sentence Matching	89
Cyclic Self-Training With Proposal Weight Modulation for Cross-Supervised Object Detection	88
Multi-Condition Latent Diffusion Network for Scene-Aware Neural Human Motion Prediction	88
Unsupervised Foggy Scene Understanding via Self Spatial-Temporal Label Diffusion	88
Rethinking Sampling Strategies for Unsupervised Person Re-Identification	86
Bidirectional Mapping Coupled GAN for Generalized Zero-Shot Learning	83
Unsupervised Person Re-Identification With Stochastic Training Strategy	83
Cross-Attentional Spatio-Temporal Semantic Graph Networks for Video Question Answering	80
Point-Based Learnable Query Generator for Human–Object Interaction Detection	80
Coarse-to-Fine Contrastive Self-Supervised Feature Learning for Land-Cover Classification in SAR Images With Limited Labeled Data	80
Commonality Feature Representation Learning for Unsupervised Multimodal Change Detection	79
Weighted Feature Fusion of Convolutional Neural Network and Graph Attention Network for Hyperspectral Image Classification	79
RSSFormer: Foreground Saliency Enhancement for Remote Sensing Land-Cover Segmentation	78
NR-MVSNet: Learning Multi-View Stereo Based on Normal Consistency and Depth Refinement	78
FsaNet: Frequency Self-Attention for Semantic Segmentation	77
BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation	77
Weakly Supervised Semantic Segmentation via Alternate Self-Dual Teaching	77
Multi-Exposure Image Fusion via Deformable Self-Attention	76
Fuzzy Sparse Subspace Clustering for Infrared Image Segmentation	76
Perceptually Weighted Rate Distortion Optimization for Video-Based Point Cloud Compression	76
DUT: Learning Video Stabilization by Simply Watching Unstable Videos	76
Joint Local and Nonlocal Progressive Prediction for Versatile Video Coding	76
Shared Manifold Regularized Joint Feature Selection for Joint Classification and Regression in Alzheimer’s Disease Diagnosis	75
Multispectral Snapshot Image Registration Using Learned Cross Spectral Disparity Estimation and a Deep Guided Occlusion Reconstruction Network	74
HQ2CL: A High-Quality Class Center Learning System for Deep Face Recognition	74
Learned Spherical Image Compression With Spherical Convolution-Self-Attention and Transformer Context Model	74
Reduced Biquaternion Dual-Branch Deraining U-Network via Multi-Attention Mechanism	73
Fast Learning Radiance Fields by Shooting Much Fewer Rays	73
NesTD-Net: Deep NESTA-Inspired Unfolding Network With Dual-Path Deblocking Structure for Image Compressive Sensing	73
RobustMat: Neural Diffusion for Street Landmark Patch Matching Under Challenging Environments	73
Rich Action-Semantic Consistent Knowledge for Early Action Prediction	72
A Discrete-Mapping-Based Cross-Component Prediction Paradigm for Screen Content Coding	72
High-Quality and Diverse Few-Shot Image Generation via Masked Discrimination	71
Noise Prior Knowledge Informed Bayesian Inference Network for Hyperspectral Super-Resolution	71
Graph-Based Depth Denoising & Dequantization for Point Cloud Enhancement	71
Fine-Grained Spatio-Temporal Parsing Network for Action Quality Assessment	70
Implicit-Explicit Integrated Representations for Multi-View Video Compression	69
HOPE: Enhanced Position Image Priors via High-Order Implicit Representations	68
Exploring the Potential of Pooling Techniques for Universal Image Restoration	68
Fuzzy Sparse Deviation Regularized Robust Principal Component Analysis	68
Characteristic Mapping for Ellipse Detection Acceleration	68
UVaT: Uncertainty Incorporated View-Aware Transformer for Robust Multi-View Classification	68
Robust Ellipse Fitting Based on Maximum Correntropy Criterion With Variable Center	66
Generalizing to Out-of-Sample Degradations via Model Reprogramming	65
Double Oracle Neural Architecture Search for Game Theoretic Deep Learning Models	65
MaskFaceGAN: High-Resolution Face Editing With Masked GAN Latent Code Optimization	65
Spatially Consistent Transformer for Colorization in Monochrome-Color Dual-Lens System	65
Compact Representation and Reliable Classification Learning for Point-Level Weakly-Supervised Action Localization	64
Image Reconstruction for Accelerated MR Scan With Faster Fourier Convolutional Neural Networks	64
Mutually Reinforcing Learning of Decoupled Degradation and Diffusion Enhancement for Unpaired Low-Light Image Lightening	64
Causal Inference Hashing for Long-Tailed Image Retrieval	64
Hierarchical Superpixel Segmentation by Parallel CRTrees Labeling	63
AAP-MIT: Attentive Atrous Pyramid Network and Memory Incorporated Transformer for Multisentence Video Description	63
One Sketch for All: One-Shot Personalized Sketch Segmentation	63
Rethinking Object Saliency Ranking: A Novel Whole-Flow Processing Paradigm	62
Cross-Modal Causal Representation Learning for Radiology Report Generation	61
RoMo: Robust Unsupervised Multimodal Learning With Noisy Pseudo Labels	61
Semi-Supervised Domain Adaptive Structure Learning	61
PolarPose: Single-Stage Multi-Person Pose Estimation in Polar Coordinates	61
Joint Denoising-Demosaicking Network for Long-Wave Infrared Division-of-Focal-Plane Polarization Images With Mixed Noise Level Estimation	61
DMRA: Depth-Induced Multi-Scale Recurrent Attention Network for RGB-D Saliency Detection	60
A Real-Time Memory Updating Strategy for Unsupervised Person Re-Identification	60
Semantic Representation and Attention Alignment for Graph Information Bottleneck in Video Summarization	59
Hierarchical Random Walker Segmentation for Large Volumetric Biomedical Images	59
Partition Map Prediction for Fast Block Partitioning in VVC Intra-Frame Coding	59
Enhancing Few-Shot Out-of-Distribution Detection With Pre-Trained Model Features	59
Energy-Based Domain Adaptation Without Intermediate Domain Dataset for Foggy Scene Segmentation	59
Continual Referring Expression Comprehension via Dual Modular Memorization	58
Toward Robust and Unconstrained Full Range of Rotation Head Pose Estimation	58
Multi-Scale Fusion and Decomposition Network for Single Image Deraining	58
Reviewer Summary for Transactions on Image Processing	57
Dynamic Atomic Column Detection in Transmission Electron Microscopy Videos via Ridge Estimation	57
Data Augmentation Using Bitplane Information Recombination Model	56
Arbitrary-Scale Texture Generation From Coarse-Grained Control	56

Image-Level Adaptive Adversarial Ranking for Person Re-Identification	56
MA-ST3D: Motion Associated Self-Training for Unsupervised Domain Adaptation on 3D Object Detection	56
Degraded Reference Image Quality Assessment	56
PVPUFormer: Probabilistic Visual Prompt Unified Transformer for Interactive Image Segmentation	55
Source-Guided Target Feature Reconstruction for Cross-Domain Classification and Detection	55
FOVQA: Blind Foveated Video Quality Assessment	55
UniEmoX: Cross-Modal Semantic-Guided Large-Scale Pretraining for Universal Scene Emotion Perception	55
Bayesian Nonnegative Tensor Completion With Automatic Rank Determination	55
Multi-Person Pose Tracking With Sparse Key-Point Flow Estimation and Hierarchical Graph Distance Minimization	55
CKD: Contrastive Knowledge Distillation From a Sample-Wise Perspective	55
SIR: Self-Supervised Image Rectification via Seeing the Same Scene From Multiple Different Lenses	55
U-N2C: A Dual Memory-Guided Disentanglement Framework for Unsupervised System Matrix Denoising in Magnetic Particle Imaging	55
A New Non-Linear Hyperbolic-Parabolic Coupled PDE Model for Image Despeckling	55
MetaAge: Meta-Learning Personalized Age Estimators	54
Interpretable Neural Networks for Video Separation: Deep Unfolding RPCA With Foreground Masking	54
Enhancing Text-Based Person Retrieval by Combining Fused Representation and Reciprocal Learning With Adaptive Loss Refinement	54
BPMTrack: Multi-Object Tracking With Detection Box Application Pattern Mining	54
Rotational Convolution: Rethinking Convolution for Downside Fisheye Images	54
Hierarchical Hashing Learning for Image Set Classification	54
Cross-Modal Contrastive Learning Network for Few-Shot Action Recognition	54
Deep Ranking Exemplar-Based Dynamic Scene Deblurring	54
Few-Shot Domain Adaptation via Mixup Optimal Transport	53
View-Wise Versus Cluster-Wise Weight: Which Is Better for Multi-View Clustering?	53
CartoonLossGAN: Learning Surface and Coloring of Images for Cartoonization	53
Restoration of Images Taken Through a Dirty Window Using Optics-Guided Transformer	52
Neural Scene Designer: Self-Styled Semantic Image Manipulation	52
Sensitivity Decouple Learning for Image Compression Artifacts Reduction	52
PCE-GAN: A Generative Adversarial Network for Point Cloud Attribute Quality Enhancement Based on Optimal Transport	52
Model-Induced Generalization Error Bound for Information-Theoretic Representation Learning in Source-Data-Free Unsupervised Domain Adaptation	52
Image Compression Using Stochastic-AFD Based Multisignal Sparse Representation	52
SSL++: Improving Self-Supervised Learning by Mitigating the Proxy Task-Specificity Problem	51
Learning Transferable Conceptual Prototypes for Interpretable Unsupervised Domain Adaptation	51
Multistage Spatio-Temporal Networks for Robust Sketch Recognition	51
Learning Domain Invariant Representations for Generalizable Person Re-Identification	51
Multi-Label Auroral Image Classification Based on CNN and Transformer	51
Sampling Agnostic Feature Representation for Long-Term Person Re-Identification	51
Dynamic Slimmable Denoising Network	50
PFONet: A Progressive Feedback Optimization Network for Lightweight Single Image Dehazing	50
HyperE2VID: Improving Event-Based Video Reconstruction via Hypernetworks	50
Advancing Video Anomaly Detection: A Bi-Directional Hybrid Framework for Enhanced Single- and Multi-Task Approaches	50
Contrastive Conditional Latent Diffusion for Audio-Visual Segmentation	49
Toward Scalable and Unified Example-Based Explanation and Outlier Detection	49
Bi-Directional Pseudo-Three-Dimensional Network for Video Frame Interpolation	49
Ingredient-Guided Region Discovery and Relationship Modeling for Food Category-Ingredient Prediction	48
Perception-Guided Quality Metric of 3D Point Clouds Using Hybrid Strategy	48
MBFQuant: A Multiplier-Bitwidth-Fixed, Mixed-Precision Quantization Method for Mobile CNN-Based Applications	48
Boosting Monocular 3D Human Pose Estimation With Part Aware Attention	48
Underwater Image Enhancement via Minimal Color Loss and Locally Adaptive Contrast Enhancement	47
PointFormer: Keypoint-Guided Transformer for Simultaneous Nuclei Segmentation and Classification in Multi-Tissue Histology Images	47
Attribute and State Guided Structural Embedding Network for Vehicle Re-Identification	47
Zero-Shot Camouflaged Object Detection	47
Rethinking the Low-Light Video Enhancement: Benchmark Datasets and Methods	47
U-Shape Transformer for Underwater Image Enhancement	47
Hyperspectral Image Classification via Cascaded Spatial Cross-Attention Network	47
Underwater Image Enhancement With Hyper-Laplacian Reflectance Priors	47
Learned Image Compression With Gaussian-Laplacian-Logistic Mixture Model and Concatenated Residual Modules	46
TTST: A Top-k Token Selective Transformer for Remote Sensing Image Super-Resolution	46
Improving Transferability of Universal Adversarial Perturbation With Feature Disruption	46
DO-Conv: Depthwise Over-Parameterized Convolutional Layer	46
FABNet: Frequency-Aware Binarized Network for Single Image Super-Resolution	46
SDSFusion: A Semantic-Aware Infrared and Visible Image Fusion Network for Degraded Scenes	46
Leveraging Frequency Analysis for Image Denoising Network Pruning	46
DVMark: A Deep Multiscale Framework for Video Watermarking	46
Deep Underwater Image Quality Assessment With Explicit Degradation Awareness Embedding	46
IEEE Transactions on Image Processing publication information	46
Spatio-Temporal Correlation Guided Geometric Partitioning for Versatile Video Coding	46
Explicitly-Decoupled Text Transfer With Minimized Background Reconstruction for Scene Text Editing	45
ECEA: Extensible Co-Existing Attention for Few-Shot Object Detection	45
Rethinking Generalized Zero-Shot Learning: A Synthesized Per-Instance Attribute Perspective	45
Versatile Denoising-Based Approximate Message Passing for Compressive Sensing	45
Linearly Transformed Color Guide for Low-Bitrate Diffusion-Based Image Compression	45
State-Aware Compositional Learning Toward Unbiased Training for Scene Graph Generation	45
Zero-Shot Skeleton-Based Action Recognition With Prototype-Guided Feature Alignment	44
Multi-Modal Remote Sensing Image Matching Considering Co-Occurrence Filter	44
Designing an Illumination-Aware Network for Deep Image Relighting	44
Scale-Aware Crowd Counting Network With Annotation Error Modeling	44
Hierarchical Prior-Based Super Resolution for Point Cloud Geometry Compression	44
Enhancing Multimodal Learning via Hierarchical Fusion Architecture Search With Inconsistency Mitigation	44
Diverse Target and Contribution Scheduling for Domain Generalization	44
C-NeRF: Representing Scene Changes as Directional Consistency Difference-based NeRF	43
Advancing Weakly-Supervised Change Detection in Satellite Images via Adversarial Class Prompting	43
StreakNet-Arch: An Anti-Scattering Network-Based Architecture for Underwater Carrier LiDAR-Radar Imaging	43
View-Consistency Learning for Incomplete Multiview Clustering	43
YOLOH: You Only Look One Hourglass for Real-Time Object Detection	43
Hyperpixels: Flexible 4D Over-Segmentation for Dense and Sparse Light Fields	42
Siamese-DETR for Generic Multi-Object Tracking	42
Lightweight Deep Neural Networks for Ship Target Detection in SAR Imagery	42
Magi-Net: Meta Negative Network for Early Activity Prediction	42
Decoupling Discriminative Attributes for Few-Shot Fine-Grained Recognition	42
Cluster-Guided Asymmetric Contrastive Learning for Unsupervised Person Re-Identification	42
BVI-VFI: A Video Quality Database for Video Frame Interpolation	42
Toward Transparent Deep Image Aesthetics Assessment With Tag-Based Content Descriptors	42
Accurate 3D Measurement of Complex Texture Objects by Height Compensation Using a Dual-Projector Structure	42
Coupled Splines for Sparse Curve Fitting	41
Action Quality Assessment via Hierarchical Pose-Guided Multi-Stage Contrastive Regression	41