OOIR: Observatory of International Research

Papers

(The TQCC of IEEE Transactions on Pattern Analysis and Machine Intelligence is 25. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-01-01 to 2026-01-01.)

Article	Citations
[Back cover - Table of contents, continued]	3182
Front Cover	1690
One-for-All: Towards Universal Domain Translation With a Single StyleGAN	1610
Editorial: Introduction to the Special Section on Best of CVPR'2022	1549
Self-Supervised Skeleton Representation Learning Via Actionlet Contrast and Reconstruct	1515
BiBBDM: Bidirectional Image Translation With Brownian Bridge Diffusion Models	1221
Principal Uncertainty Quantification With Spatial Correlation for Image Restoration Problems	1171
S-NeRF++: Autonomous Driving Simulation via Neural Reconstruction and Generation	982
MECD+ : Unlocking Event-Level Causal Graph Discovery for Video Reasoning	789
ResNet-LDDMM: Advancing the LDDMM Framework using Deep Residual Networks	765
Active Supervised Cross-Modal Retrieval	611
DAQE: Enhancing the Quality of Compressed Images by Exploiting the Inherent Characteristic of Defocus	597
Event-Based Photometric Bundle Adjustment	577
On the Trade-Off Between Flatness and Optimization in Distributed Learning	576
VATr++: Choose Your Words Wisely for Handwritten Text Generation	546
Revisiting Transformation Invariant Geometric Deep Learning: An Initial Representation Perspective	534
Learning to Guide a Saturation-Based Theorem Prover	523
Towards Accurate and Compact Architectures via Neural Architecture Transformer	518
Sparse-to-Dense Matching Network for Large-Scale LiDAR Point Cloud Registration	496
Separable Spatial-Temporal Residual Graph for Cloth-Changing Group Re-Identification	490
A Hybrid Stochastic-Deterministic Minibatch Proximal Gradient Method for Efficient Optimization and Generalization	488
Enhancing Representations Through Heterogeneous Self-Supervised Learning	483
Quadratic Matrix Factorization With Applications to Manifold Learning	475
Learning With Style: Continual Semantic Segmentation Across Tasks and Domains	474
Adaptive Surface Normal Constraint for Geometric Estimation From Monocular Images	471

Task-Oriented Channel Attention for Fine-Grained Few-Shot Classification	456
Implicit Annealing in Kernel Spaces: A Strongly Consistent Clustering Approach	455
SNI-SLAM++: Tightly-Coupled Semantic Neural Implicit SLAM	455
Video Demoireing Using Focused-Defocused Dual-Camera System	445
Instance Shadow Detection with A Single-Stage Detector	421
Rethinking Rotation-Invariant Recognition of Fine-grained Shapes from the Perspective of Contour Points	415
MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation Segmentation	413
Learn to Predict Sets Using Feed-Forward Neural Networks	394
DVIS++: Improved Decoupled Framework for Universal Video Segmentation	393
Metrics for Dataset Demographic Bias: A Case Study on Facial Expression Recognition	376
Test-time Correction: An Online 3D Detection System via Visual Prompting	371
Optimization-Based Post-Training Quantization With Bit-Split and Stitching	370
Invariant Policy Learning: A Causal Perspective	368
Face Forgery Detection by 3D Decomposition and Composition Search	360
Locating and Counting Heads in Crowds With a Depth Prior	359
Physics-Informed Guided Disentanglement in Generative Networks	353
A Generative Model for Generic Light Field Reconstruction	353
Weakly Supervised Semantic Segmentation via Box-Driven Masking and Filling Rate Shifting	342
Towards Expressive Spectral-Temporal Graph Neural Networks for Time Series Forecasting	341
A Clustering Validity Index With Multi-Granularity Fusion for Multiple Fuzzy Clustering Algorithms	338
OPAL: Occlusion Pattern Aware Loss for Unsupervised Light Field Disparity Estimation	334
Symbolic Visual Reinforcement Learning: A Scalable Framework With Object-Level Abstraction and Differentiable Expression Search	332
Detection-Friendly Dehazing: Object Detection in Real-World Hazy Scenes	330
Point-to-Pixel Prompting for Point Cloud Analysis With Pre-Trained Image Models	319
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting	319
Graph Convolutional Module for Temporal Action Localization in Videos	315
Towards Unified Deep Image Deraining: A Survey and a New Benchmark	300
Multiview Clustering: A Scalable and Parameter-Free Bipartite Graph Fusion Method	298
Omni-Training: Bridging Pre-Training and Meta-Training for Few-Shot Learning	283
Prior Image Guided Snapshot Compressive Spectral Imaging	270
Ensemble-Enhanced Semi-Supervised Learning With Optimized Graph Construction for High-Dimensional Data	269
Digging Into Uncertainty-Based Pseudo-Label for Robust Stereo Matching	268
Learning Graph Convolutional Networks for Multi-Label Recognition and Applications	265
Interactive NeRF Geometry Editing With Shape Priors	264
Structure-Preserving Image Super-Resolution	263
Motion-Aware Dynamic Graph Neural Network for Video Compressive Sensing	263
Multiple Video Frame Interpolation via Enhanced Deformable Separable Convolution	262
Modeling Noisy Annotations for Point-Wise Supervision	254
Probing Synergistic High-Order Interaction for Multi-Modal Image Fusion	253
Inferring Point Cloud Quality via Graph Similarity	251
Interaction-Based Inductive Bias in Graph Neural Networks: Enhancing Protein-Ligand Binding Affinity Predictions From 3D Structures	247
Simplicial Complex Neural Networks	247
Are Graph Convolutional Networks With Random Weights Feasible?	246
Centerless Clustering	240
Unsupervised Domain Adaptation via Discriminative Manifold Propagation	238
Multi-Dataset, Multitask Learning of Egocentric Vision Tasks	236
Vertical Layering of Quantized Neural Networks for Heterogeneous Inference	235
Learning Signed Hyper Surfaces for Oriented Point Cloud Normal Estimation	234
Face Generation and Editing With StyleGAN: A Survey	231
Asymmetric Convolution: An Efficient and Generalized Method to Fuse Feature Maps in Multiple Vision Tasks	230

Deep Long-Tailed Learning: A Survey	229
Affective Image Content Analysis: Two Decades Review and New Perspectives	228
Transformer-Based Visual Segmentation: A Survey	225
Guaranteed Tensor Recovery Fused Low-rankness and Smoothness	221
A Comprehensive and Modularized Statistical Framework for Gradient Norm Equality in Deep Neural Networks	220
Analysis of Video Quality Datasets via Design of Minimalistic Video Quality Models	220
A Survey on Deep Learning Techniques for Stereo-Based Depth Estimation	218
IEEE Computer Society Has You Covered!	215
Fast Component Tree Computation for Images of Limited Levels	212
Cover 2	212
BNET: Batch Normalization With Enhanced Linear Transformation	209
Universal Image Segmentation With Efficiency	208
New Dataset and Methods for Fine-Grained Compositional Referring Expression Comprehension via Specialist-MLLM Collaboration	206
Temporal Feature Matters: A Framework for Diffusion Model Quantization	202
Spatial-Temporal Transformer for Video Snapshot Compressive Imaging	201
Learning to See Through With Events	198
Differential Viewpoints for Ground Terrain Material Recognition	197
Revisiting Transferable Adversarial Images: Systemization, Evaluation, and New Insights	194
Adaptive Transfer Kernel Learning for Transfer Gaussian Process Regression	191
Orientational Distribution Learning with Hierarchical Spatial Attention for Open Set Recognition	188
Graph-Oriented Instruction Tuning of Large Language Models for Generic Graph Mining	186
M3D: A Multimodal, Multilingual and Multitask Dataset for Grounded Document-Level Information Extraction	183
Flare7K++: Mixing Synthetic and Real Datasets for Nighttime Flare Removal and Beyond	182
RF-Next: Efficient Receptive Field Search for Convolutional Neural Networks	181
Random Permutation Set Reasoning	180
Meta Invariance Defense Towards Generalizable Robustness to Unknown Adversarial Attacks	180
Power Normalizations in Fine-Grained Image, Few-Shot Image and Graph Classification	177
Learning From Partially Labeled Data for Multi-Organ and Tumor Segmentation	177
Towards Pointsets Representation Learning via Self-Supervised Learning and Set Augmentation	175
Image-to-Image Translation With Disentangled Latent Vectors for Face Editing	173
Influence Function Based Second-Order Channel Pruning: Evaluating True Loss Changes for Pruning is Possible Without Retraining	172
Curriculum-Based Asymmetric Multi-Task Reinforcement Learning	171
Deep Learning for Face Anti-Spoofing: A Survey	170
Reduced-Rank Tensor-on-Tensor Regression and Tensor-Variate Analysis of Variance	169
Human-Centric Transformer for Domain Adaptive Action Recognition	164
Inter-Intra Hypergraph Computation for Survival Prediction on Whole Slide Images	162
AutoNovel: Automatically Discovering and Learning Novel Visual Categories	162
Out-of-Domain Generalization From a Single Source: An Uncertainty Quantification Approach	160
MHF-Net: An Interpretable Deep Network for Multispectral and Hyperspectral Image Fusion	159
HiGCIN: Hierarchical Graph-Based Cross Inference Network for Group Activity Recognition	158
On the Robustness of Average Losses for Partial-Label Learning	157
Towards Deviation-Robust Agent Navigation via Perturbation-Aware Contrastive Learning	152
MoIL: Momentum Imitation Learning for Efficient Vision-Language Adaptation	152
On Positive-Unlabeled Classification From Corrupted Data in GANs	151
GradMDM: Adversarial Attack on Dynamic Networks	150
Human Interaction Understanding With Consistency-Aware Learning	150
InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis With Semantic Graph Prior	149
SPARE: Symmetrized Point-to-Plane Distance for Robust Non-Rigid 3D Registration	149
Learning Graph Attentions via Replicator Dynamics	148
Learning Efficient Meshflow and Optical Flow from Event Cameras	148
Image Lens Flare Removal Using Adversarial Curve Learning	147
To Fold or Not to Fold: Graph Regularized Tensor Train for Visual Data Completion	146
Bridging Actions: Generate 3D Poses and Shapes In-Between Photos	146
Matrix Completion via Non-Convex Relaxation and Adaptive Correlation Learning	142
Deep Learning-Based Point Cloud Compression: An In-Depth Survey and Benchmark	141
SPLiT: Single Portrait Lighting Estimation via a Tetrad of Face Intrinsics	139
Dynamic Self-Supervised Teacher-Student Network Learning	139
Evaluation for Weakly Supervised Object Localization: Protocol, Metrics, and Datasets	139
GenPoly: Learning Generalized and Tessellated Shape Priors via 3D Polymorphic Evolving	139
Physics-Informed Matrix Factorization Operator	133
Reconstruction Guided Meta-Learning for Few Shot Open Set Recognition	132
A Variational EM Acceleration for Efficient Clustering at Very Large Scales	132
Unbiased Scene Graph Generation via Two-Stage Causal Modeling	131
SVGDreamer++: Advancing Editability and Diversity in Text-Guided SVG Generation	131
Compositional Scene Representation Learning via Reconstruction: A Survey	130
LMP-GAN: Out-of-Distribution Detection for Non-Control Data Malware Attacks	130
Rate-Distortion Theory in Coding for Machines and Its Applications	130
Hypergraph-Based Multi-View Action Recognition Using Event Cameras	129
Scalable Optimal Transport Methods in Machine Learning: A Contemporary Survey	129
Ensemble Multi-Quantiles: Adaptively Flexible Distribution Prediction for Uncertainty Quantification	129
A Unified Experience Replay Framework for Spiking Deep Reinforcement Learning	128
Differentially Private Graph Neural Networks for Whole-Graph Classification	128
Correcting Optical Aberration via Depth-Aware Point Spread Functions	127
VNVC: A Versatile Neural Video Coding Framework for Efficient Human-Machine Vision	127
MESA: Effective Matching Redundancy Reduction by Semantic Area Segmentation	127
MRA-Net: Improving VQA Via Multi-Modal Relation Attention Network	127
Weakly Supervised Tracklet Association Learning With Video Labels for Person Re-Identification	126
A Unified Decision Rule for Generalized Out-of-Distribution Detection	126
Self-Scalable Tanh (Stan): Multi-Scale Solutions for Physics-Informed Neural Networks	126
Discriminant Feature Extraction by Generalized Difference Subspace	125

GCoNet+: A Stronger Group Collaborative Co-Salient Object Detector	124
Controllable Generation with Text-to-Image Diffusion Models: a Survey	124
Dataset Security for Machine Learning: Data Poisoning, Backdoor Attacks, and Defenses	124
Deep Gait Recognition: A Survey	123
Fear-Neuro-Inspired Reinforcement Learning for Safe Autonomous Driving	121
Continuous Review and Timely Correction: Enhancing the Resistance to Noisy Labels Via Self-Not-True and Class-Wise Distillation	121
Variational Data-Free Knowledge Distillation for Continual Learning	120
Revisiting Nonlocal Self-Similarity from Continuous Representation	120
A Fully Automated Method for 3D Individual Tooth Identification and Segmentation in Dental CBCT	119
P2T: Pyramid Pooling Transformer for Scene Understanding	119
Enhancing Photorealism Enhancement	119
Domain Generalization: A Survey	118
Knowledge-Based Embodied Question Answering	118
Self-Supervised Multimodal Learning: A Survey	118
Dawn of the Transformer Era in Speech Emotion Recognition: Closing the Valence Gap	118
A New Brain Network Construction Paradigm for Brain Disorder via Diffusion-Based Graph Contrastive Learning	116
Robust Multimodal Learning With Missing Modalities via Parameter-Efficient Adaptation	115
PathNet: Path-Selective Point Cloud Denoising	114
Asymmetric Loss Functions for Noise-Tolerant Learning: Theory and Applications	114
From Simple to Complex Scenes: Learning Robust Feature Representations for Accurate Human Parsing	114
QDTrack: Quasi-Dense Similarity Learning for Appearance-Only Multiple Object Tracking	114
Accurate and Efficient Stereo Matching via Attention Concatenation Volume	113
Interpretable Optimization-Inspired Unfolding Network for Low-Light Image Enhancement	113
MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning	113
Advances and Challenges in Meta-Learning: A Technical Review	112
Semi-Supervised Learning for FGVC With Out-of-Category Data	111
Any Fashion Attribute Editing: Dataset and Pretrained Models	111
ComputingEdge ad	111
WildVideo: Benchmarking LMMs for Understanding Video-Language Interaction	111
JointFormer: A Unified Framework With Joint Modeling for Video Object Segmentation	111
Low-Shot Video Object Segmentation	111
MoBluRF: Motion Deblurring Neural Radiance Fields for Blurry Monocular Video	110
Test-Time Training for Hyperspectral Image Super-Resolution	110
Cover 3	110
On the Universal Approximation Properties of Deep Neural Networks Using MAM Neurons	110
Unified Modality Separation: A Vision-Language Framework for Unsupervised Domain Adaptation	109
Towards Reliable and Faithful Explanations: A Disentanglement-Augmented Approach for Selective Rationalization	108
Probabilistic Directed Distance Fields for Ray-Based Shape Representations	108
Compositional Physical Reasoning of Objects and Events From Videos	108
Reinforcing Generated Images via Meta-Learning for One-Shot Fine-Grained Visual Recognition	106
S$^{2}$ 2O: Enhancing Adversarial Training With Second-Order Statistics of Weights	106
Reframing Neural Networks: Deep Structure in Overcomplete Representations	106
3D Visual Saliency: An Independent Perceptual Measure or a Derivative of 2D Image Saliency?	106
AutoEval: Are Labels Always Necessary for Classifier Accuracy Evaluation?	106
SKDF: A Simple Knowledge Distillation Framework for Distilling Open-Vocabulary Knowledge to Open-World Object Detector	106
Noisy Label Learning With Provable Consistency for a Wider Family of Losses	105
Supervision by Denoising	105
Scale Propagation Network for Generalizable Depth Completion	104
An Energy-Based Prior for Generative Saliency	104
The Cluster Structure Function	104
Video DataFlywheel: Resolving the Impossible Data Trinity in Video-Language Understanding	104
Editorial: Special Section on Egocentric Perception	103
TN-ZSTAD: Transferable Network for Zero-Shot Temporal Activity Detection	103
Unified Adversarial Patch for Visible-Infrared Cross-Modal Attacks in the Physical World	103
ONNXPruner: ONNX-Based General Model Pruning Adapter	103
Continual Unsupervised Generative Modeling	103
Interpolated Joint Space Adversarial Training for Robust and Generalizable Defenses	103
Adaptive Perspective Distillation for Semantic Segmentation	102
Analysis of the Hands in Egocentric Vision: A Survey	100
Deciphering the Feature Representation of Deep Neural Networks for High-Performance AI	99
Dynamic Differential Image Circle Diameter Measurement Precision Assessment: Application to Burning Droplets	99
Cascaded Dynamic Memory Refinement and Semantic Alignment for Exo-to-Ego Cross-View Video Generation	99
CC4S: Encouraging Certainty and Consistency in Scribble-Supervised Semantic Segmentation	99
Improving Machine Vision Using Human Perceptual Representations: The Case of Planar Reflection Symmetry for Object Classification	99
luvHarris: A Practical Corner Detector for Event-Cameras	98
FreeFusion: Infrared and Visible Image Fusion via Cross Reconstruction Learning	98
Temporal Stereo Matching From Event Cameras Via Joint Learning With Stereoscopic Flow	98
Modeling the Label Distributions for Weakly-Supervised Semantic Segmentation	98
Orthogonal Decoupling Contrastive Regularization: Towards Uncorrelated Feature Decoupling for Unpaired Image Restoration	97
Relationship Quantification of Image Degradations	97
Joint Framework for Single Image Reconstruction and Super-Resolution With an Event Camera	97
Stimulative Training++: Go Beyond the Performance Limits of Residual Networks	97
Adversarially Robust Neural Architectures	96
Reusable Architecture Growth for Continual Stereo Matching	96
PMGT-VR: A Decentralized Proximal-Gradient Algorithmic Framework With Variance Reduction	95
Hypergraph-Based High-Order Correlation Analysis for Large-Scale Long-Tailed Data Classification	95
GLC++: Source-Free Universal Domain Adaptation Through Global-Local Clustering and Contrastive Affinity Learning	95
Compositional Generative Model of Unbounded 4D Cities	95
SS-NeRF: Physically Based Sparse Spectral Rendering With Neural Radiance Field	93
PRANCE: Joint Token-Optimization and Structural Channel-Pruning for Adaptive ViT Inference	93
Self-Guidance: Boosting Flow and Diffusion Generation on Their Own	93
RED++ : Data-Free Pruning of Deep Neural Networks via Input Splitting and Output Merging	92
Deep Learning on Object-Centric 3D Neural Fields	92
$\mathcal {X}$-Metric: An N-Dimensional Information-Theoretic Framework for Groupwise Registration and Deep Combined Computing	91
Homeomorphism Prior for False Positive and Negative Problem in Medical Image Dense Contrastive Representation Learning	91
ModeRNN: Harnessing Spatiotemporal Mode Collapse in Unsupervised Predictive Learning	91
CycMuNet+: Cycle-Projected Mutual Learning for Spatial-Temporal Video Super-Resolution	91
GhostingNet: A Novel Approach for Glass Surface Detection With Ghosting Cues	91
Learning With Constraint Learning: New Perspective, Solution Strategy and Various Applications	90
Generalized Task-Driven Medical Image Quality Enhancement With Gradient Promotion	90
Multi-Modality Multi-Attribute Contrastive Pre-Training for Image Aesthetics Computing	90
A Thorough Benchmark and a New Model for Light Field Saliency Detection	90
Human as Points: Explicit Point-Based 3D Human Reconstruction From Single-View RGB Images	90
Revealing the Dark Side of Non-Local Attention in Single Image Super-Resolution	89
STAR-FC: Structure-Aware Face Clustering on Ultra-Large-Scale Graphs	89