International Journal of Computer Vision

Papers
(The H4-Index of International Journal of Computer Vision is 57. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-01-01 to 2026-01-01.)
ArticleCitations
Learning Accurate Performance Predictors for Ultrafast Automated Model Compression2060
Instance-Aware Scene Layout Forecasting646
Exploring the Semi-Supervised Video Object Segmentation Problem from a Cyclic Perspective450
Guest Editorial: Special Issue on Open-World Visual Recognition355
Learning Extensible Series-Parallel Lookup Tables for Efficient Image Super-Resolution291
View Birdification in the Crowd: Ground-Plane Localization from Perceived Movements286
RigNet++: Semantic Assisted Repetitive Image Guided Network for Depth Completion274
AdaStereo: An Efficient Domain-Adaptive Stereo Matching Approach259
Dissecting Out-of-Distribution Detection and Open-Set Recognition: A Critical Analysis of Methods and Benchmarks232
Correction: Multi-source-free Domain Adaptive Object Detection188
Guest Editorial: Special Issue on Large-Scale Generative Models for Content Creation and Manipulation181
Learning Discriminative Features for Visual Tracking via Scenario Decoupling170
Are Vision Transformers Robust to Spurious Correlations?159
Image Synthesis Under Limited Data: A Survey and Taxonomy158
SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels146
MoDA: Modeling Deformable 3D Objects from Casual Videos141
Common Pole–Polar Properties of Central Catadioptric Sphere and Line Images Used for Camera Calibration140
GenKL: An Iterative Framework for Resolving Label Ambiguity and Label Non-conformity in Web Images Via a New Generalized KL Divergence140
Bootstrapping Vision-Language Models for Frequency-Centric Self-Supervised Remote Physiological Measurement134
From Open Set to Closed Set: Supervised Spatial Divide-and-Conquer for Object Counting128
A Minimal Solution for Image-Based Sphere Estimation128
OpenMonkeyChallenge: Dataset and Benchmark Challenges for Pose Estimation of Non-human Primates112
Instance-dependent Label Distribution Estimation for Learning with Label Noise111
Learning with Enriched Inductive Biases for Vision-Language Models109
Conditional Temporal Variational AutoEncoder for Action Video Prediction109
Delving Deeper into Anti-Aliasing in ConvNets103
EAN: Event Adaptive Network for Enhanced Action Recognition102
FastComposer: Tuning-Free Multi-subject Image Generation with Localized Attention100
BioDrone: A Bionic Drone-Based Single Object Tracking Benchmark for Robust Vision99
Learning Text-to-Video Retrieval from Image Captioning99
Image-based Morphological Characterization of Filamentous Biological Structures with Non-constant Curvature Shape Feature98
Deep Image Deblurring: A Survey97
PanAf20K: A Large Video Dataset for Wild Ape Detection and Behaviour Recognition92
Learning Accurate Low-bit Quantization towards Efficient Computational Imaging88
Guest Editorial: Special Issue on the British Machine Vision Conference 202286
Skeleton Ground Truth Extraction: Methodology, Annotation Tool and Benchmarks85
Vision-Language Alignment Learning Under Affinity and Divergence Principles for Few-Shot Out-of-Distribution Generalization84
Feature Hallucination for Self-supervised Action Recognition83
FunnyNet-W: Multimodal Learning of Funny Moments in Videos in the Wild81
H-SegMed: A Hybrid Method for Prostate Segmentation in TRUS Images via Improved Closed Principal Curve and Improved Enhanced Machine Learning81
Semantic-Based Implicit Feature Transform for Few-Shot Classification76
Cascaded Iterative Transformer for Jointly Predicting Facial Landmark, Occlusion Probability and Head Pose74
A Realism Metric for Generated LiDAR Point Clouds74
Free-view Face Relighting Using a Hybrid Parametric Neural Model on a SMALL-OLAT Dataset72
SRConvNet: A Transformer-Style ConvNet for Lightweight Image Super-Resolution71
UIL-AQA: Uncertainty-Aware Clip-Level Interpretable Action Quality Assessment70
Learning to Generalize Heterogeneous Representation for Cross-Modality Image Synthesis via Multiple Domain Interventions69
Correction: SOTVerse: A User-Defined Task Space of Single Object Tracking68
NAFT and SynthStab: A RAFT-Based Network and a Synthetic Dataset for Digital Video Stabilization67
UMSCS: A Novel Unpaired Multimodal Image Segmentation Method Via Cross-Modality Generative and Semi-supervised Learning67
Project to Adapt: Domain Adaptation for Depth Completion from Noisy and Sparse Sensor Data65
ICEv2: Interpretability, Comprehensiveness, and Explainability in Vision Transformer64
Correction: Consistent Prompt Tuning for Generalized Category Discovery63
UniCanvas: Affordance-Aware Unified Real Image Editing via Customized Text-to-Image Generation62
Guest Editorial: Special Issue on the Promises and Dangers of Large Vision Models60
Exploiting Inter-Sample Affinity for Knowability-Aware Universal Domain Adaptation59
Bi-calibration Networks for Weakly-Supervised Video Representation Learning57
0.46371912956238