International Journal of Computer Vision

Papers
(The H4-Index of International Journal of Computer Vision is 52. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-09-01 to 2025-09-01.)
ArticleCitations
Learning Accurate Performance Predictors for Ultrafast Automated Model Compression1612
Dissecting Out-of-Distribution Detection and Open-Set Recognition: A Critical Analysis of Methods and Benchmarks1175
Guest Editorial: Special Issue on Open-World Visual Recognition1171
Instance-Aware Scene Layout Forecasting489
Exploring the Semi-Supervised Video Object Segmentation Problem from a Cyclic Perspective400
Physical Representation Learning and Parameter Identification from Video Using Differentiable Physics293
GenKL: An Iterative Framework for Resolving Label Ambiguity and Label Non-conformity in Web Images Via a New Generalized KL Divergence249
Common Pole–Polar Properties of Central Catadioptric Sphere and Line Images Used for Camera Calibration236
Correction: Multi-source-free Domain Adaptive Object Detection210
Learning Text-to-Video Retrieval from Image Captioning208
Learning Discriminative Features for Visual Tracking via Scenario Decoupling187
MoDA: Modeling Deformable 3D Objects from Casual Videos161
Conditional Temporal Variational AutoEncoder for Action Video Prediction160
From Open Set to Closed Set: Supervised Spatial Divide-and-Conquer for Object Counting148
Guest Editorial: Special Issue on Large-Scale Generative Models for Content Creation and Manipulation142
Bootstrapping Vision-Language Models for Frequency-Centric Self-Supervised Remote Physiological Measurement140
Are Vision Transformers Robust to Spurious Correlations?137
BioDrone: A Bionic Drone-Based Single Object Tracking Benchmark for Robust Vision129
PanAf20K: A Large Video Dataset for Wild Ape Detection and Behaviour Recognition125
Learning with Enriched Inductive Biases for Vision-Language Models119
RigNet++: Semantic Assisted Repetitive Image Guided Network for Depth Completion117
View Birdification in the Crowd: Ground-Plane Localization from Perceived Movements113
AdaStereo: An Efficient Domain-Adaptive Stereo Matching Approach111
Learning Extensible Series-Parallel Lookup Tables for Efficient Image Super-Resolution105
Image Synthesis Under Limited Data: A Survey and Taxonomy103
FastComposer: Tuning-Free Multi-subject Image Generation with Localized Attention102
Delving Deeper into Anti-Aliasing in ConvNets98
EAN: Event Adaptive Network for Enhanced Action Recognition95
OpenMonkeyChallenge: Dataset and Benchmark Challenges for Pose Estimation of Non-human Primates94
Deep Image Deblurring: A Survey92
SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels92
A Minimal Solution for Image-Based Sphere Estimation85
Instance-dependent Label Distribution Estimation for Learning with Label Noise85
Correction: SOTVerse: A User-Defined Task Space of Single Object Tracking84
NAFT and SynthStab: A RAFT-Based Network and a Synthetic Dataset for Digital Video Stabilization81
VideoQA in the Era of LLMs: An Empirical Study72
Learning to Generalize Heterogeneous Representation for Cross-Modality Image Synthesis via Multiple Domain Interventions70
UMSCS: A Novel Unpaired Multimodal Image Segmentation Method Via Cross-Modality Generative and Semi-supervised Learning68
Relating View Directions of Complementary-View Mobile Cameras via the Human Shadow67
Project to Adapt: Domain Adaptation for Depth Completion from Noisy and Sparse Sensor Data65
ICEv2: Interpretability, Comprehensiveness, and Explainability in Vision Transformer63
Bi-calibration Networks for Weakly-Supervised Video Representation Learning61
Semantic-Based Implicit Feature Transform for Few-Shot Classification61
Feature Hallucination for Self-supervised Action Recognition61
Sfnet: Faster and Accurate Semantic Segmentation Via Semantic Flow60
Lightweight and Progressively-Scalable Networks for Semantic Segmentation60
Diagram Perception Networks for Textbook Question Answering via Joint Optimization59
Weakly Supervised Training of Universal Visual Concepts for Multi-domain Semantic Segmentation56
A Realism Metric for Generated LiDAR Point Clouds54
Skeleton Ground Truth Extraction: Methodology, Annotation Tool and Benchmarks54
Free-view Face Relighting Using a Hybrid Parametric Neural Model on a SMALL-OLAT Dataset53
Exploiting Inter-Sample Affinity for Knowability-Aware Universal Domain Adaptation53
Guest Editorial: Special Issue on the British Machine Vision Conference 202252
0.1145920753479