Computer Vision and Image Understanding

Papers
(The H4-Index of Computer Vision and Image Understanding is 28. The table below lists the papers at or above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2021-12-01 to 2025-12-01.)
Article | Citations
Luminance prior guided Low-Light 4C catenary image enhancement | 307
Editorial Board | 248
Efficient cross-information fusion decoder for semantic segmentation | 116
3D semantic segmentation based on spatial-aware convolution and shape completion for augmented reality applications | 113
Editorial Board | 100
Robust Teacher: Self-correcting pseudo-label-guided semi-supervised learning for object detection | 97
Siamese self-supervised learning for fine-grained visual classification | 88
Emerging image generation with flexible control of perceived difficulty | 85
Improving the planarity and sharpness of monocularly estimated depth images using the Phong reflection model | 73
Editorial Board | 52
Exploring using jigsaw puzzles for out-of-distribution detection | 51
Extending function mixture network for improved spectral super-resolution | 47
MATTE: Multi-task multi-scale attention | 41
Deducing health cues from biometric data | 40
Editorial Board | 38
Editorial Board | 37
Modality adaptation via feature difference learning for depth human parsing | 35
Twin-SegNet: Dynamically coupled complementary segmentation networks for generalized medical image segmentation | 35
Exploring the differences in adversarial robustness between ViT- and CNN-based models using novel metrics | 34
Feature reconstruction and metric based network for few-shot object detection | 34
RetSeg3D: Retention-based 3D semantic segmentation for autonomous driving | 34
Lightweight feature point detection network with channel enhancement | 33
Convolutional neural network framework for deepfake detection: A diffusion-based approach | 33
CRML-Net: Cross-Modal Reasoning and Multi-Task Learning Network for tooth image segmentation | 31
REST: A resolution preserving network for photorealistic style transfer via semantic distillation | 31
Implicit and explicit commonsense for multi-sentence video captioning | 30
RelFormer: Advancing contextual relations for transformer-based dense captioning | 30
Reverse Stable Diffusion: What prompt was used to generate this image? | 29
Iterative Caption Generation with Heuristic Guidance for enhancing knowledge-based visual question answering | 28
GaitBranch: A multi-branch refinement model combined with frame-channel attention mechanism for gait recognition | 28
PConvSRGAN: Real-world super-resolution reconstruction with pure convolutional networks | 28
Robust detection of dehazed images via dual-stream CNNs with adaptive feature fusion | 28
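
The page does not spell out how the H4-Index is computed. A minimal sketch, assuming it is the standard h-index (the largest h such that at least h papers have h or more citations each) restricted to the four-year window, can be checked against the citation counts in the table. The function name `h_index` and the list `table_counts` are illustrative, not part of any CrossRef tooling.

```python
def h_index(citations):
    """Return the largest h such that at least h entries are >= h.

    Assumption: this is the usual h-index definition; the source page
    does not state its exact formula.
    """
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank  # the top `rank` papers all have at least `rank` citations
        else:
            break
    return h


# Citation counts copied from the table, in listed order.
table_counts = [
    307, 248, 116, 113, 100, 97, 88, 85, 73, 52, 51, 47, 41, 40, 38, 37,
    35, 35, 34, 34, 34, 33, 33, 31, 31, 30, 30, 29, 28, 28, 28, 28,
]

print(h_index(table_counts))  # 28 under the assumed definition
```

Papers with fewer than 28 citations are not listed, but under this definition they could not raise the result: only 28 papers have 29 or more citations, so h cannot reach 29.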