Computer Vision and Image Understanding

Papers
(The H4-Index of Computer Vision and Image Understanding is 25. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-04-01 to 2025-04-01.)
ArticleCitations
Semantic-preserved point-based human avatar243
Rebalanced supervised contrastive learning with prototypes for long-tailed visual recognition216
Fake News Detection Based on BERT Multi-domain and Multi-modal Fusion Network208
Incremental few-shot instance segmentation via feature enhancement and prototype calibration191
A multi-modal explainability approach for human-aware robots in multi-party conversation128
Luminance prior guided Low-Light 4C catenary image enhancement96
Robust Teacher: Self-correcting pseudo-label-guided semi-supervised learning for object detection83
Target-aware and spatial-spectral discriminant feature joint correlation filters for hyperspectral video object tracking69
Seam estimation based on dense matching for parallax-tolerant image stitching67
Dual stage semantic information based generative adversarial network for image super-resolution66
Multi-Scale Adaptive Skeleton Transformer for action recognition61
Exploring using jigsaw puzzles for out-of-distribution detection59
PMGNet: Disentanglement and entanglement benefit mutually for compositional zero-shot learning55
Spatial attention for human-centric visual understanding: An Information Bottleneck method47
MDC-Net: Multi-domain constrained kernel estimation network for blind image super resolution46
Deep video compression based on Long-range Temporal Context Learning43
Static graph convolution with learned temporal and channel-wise graph topology generation for skeleton-based action recognition41
RFCNet: Enhancing urban segmentation using regularization, fusion, and completion32
3D scene generation for zero-shot learning using ChatGPT guided language prompts32
An egocentric video and eye-tracking dataset for visual search in convenience stores30
RetSeg3D: Retention-based 3D semantic segmentation for autonomous driving29
Local optimization cropping and boundary enhancement for end-to-end weakly-supervised segmentation network28
UATST: Towards unpaired arbitrary text-guided style transfer with cross-space modulation28
Trimap-guided feature mining and fusion network for natural image matting26
Adaptive gradients and weight projection based on quantized neural networks for efficient image classification26
0.15159296989441