Image and Vision Computing

Papers
(The median citation count of Image and Vision Computing is 2. The table below lists the papers above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2021-04-01 to 2025-04-01.)
Article | Citations
Vehicle re-identification with large separable kernel attention and hybrid channel attention | 129
ESDA: Zero-shot semantic segmentation based on an embedding semantic space distribution adjustment strategy | 117
A polar-edge context-aware (PECA) network for mirror segmentation | 111
Dense graph convolutional neural networks on 3D meshes for 3D object segmentation and classification | 109
SalFBNet: Learning pseudo-saliency distribution via feedback convolutional networks | 103
Single image dehazing using extended local dark channel prior | 88
BF3D: Bi-directional fusion 3D detector with semantic sampling and geometric mapping | 65
Effective hybrid attention network based on pseudo-color enhancement in ultrasound image segmentation | 61
Underwater image enhancement based on global features and prior distribution guided | 51
Cross-modal hybrid architectures for gastrointestinal tract image analysis: A systematic review and futuristic applications | 51
MetaPix: Domain transfer for semantic segmentation by meta pixel weighting | 51
Editorial Board | 47
Fuzzy set-based Bernoulli Random Noise Weighted Loss for unsupervised person re-identification | 46
ATOM: Self-supervised human action recognition using atomic motion representation learning | 41
Authenticating and securing healthcare records: A deep learning-based zero watermarking approach | 40
Localization of diffusion model-based inpainting through the inter-intra similarity of frequency features | 39
Attribute discrimination combined with selected sample dropout for unsupervised domain adaptive person re-identification | 38
Privacy-preserving explainable AI enable federated learning-based denoising fingerprint recognition model | 37
CLBSR: A deep curriculum learning-based blind image super resolution network using geometrical prior | 35
Grad-CAM based explanations for multiocular disease detection using Xception net | 34
3D-ISRNet: Generating 3D point clouds through image similarity retrieval in a complex background from a single image | 33
Detecting adversarial samples by noise injection and denoising | 30
Machine learning based video segmentation of moving scene by motion index using IO detector and shot segmentation | 29
Modeling content-attribute preference for personalized image esthetics assessment | 29
Object aspect classification and 6DoF pose estimation | 29
: Robust real-time shape-from-template, a C++ library | 28
View knowledge transfer network for multi-view action recognition | 28
Exploring global context and position-aware representation for group activity recognition | 27
Synthetic lidar point cloud generation using deep generative models for improved driving scene object recognition | 26
OCUCFormer: An Over-Complete Under-Complete Transformer Network for accelerated MRI reconstruction | 26
Editorial Board | 25
Hourglass cascaded recurrent stereo matching network | 25
Editorial Board | 25
Proactive hybrid learning framework for real-time multi-vehicle detection in unregulated traffic environments | 25
Editorial Board | 24
MFC-Net: Multi-feature fusion cross neural network for salient object detection | 24
Cross-scale global attention feature pyramid network for person search | 24
ECT: Fine-grained edge detection with learned cause tokens | 24
Editorial Board | 24
Editorial Board | 23
LocalFace: Learning significant local features for deep face recognition | 23
Faster and finer pose estimation for multiple instance objects in a single RGB image | 23
Learning language to symbol and language to vision mapping for visual grounding | 23
3D human body modeling with orthogonal human mask image based on multi-channel Swin transformer architecture | 22
IAC-ReCAM: Two-dimensional attention modulation and category label guidance for weakly supervised semantic segmentation | 22
Editorial Board | 22
Spatial–temporal sequential network for anomaly detection based on long short-term magnitude representation | 21
ABC: Aligning binary centers for single-stage monocular 3D object detection | 21
CFENet: Context-aware Feature Enhancement Network for efficient few-shot object counting | 21
μPEWFace: Parallel ensemble of weighted deep convolutional neural networks with novel loss functions for face-based authentication | 21
Feature extraction and fusion algorithm for infrared visible light images based on residual and generative adversarial network | 20
Unmasking deepfakes: Eye blink pattern analysis using a hybrid LSTM and MLP-CNN model | 20
Artificial immune systems for data augmentation | 20
Language and vision based person re-identification for surveillance systems using deep learning with LIP layers | 19
AI-powered trustable and explainable fall detection system using transfer learning | 19
CFFNet: Coordinated feature fusion network for crowd counting | 19
A study on attention-based LSTM for abnormal behavior recognition with variable pooling | 19
GAN-BodyPose: Real-time 3D human body pose data key point detection and quality assessment assisted by generative adversarial network | 19
RGB-T tracking by modality difference reduction and feature re-selection | 19
Batch feature standardization network with triplet loss for weakly-supervised video anomaly detection | 19
Boosting semi-supervised face recognition with raw faces | 18
Image-based human re-identification: Which covariates are actually (the most) important? | 18
SCTrans: Self-align and cross-align transformer for few-shot segmentation | 18
Real-time 3D human pose estimation without skeletal a priori structures | 17
Feature decoupling and interaction network for defending against adversarial examples | 17
A data augmentation approach that ensures the reliability of foregrounds in medical image segmentation | 17
PTPFusion: A progressive infrared and visible image fusion network based on texture preserving | 17
Does explainable machine learning uncover the black box in vision applications? | 17
Alignment and fusion for adaptive domain nighttime semantic segmentation | 17
Drone-NeRF: Efficient NeRF based 3D scene reconstruction for large-scale drone survey | 16
Multi-layer capsule network with joint dynamic routing for fire recognition | 16
Video object segmentation by multi-scale attention using bidirectional strategy | 16
Parameter efficient finetuning of text-to-image models with trainable self-attention layer | 16
UTR: A UNet-like transformer for efficient unsupervised medical image registration | 16
Multi-label recognition in open driving scenarios based on bipartite-driven superimposed dynamic graph | 16
Background debiased class incremental learning for video action recognition | 16
RGB road scene material segmentation | 16
Continual coarse-to-fine domain adaptation in semantic segmentation | 16
Weakly supervised moment localization with natural language based on semantic reconstruction | 15
Hyperspherically regularized networks for self-supervision | 15
Active domain adaptation for semantic segmentation via dynamically balancing domainness and uncertainty | 15
G-TRACE: Grouped temporal recalibration for video object segmentation | 15
Image captioning: Semantic selection unit with stacked residual attention | 15
Few-shot classification with multisemantic information fusion network | 15
DeepArUco++: Improved detection of square fiducial markers in challenging lighting conditions | 15
Adaptive weight based on overlapping blocks network for facial expression recognition | 15
Learning an augmentation strategy for sparse datasets | 15
Improving eye movement biometrics in low frame rate eye-tracking devices using periocular and eye blinking features | 15
Person search over security video surveillance systems using deep learning methods: A review | 15
Generous teacher: Good at distilling knowledge for student learning | 15
Multiscale features integration based multiple-in-single-out network for object detection | 15
Facial expression recognition using densely connected convolutional neural network and hierarchical spatial attention | 14
CRENet: Crowd region enhancement network for multi-person 3D pose estimation | 14
SAVE: Encoding spatial interactions for vision transformers | 14
Distance metric-based learning for long-tail object detection | 14
Integrating prior knowledge into a bibranch pyramid network for medical image segmentation | 14
Exploring holistic discriminative representation for micro-expression recognition via contrastive learning | 14
FSformer: Fast-Slow Transformer for video action recognition | 14
Robust ensemble person reidentification via orthogonal fusion with occlusion handling | 14
Exploiting recollection effects for memory-based video object segmentation | 14
STRFormer: Spatial–Temporal–ReTemporal Transformer for 3D human pose estimation | 13
Efficient masked feature and group attention network for stereo image super-resolution | 13
A robust image representation method against illumination and occlusion variations | 13
HPD-Depth: High performance decoding network for self-supervised monocular depth estimation | 13
Learning weakly supervised audio-visual violence detection in hyperbolic space | 13
Knowledge graph construction in hyperbolic space for automatic image annotation | 13
EDCAANet: A lightweight COD network based on edge detection and coordinate attention assistance | 13
Exploring cross-video matching for few-shot video classification via dual-hierarchy graph neural network learning | 13
A dedicated benchmark for contour-based corner detection evaluation | 12
DiPS: Discriminative pseudo-label sampling with self-supervised transformers for weakly supervised object localization | 12
Integration of ultrasound and mammogram for multimodal classification of breast cancer using hybrid residual neural network and machine learning | 12
WITHDRAWN: Lips-SpecFormer: Non-linear interpolable transformer for spectral reconstruction using adjacent channel coupling | 12
Innovative underwater image enhancement algorithm: Combined application of adaptive white balance color compensation and pyramid image fusion to submarine algal microscopy | 12
Improving distinctiveness in video captioning with text-video similarity | 12
Semantic segmentation of large-scale point clouds by integrating attention mechanisms and transformer models | 12
Unified Volumetric Avatar: Enabling flexible editing and rendering of neural human representations | 12
TransWild: Enhancing 3D interacting hands recovery in the wild with IoU-guided Transformer | 12
Dense small target detection algorithm for UAV aerial imagery | 12
Learning diverse and deep clues for person reidentification | 12
Transformer-based feature interactor for person re-identification with margin self-punishment loss | 12
Contrastive learning based facial action unit detection in children with hearing impairment for a socially assistive robot platform | 12
A lightweight depth completion network with spatial efficient fusion | 12
Multi-information guided camouflaged object detection | 12
Acute lymphocytic leukemia detection and subtype classification via extended wavelet pooling based-CNNs and statistical-texture features | 12
FEANet: Foreground-edge-aware network with DenseASPOC for human parsing | 11
Pyramid quaternion discrete cosine transform based ConvNet for cancelable face recognition | 11
Visual tracking based on spatiotemporal transformer and fusion sequences | 11
Edge-aware salient object detection network via context guidance | 11
Accurate and efficient salient object detection via position prior attention | 11
Attention guided contextual feature fusion network for salient object detection | 11
PU-GACNet: Graph Attention Convolution Network for Point Cloud Upsampling | 11
A lightweight network for monocular depth estimation with decoupled body and edge supervision | 11
RAMT-GAN: Realistic and accurate makeup transfer with generative adversarial network | 11
Gait recognition via View-aware Part-wise Attention and Multi-scale Dilated Temporal Extractor | 11
Unsupervised person re-identification by dynamic hybrid contrastive learning | 10
A 3D multi-scale CycleGAN framework for generating synthetic PETs from MRIs for Alzheimer's disease diagnosis | 10
Knowledge distillation methods for efficient unsupervised adaptation across multiple domains | 10
An automated hyperparameter tuned deep learning model enabled facial emotion recognition for autonomous vehicle drivers | 10
Depth awakens: A depth-perceptual attention fusion network for RGB-D camouflaged object detection | 10
Multi parallel U-net encoder network for effective polyp image segmentation | 10
A new deepfake detection model for responding to perception attacks in embodied artificial intelligence | 10
FMD-Yolo: An efficient face mask detection method for COVID-19 prevention and control in public | 10
A dual-channel network based on occlusion feature compensation for human pose estimation | 10
ASF-YOLO: A novel YOLO model with attentional scale sequence fusion for cell instance segmentation | 10
Multi-branch residual image semantic segmentation combined with inverse weight gated-control | 10
A few-shot learning-based ischemic stroke segmentation system using weighted MRI fusion | 10
An instance-level data balancing method for object detection via contextual information alignment | 10
LSTM with bio inspired algorithm for action recognition in sports videos | 10
LDWS-net: A learnable deep wavelet scattering network for RGB salient object detection | 10
Fine-grained bidirectional attentional generation and knowledge-assisted networks for cross-modal retrieval | 10
Single stage architecture for improved accuracy real-time object detection on mobile devices | 10
Visible thermal person re-identification via multi-branch modality residual complementary learning | 10
Enhancing few-shot object detection through pseudo-label mining | 9
Editorial Board | 9
3D human avatar reconstruction with neural fields: A recent survey | 9
Improving defocus blur detection via adaptive supervision prior-tokens | 9
Bidirectional Attentional Interaction Networks for RGB-D salient object detection | 9
Modality interactive attention for cross-modality person re-identification | 9
Editorial Board | 9
A motion model based on recurrent neural networks for visual object tracking | 9
Lightweight and efficient feature fusion real-time semantic segmentation network | 9
Hierarchical spatiotemporal Feature Interaction Network for video saliency prediction | 9
Editorial Board | 9
Dual-scale point cloud completion network based on high-frequency feature fusion | 9
C2F: An effective coarse-to-fine network for video summarization | 9
AwareTrack: Object awareness for visual tracking via templates interaction | 9
Editorial Board | 9
Accurate video saliency prediction via hierarchical fusion and temporal recurrence | 9
Mobile-friendly and multi-feature aggregation via transformer for human pose estimation | 9
Pose-guided counterfactual inference for occluded person re-identification | 9
Self-supervised Vision Transformers for 3D pose estimation of novel objects | 9
Universal domain adaptation from multiple black-box sources | 9
A Point-2s reinforcement learning biomimetic model for estimating and analyzing human 3D motion posture | 9
Visual question answering model based on graph neural network and contextual attention | 9
Editorial Board | 9
1D kernel distillation network for efficient image super-resolution | 9
Moment preserving tomographic image reconstruction model | 8
Flow guided mutual attention for person re-identification | 8
DeepSegment: Segmentation of motion capture data using deep convolutional neural network | 8
Detection of tuberculosis using customized MobileNet and transfer learning from chest X-ray image | 8
Detection of dental periapical lesions using retinex based image enhancement and lightweight deep learning model | 8
Giving loss a personal course: Universal loss reweighting to improve stereo matching via uncertainty guidance | 8
Occlusion-aware deep convolutional neural network via homogeneous Tanh-transforms for face parsing | 8
Boosting certified robustness via an expectation-based similarity regularization | 8
Pedestrian detection in low-light conditions: A comprehensive survey | 8
Boundary guidance network for camouflage object detection | 8
Lightweight boundary refinement module based on point supervision for semantic segmentation | 8
Depth assisted novel view synthesis using few images | 8
Environmentally adaptive fast object detection in UAV images | 8
Exploring the synergy between textual identity and visual signals in human-object interaction | 8
Deep learning with adaptive convolutions for classification of retinal diseases via optical coherence tomography | 8
A robust direct linear transformation for camera pose estimation using points | 8
Multiscale parallel deep CNN (mpdCNN) architecture for the real low-resolution face recognition for surveillance | 8
Federated learning based nonlinear two-stage framework for full-reference image quality assessment: An application for biometric | 8
You look so different! Haven’t I seen you a long time ago? | 7
GPLM: Enhancing underwater images with Global Pyramid Linear Modulation | 7
Video object segmentation based on dynamic perception update and feature fusion | 7
VAE-GAN3D: Leveraging image-based semantics for 3D zero-shot recognition | 7
Person re-identification: A taxonomic survey and the path ahead | 7
Automatic deep sparse clustering with a dynamic population-based evolutionary algorithm using reinforcement learning and transfer learning | 7
Multi-scale interaction transformer for temporal action proposal generation | 7
Novel approach for fast structured light framework using deep learning | 7
AI-based intelligent hybrid framework (BO-DenseXGB) for multi-classification of brain tumor using MRI | 7
Nighttime scene understanding with label transfer scene parser | 7
CWGA-Net: Center-Weighted Graph Attention Network for 3D object detection from point clouds | 7
Interactive multi-scale feature representation enhancement for small object detection | 7
Combining complementary trackers for enhanced long-term visual object tracking | 7
Self-supervised part segmentation via motion imitation | 7
Bilateral regularized optimization model for edge-preserving image smoothing | 7
A novel micro-expression detection algorithm based on BERT and 3DCNN | 7
Pro-ReID: Producing reliable pseudo labels for unsupervised person re-identification | 7
Regularization by denoising diffusion process meets deep relaxation in phase | 7
Alleviating the generalization issue in adversarial domain adaptation networks | 7
Utilizing Inherent Bias for Memory Efficient Continual Learning: A Simple and Robust Baseline | 7
Semantic-aligned reinforced attention model for zero-shot learning | 7
DGSN: Learning how to segment pedestrians from other datasets for occluded person re-identification | 7
Enhancing weakly supervised semantic segmentation with efficient and robust neighbor-attentive superpixel aggregation | 7
Adversarial color projection: A projector-based physical-world attack to DNNs | 7
Certifiable relative pose estimation | 7
Rotating-YOLO: A novel YOLO model for remote sensing rotating object detection | 7
MINet: Modality interaction network for unified multi-modal tracking | 6
VAESim: A probabilistic approach for self-supervised prototype discovery | 6
Dual subspace clustering for spectral-spatial hyperspectral image clustering | 6
Multi-view daily action recognition based on Hooke balanced matrix and broad learning system | 6
Learning accurate monocular 3D voxel representation via bilateral voxel transformer | 6
Machine learning applications in breast cancer prediction using mammography | 6
Loss reweight in scale dimension: A simple while effective feature selection strategy for anchor-free detectors | 6
Text-augmented Multi-Modality contrastive learning for unsupervised visible-infrared person re-identification | 6
Double chain networks for monocular 3D human pose estimation | 6
Feature attention fusion network for occluded person re-identification | 6
RBGAN: Realistic-generation and balanced-utility GAN for face de-identification | 6
A spatiotemporal motion prediction network based on multi-level feature disentanglement | 6
R2-trans: Fine-grained visual categorization with redundancy reduction | 6
A supervised approach for the detection of AM-FM signals’ interference regions in spectrogram images | 6
Noisy label facial expression recognition via face-specific label distribution learning | 6
SAFENet: Semantic-Aware Feature Enhancement Network for unsupervised cross-domain road scene segmentation | 6
Consistent camera-invariant and noise-tolerant learning for unsupervised person re-identification | 6
Human–object interaction detection with missing objects | 6
Channel and Spatial Enhancement Network for human parsing | 6
Tackling multiple object tracking with complicated motions — Re-designing the integration of motion and appearance | 6
CoNPL: Consistency training framework with noise-aware pseudo labeling for dense pose estimation | 6
Multi-scale feature aggregation and boundary awareness network for salient object detection | 6
Research on efficient detection network method for remote sensing images based on self attention mechanism | 6
RTDOD: A large-scale RGB-thermal domain-incremental object detection dataset for UAVs | 6
Spatial likelihood voting with self-knowledge distillation for weakly supervised object detection | 6
MLRMV: Multi-layer representation for multi-view action recognition | 6
Dense open-set recognition based on training with noisy negative images | 6
AES-Net: An adapter and enhanced self-attention guided network for multi-stage glaucoma classification using fundus images | 6
Rich global feature guided network for monocular depth estimation | 6
Lightweight multi-scale attention-guided network for real-time semantic segmentation | 6