Image and Vision Computing

Papers
(The median citation count of Image and Vision Computing is 3. The table below lists the papers above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2020-05-01 to 2024-05-01.)
Article | Citations
Recent advances in small object detection based on deep learning: A review | 282
Weighted boxes fusion: Ensembling boxes from different object detection models | 223
Deep learning-based object detection in low-altitude UAV datasets: A survey | 146
A comprehensive review on deep learning-based methods for video anomaly detection | 124
Application of the best evacuation model of deep learning in the design of public structures | 114
IoU-aware single-stage object detector for accurate localization | 104
A framework of human action recognition using length control features fusion and weighted entropy-variances based feature selection | 100
FMD-Yolo: An efficient face mask detection method for COVID-19 prevention and control in public | 98
Deep multimodal fusion for semantic image segmentation: A survey | 94
Intelligent video anomaly detection and classification using faster RCNN with deep reinforcement learning model | 80
A review on 2D instance segmentation based on deep neural networks | 67
Deep learning-based detection from the perspective of small or tiny objects: A survey | 59
Anomaly detection in surveillance video based on bidirectional prediction | 59
ReMOT: A model-agnostic refinement for multiple object tracking | 53
Deep learning-based person re-identification methods: A survey and outlook of recent works | 51
Cross-resolution learning for Face Recognition | 46
A review of deep learning techniques for 2D and 3D human pose estimation | 42
Intelligent detection of building cracks based on deep learning | 41
Visual question answering model based on graph neural network and contextual attention | 39
Intelligent deep learning based ethnicity recognition and classification using facial images | 38
Motion saliency based multi-stream multiplier ResNets for action recognition | 37
Person search: New paradigm of person re-identification: A survey and outlook of recent works | 36
A Survey on Object Detection for the Internet of Multimedia Things (IoMT) using Deep Learning and Event-based Middleware: Approaches, Challenges, and Future Directions | 36
Exploring region relationships implicitly: Image captioning with visual relationship attention | 33
Optimization of face recognition algorithm based on deep learning multi feature fusion driven by big data | 32
An improved YOLOv5 method for large objects detection with multi-scale feature cross-layer fusion network | 31
LSTM with bio inspired algorithm for action recognition in sports videos | 31
Iris and periocular biometrics for head mounted displays: Segmentation, recognition, and synthetic data generation | 29
A survey of iris datasets | 27
Generative adversarial networks and their application to 3D face generation: A survey | 26
R4 Det: Refined single-stage detector with feature recursion and refinement for rotating object detection in aerial images | 25
Projection-dependent input processing for 3D object recognition in human robot interaction systems | 24
Robust biometric authentication system with a secure user template | 24
A survey of micro-expression recognition | 24
Facial expression recognition using human machine interaction and multi-modal visualization analysis for healthcare applications | 23
Multimodal facial biometrics recognition: Dual-stream convolutional neural networks with multi-feature fusion layers | 23
CrossATNet - a novel cross-attention based framework for sketch-based image retrieval | 23
A two-stage real-time YOLOv2-based road marking detector with lightweight spatial transformation-invariant classification | 23
Application of 3D laser scanning technology for image data processing in the protection of ancient building sites through deep learning | 23
MEmoR: A Multimodal Emotion Recognition using affective biomarkers for smart prediction of emotional health for people analytics in smart industries | 22
Synthetic data for face recognition: Current state and future prospects | 22
Efficient pedestrian detection in top-view fisheye images using compositions of perspective view patches | 21
PCANet: Pyramid convolutional attention network for semantic segmentation | 21
Feedback-driven loss function for small object detection | 21
EDS pooling layer | 21
RoI Tanh-polar transformer network for face parsing in the wild | 21
An unsupervised domain adaptation scheme for single-stage artwork recognition in cultural sites | 20
Improving image captioning with Pyramid Attention and SC-GAN | 20
Attention-guided chained context aggregation for semantic segmentation | 20
FastNet: Fast high-resolution network for human pose estimation | 20
A survey of methods, datasets and evaluation metrics for visual question answering | 19
Development of an embedded road boundary detection system based on deep learning | 19
Learning to disentangle scenes for person re-identification | 19
Cluster adaptation networks for unsupervised domain adaptation | 19
Energy clustering for unsupervised person re-identification | 18
Improved generative adversarial network and its application in image oil painting style transfer | 18
Generalizable deep features for ocular biometrics | 18
Revisiting crowd counting: State-of-the-art, trends, and future perspectives | 18
Cross-database and cross-attack Iris presentation attack detection using micro stripes analyses | 17
A deep-shallow and global–local multi-feature fusion network for photometric stereo | 17
Facial expression recognition using densely connected convolutional neural network and hierarchical spatial attention | 17
Multi-stream slowFast graph convolutional networks for skeleton-based action recognition | 17
Multiscale parallel deep CNN (mpdCNN) architecture for the real low-resolution face recognition for surveillance | 17
Zero-sum game theory model for segmenting skin regions | 17
Few-Shot learning for face recognition in the presence of image discrepancies for limited multi-class datasets | 17
Investigating bias in deep face analysis: The KANFace dataset and empirical study | 17
Lightweight and computationally faster Hypermetropic Convolutional Neural Network for small size object detection | 17
Multi-information-based convolutional neural network with attention mechanism for pedestrian trajectory prediction | 16
Cross-Correlated Attention Networks for Person Re-Identification | 16
Unsupervised face Frontalization for pose-invariant face recognition | 16
SalFBNet: Learning pseudo-saliency distribution via feedback convolutional networks | 16
Multi-view dynamic facial action unit detection | 16
A neural network aided attuned scheme for gun detection in video surveillance images | 15
IRANet: Identity-relevance aware representation for cloth-changing person re-identification | 15
Dense convolutional feature histograms for robust visual object tracking | 15
Boundary guidance network for camouflage object detection | 15
Point cloud completion using multiscale feature fusion and cross-regional attention | 15
Attention guided contextual feature fusion network for salient object detection | 15
An efficient foreign objects detection network for power substation | 14
Collaborative representation of blur invariant deep sparse features for periocular recognition from smartphones | 14
An attention-based deep learning model for multiple pedestrian attributes recognition | 14
Face anti-spoofing detection based on multi-scale image quality assessment | 14
Self-trained prediction model and novel anomaly score mechanism for video anomaly detection | 14
SalED: Saliency prediction with a pithy encoder-decoder architecture sensing local and global information | 14
Bald eagle search optimization with deep transfer learning enabled age-invariant face recognition model | 14
Improved YOLOX-X based UAV aerial photography object detection algorithm | 14
Beyond modality alignment: Learning part-level representation for visible-infrared person re-identification | 14
Synergetic reconstruction from 2D pose and 3D motion for wide-space multi-person video motion capture in the wild | 14
HPRNet: Hierarchical point regression for whole-body human pose estimation | 13
Real-time semantic segmentation with local spatial pixel adjustment | 13
A new perceptual hashing method for verification and identity classification of occluded faces | 13
Certifiable relative pose estimation | 13
ERF-YOLO: A YOLO algorithm compatible with fewer parameters and higher accuracy | 13
Digital video intrusion intelligent detection method based on narrowband Internet of Things and its application | 13
Dual-path CNN with Max Gated block for text-based person re-identification | 13
Multi-level refinement enriched feature pyramid network for object detection | 13
An automated hyperparameter tuned deep learning model enabled facial emotion recognition for autonomous vehicle drivers | 13
CAM: A fine-grained vehicle model recognition method based on visual attention model | 13
The effect of image recognition traffic prediction method under deep learning and naive Bayes algorithm on freeway traffic safety | 13
Convolutional prototype learning for zero-shot recognition | 13
Pose-guided part matching network via shrinking and reweighting for occluded person re-identification | 13
PU-GACNet: Graph Attention Convolution Network for Point Cloud Upsampling | 13
Expression recognition with deep features extracted from holistic and part-based models | 12
Real-time semantic segmentation with weighted factorized-depthwise convolution | 12
From known to the unknown: Transferring knowledge to answer questions about novel visual and semantic concepts | 12
Fusion of iris and sclera using phase intensive rubbersheet mutual exclusion for periocular recognition | 12
MFC-Net: Multi-feature fusion cross neural network for salient object detection | 12
Variance-guided attention-based twin deep network for cross-spectral periocular recognition | 12
Intelligent multimodal pedestrian detection using hybrid metaheuristic optimization with deep learning model | 12
A calibration method of computer vision system based on dual attention mechanism | 12
Detection of anomaly in surveillance videos using quantum convolutional neural networks | 12
Dense open-set recognition based on training with noisy negative images | 12
A novel co-attention computation block for deep learning based image co-segmentation | 12
Multimodal assessment of apparent personality using feature attention and error consistency constraint | 12
Novel features for art movement classification of portrait paintings | 12
CrossFusion net: Deep 3D object detection based on RGB images and point clouds in autonomous driving | 12
I-SOCIAL-DB: A labeled database of images collected from websites and social media for Iris recognition | 12
Multi-level prediction Siamese network for real-time UAV visual tracking | 12
Explaining VQA predictions using visual grounding and a knowledge base | 12
Combining complementary trackers for enhanced long-term visual object tracking | 12
Feature based video stabilization based on boosted HAAR Cascade and representative point matching algorithm | 12
Cancelable Iris template generation by aggregating patch level ordinal relations with its holistically extended performance and security analysis | 12
Using synthetic data for person tracking under adverse weather conditions | 11
A study on attention-based LSTM for abnormal behavior recognition with variable pooling | 11
Face mask detection using deep convolutional neural network and multi-stage image processing | 11
Spatiotemporal module for video saliency prediction based on self-attention | 11
Double anchor embedding for accurate multi-person 2D pose estimation | 11
Dense graph convolutional neural networks on 3D meshes for 3D object segmentation and classification | 11
Joint detection and tracking in videos with identification features | 10
Deep hybrid learning for facial expression binary classifications and predictions | 10
Image captioning via proximal policy optimization | 10
Viewpoint constrained and unconstrained Cricket stroke localization from untrimmed videos | 10
Multistage temporal convolution transformer for action segmentation | 10
A motion model based on recurrent neural networks for visual object tracking | 10
Gender based face aging with cycle-consistent adversarial networks | 10
Few-shot object detection via baby learning | 10
E2E-VSDL: End-to-end video surveillance-based deep learning model to detect and prevent criminal activities | 10
How robust are discriminatively trained zero-shot learning models? | 10
Interactive multi-scale feature representation enhancement for small object detection | 10
Co-occurrence of deep convolutional features for image search | 10
Detection of panoramic vision pedestrian based on deep learning | 10
Point cloud classification with deep normalized Reeb graph convolution | 10
Edge supervision and multi-scale cost volume for stereo matching | 10
Transformer models for enhancing AttnGAN based text to image generation | 10
Video prediction by efficient transformers | 10
Pose-guided counterfactual inference for occluded person re-identification | 9
Towards generalized morphing attack detection by learning residuals | 9
E2E-V2SResNet: Deep residual convolutional neural networks for end-to-end video driven speech synthesis | 9
Composite recurrent network with internal denoising for facial alignment in still and video images in the wild | 9
Dual-branch adaptive attention transformer for occluded person re-identification | 9
Does explainable machine learning uncover the black box in vision applications? | 9
ASPset: An outdoor sports pose video dataset with 3D keypoint annotations | 9
Handcrafted localized phase features for human action recognition | 9
PDA: Proxy-based domain adaptation for few-shot image recognition | 9
Demographic classification through pupil analysis | 9
Emotion detection and face recognition of drivers in autonomous vehicles in IoT platform | 9
Tracking fiducial markers with discriminative correlation filters | 9
Multimodal emotion recognition using cross modal audio-video fusion with attention and deep metric learning | 9
MDCS with fully encoding the information of local shape description for 3D Rigid Data matching | 8
A pooling-based feature pyramid network for salient object detection | 8
Short-term anchor linking and long-term self-guided attention for video object detection | 8
Omnidirectional stereo depth estimation based on spherical deep network | 8
Tackling multiple object tracking with complicated motions — Re-designing the integration of motion and appearance | 8
SiaTrans: Siamese transformer network for RGB-D salient object detection with depth image classification | 8
Activity guided multi-scales collaboration based on scaled-CNN for saliency prediction | 8
Video-based person re-identification by intra-frame and inter-frame graph neural network | 8
Knowledge distillation methods for efficient unsupervised adaptation across multiple domains | 8
Whether normalized or not? Towards more robust iris recognition using dynamic programming | 8
Edge-aware salient object detection network via context guidance | 8
A Tibetan Thangka data set and relative tasks | 8
H-net: Unsupervised domain adaptation person re-identification network based on hierarchy | 8
Intelligent facial expression recognition and classification using optimal deep transfer learning model | 8
Real-time gait biometrics for surveillance applications: A review | 8
Lightweight boundary refinement module based on point supervision for semantic segmentation | 8
Boundary graph convolutional network for temporal action detection | 8
A novel micro-expression detection algorithm based on BERT and 3DCNN | 8
Continual coarse-to-fine domain adaptation in semantic segmentation | 8
Texture classification-based feature processing for violence-based anomaly detection in crowded environments | 8
View knowledge transfer network for multi-view action recognition | 8
RAMT-GAN: Realistic and accurate makeup transfer with generative adversarial network | 8
Crowd density detection method based on crowd gathering mode and multi-column convolutional neural network | 8
Camera pose estimation in multi-view environments: From virtual scenarios to the real world | 8
Adversarial sliced Wasserstein domain adaptation networks | 8
Triangulate geometric constraint combined with visual-flow fusion network for accurate 6DoF pose estimation | 8
Generating facial expression adversarial examples based on saliency map | 8
Clothing generation by multi-modal embedding: A compatibility matrix-regularized GAN model | 8
Cross-modal feature extraction and integration based RGBD saliency detection | 8
Improving eye movement biometrics in low frame rate eye-tracking devices using periocular and eye blinking features | 8
Single stage architecture for improved accuracy real-time object detection on mobile devices | 8
Context-based image explanations for deep neural networks | 8
Geometry consistency aware confidence evaluation for feature matching | 8
Adaptive weight based on overlapping blocks network for facial expression recognition | 8
A survey on computer vision based human analysis in the COVID-19 era | 7
Single-shot cuboids: Geodesics-based end-to-end Manhattan aligned layout estimation from spherical panoramas | 7
Spatial temporal and channel aware network for video-based person re-identification | 7
Advances in deep learning-based image recognition of product packaging | 7
Flow guided mutual attention for person re-identification | 7
Multi parallel U-net encoder network for effective polyp image segmentation | 7
Learning visual variation for object recognition | 7
Depth-guided saliency detection via boundary information | 7
Human object interaction detection: Design and survey | 7
Deep domain adaptation with ordinal regression for pain assessment using weakly-labeled videos | 7
R2Net: Residual refinement network for salient object detection | 7
Multimodal image fusion based on point-wise mutual information | 7
Single image dehazing using extended local dark channel prior | 7
Multi-source material image optimized selection based multi-option composition | 7
Few-shot personalized saliency prediction using meta-learning | 7
Distinguishing foreground and background alignment for unsupervised domain adaptative semantic segmentation | 7
Aligning vision-language for graph inference in visual dialog | 7
LP-GAN: Learning perturbations based on generative adversarial networks for point cloud adversarial attacks | 7
Learning an augmentation strategy for sparse datasets | 7
Batch feature standardization network with triplet loss for weakly-supervised video anomaly detection | 7
VQA as a factoid question answering problem: A novel approach for knowledge-aware and explainable visual question answering | 6
Grassmann manifold based framework for automated fall detection from a camera | 6
GIFSL - grafting based improved few-shot learning | 6
Cuepervision: self-supervised learning for continuous domain adaptation without catastrophic forgetting | 6
ScPnP: A non-iterative scale compensation solution for PnP problems | 6
Progressive ShallowNet for large scale dynamic and spontaneous facial behaviour analysis in children | 6
Reinforced pedestrian attribute recognition with group optimization reward | 6
Engagement detection and enhancement for STEM education through computer vision, augmented reality, and haptics | 6
ArCo: Attention-reinforced transformer with contrastive learning for image captioning | 6
Double cross-modality progressively guided network for RGB-D salient object detection | 6
Person re-identification: A taxonomic survey and the path ahead | 6
Bias alleviating generative adversarial network for generalized zero-shot classification | 6
Faster and finer pose estimation for multiple instance objects in a single RGB image | 6
Modeling graph-structured contexts for image captioning | 6
Monocular contextual constraint for stereo matching with adaptive weights assignment | 6
Non-local attention association scheme for online multi-object tracking | 6
Learning rebalanced human parsing model from imbalanced datasets | 6
Local information fusion network for 3D shape classification and retrieval | 6
Dual guidance enhanced network for light field salient object detection | 6
Building NAS: Automatic designation of efficient neural architectures for building extraction in high-resolution aerial images | 6
AGA-GAN: Attribute Guided Attention Generative Adversarial Network with U-Net for face hallucination | 6
Cross-view action recognition with small-scale datasets | 6
Geometric feature statistics histogram for both real-valued and binary feature representations of 3D local shape | 6
Enhancing single-view 3D mesh reconstruction with the aid of implicit surface learning | 6
Multi–feature fusion tracking algorithm based on peak–context learning | 6
Joint patch and instance discrimination learning for unsupervised person re-identification | 6
Multi-view self-supervised learning for 3D facial texture reconstruction from single image | 6
A dynamic keypoint selection network for 6DoF pose estimation | 5
Multi-scale interaction transformer for temporal action proposal generation | 5
2D progressive fusion module for action recognition | 5
Attention-guided aggregation stereo matching network | 5
Task-based parameter isolation for foreground segmentation without catastrophic forgetting using multi-scale region and edges fusion network | 5
Online-adaptive classification and regression network with sample-efficient meta learning for long-term tracking | 5
Special issue on role of computer vision in smart cities | 5
Feature fusion for object detection at one map | 5
You look so different! Haven’t I seen you a long time ago? | 5
Collaborative knowledge distillation for incomplete multi-view action prediction | 5
Accurate and efficient salient object detection via position prior attention | 5