Image and Vision Computing

Papers
(The TQCC of Image and Vision Computing is 8. The table below lists the papers whose CrossRef citation counts are at or above that threshold [max. 250 papers]. It covers papers published in the past four years, i.e., from 2020-05-01 to 2024-05-01.)
Article | Citations
Recent advances in small object detection based on deep learning: A review | 282
Weighted boxes fusion: Ensembling boxes from different object detection models | 223
Deep learning-based object detection in low-altitude UAV datasets: A survey | 146
A comprehensive review on deep learning-based methods for video anomaly detection | 124
Application of the best evacuation model of deep learning in the design of public structures | 114
IoU-aware single-stage object detector for accurate localization | 104
A framework of human action recognition using length control features fusion and weighted entropy-variances based feature selection | 100
FMD-Yolo: An efficient face mask detection method for COVID-19 prevention and control in public | 98
Deep multimodal fusion for semantic image segmentation: A survey | 94
Intelligent video anomaly detection and classification using faster RCNN with deep reinforcement learning model | 80
A review on 2D instance segmentation based on deep neural networks | 67
Anomaly detection in surveillance video based on bidirectional prediction | 59
Deep learning-based detection from the perspective of small or tiny objects: A survey | 59
ReMOT: A model-agnostic refinement for multiple object tracking | 53
Deep learning-based person re-identification methods: A survey and outlook of recent works | 51
Cross-resolution learning for Face Recognition | 46
A review of deep learning techniques for 2D and 3D human pose estimation | 42
Intelligent detection of building cracks based on deep learning | 41
Visual question answering model based on graph neural network and contextual attention | 39
Intelligent deep learning based ethnicity recognition and classification using facial images | 38
Motion saliency based multi-stream multiplier ResNets for action recognition | 37
Person search: New paradigm of person re-identification: A survey and outlook of recent works | 36
A Survey on Object Detection for the Internet of Multimedia Things (IoMT) using Deep Learning and Event-based Middleware: Approaches, Challenges, and Future Directions | 36
Exploring region relationships implicitly: Image captioning with visual relationship attention | 33
Optimization of face recognition algorithm based on deep learning multi feature fusion driven by big data | 32
An improved YOLOv5 method for large objects detection with multi-scale feature cross-layer fusion network | 31
LSTM with bio inspired algorithm for action recognition in sports videos | 31
Iris and periocular biometrics for head mounted displays: Segmentation, recognition, and synthetic data generation | 29
A survey of iris datasets | 27
Generative adversarial networks and their application to 3D face generation: A survey | 26
R4 Det: Refined single-stage detector with feature recursion and refinement for rotating object detection in aerial images | 25
Robust biometric authentication system with a secure user template | 24
A survey of micro-expression recognition | 24
Projection-dependent input processing for 3D object recognition in human robot interaction systems | 24
A two-stage real-time YOLOv2-based road marking detector with lightweight spatial transformation-invariant classification | 23
Application of 3D laser scanning technology for image data processing in the protection of ancient building sites through deep learning | 23
Facial expression recognition using human machine interaction and multi-modal visualization analysis for healthcare applications | 23
Multimodal facial biometrics recognition: Dual-stream convolutional neural networks with multi-feature fusion layers | 23
CrossATNet - a novel cross-attention based framework for sketch-based image retrieval | 23
Synthetic data for face recognition: Current state and future prospects | 22
MEmoR: A Multimodal Emotion Recognition using affective biomarkers for smart prediction of emotional health for people analytics in smart industries | 22
RoI Tanh-polar transformer network for face parsing in the wild | 21
Efficient pedestrian detection in top-view fisheye images using compositions of perspective view patches | 21
PCANet: Pyramid convolutional attention network for semantic segmentation | 21
Feedback-driven loss function for small object detection | 21
EDS pooling layer | 21
Attention-guided chained context aggregation for semantic segmentation | 20
FastNet: Fast high-resolution network for human pose estimation | 20
An unsupervised domain adaptation scheme for single-stage artwork recognition in cultural sites | 20
Improving image captioning with Pyramid Attention and SC-GAN | 20
Cluster adaptation networks for unsupervised domain adaptation | 19
A survey of methods, datasets and evaluation metrics for visual question answering | 19
Development of an embedded road boundary detection system based on deep learning | 19
Learning to disentangle scenes for person re-identification | 19
Generalizable deep features for ocular biometrics | 18
Revisiting crowd counting: State-of-the-art, trends, and future perspectives | 18
Energy clustering for unsupervised person re-identification | 18
Improved generative adversarial network and its application in image oil painting style transfer | 18
Multi-stream slowFast graph convolutional networks for skeleton-based action recognition | 17
Multiscale parallel deep CNN (mpdCNN) architecture for the real low-resolution face recognition for surveillance | 17
Zero-sum game theory model for segmenting skin regions | 17
Few-Shot learning for face recognition in the presence of image discrepancies for limited multi-class datasets | 17
Investigating bias in deep face analysis: The KANFace dataset and empirical study | 17
Lightweight and computationally faster Hypermetropic Convolutional Neural Network for small size object detection | 17
Cross-database and cross-attack Iris presentation attack detection using micro stripes analyses | 17
A deep-shallow and global–local multi-feature fusion network for photometric stereo | 17
Facial expression recognition using densely connected convolutional neural network and hierarchical spatial attention | 17
Multi-view dynamic facial action unit detection | 16
Multi-information-based convolutional neural network with attention mechanism for pedestrian trajectory prediction | 16
Cross-Correlated Attention Networks for Person Re-Identification | 16
Unsupervised face Frontalization for pose-invariant face recognition | 16
SalFBNet: Learning pseudo-saliency distribution via feedback convolutional networks | 16
Point cloud completion using multiscale feature fusion and cross-regional attention | 15
Attention guided contextual feature fusion network for salient object detection | 15
A neural network aided attuned scheme for gun detection in video surveillance images | 15
IRANet: Identity-relevance aware representation for cloth-changing person re-identification | 15
Dense convolutional feature histograms for robust visual object tracking | 15
Boundary guidance network for camouflage object detection | 15
Bald eagle search optimization with deep transfer learning enabled age-invariant face recognition model | 14
Improved YOLOX-X based UAV aerial photography object detection algorithm | 14
Beyond modality alignment: Learning part-level representation for visible-infrared person re-identification | 14
Synergetic reconstruction from 2D pose and 3D motion for wide-space multi-person video motion capture in the wild | 14
An efficient foreign objects detection network for power substation | 14
Collaborative representation of blur invariant deep sparse features for periocular recognition from smartphones | 14
An attention-based deep learning model for multiple pedestrian attributes recognition | 14
Face anti-spoofing detection based on multi-scale image quality assessment | 14
Self-trained prediction model and novel anomaly score mechanism for video anomaly detection | 14
SalED: Saliency prediction with a pithy encoder-decoder architecture sensing local and global information | 14
CAM: A fine-grained vehicle model recognition method based on visual attention model | 13
The effect of image recognition traffic prediction method under deep learning and naive Bayes algorithm on freeway traffic safety | 13
Convolutional prototype learning for zero-shot recognition | 13
Pose-guided part matching network via shrinking and reweighting for occluded person re-identification | 13
PU-GACNet: Graph Attention Convolution Network for Point Cloud Upsampling | 13
HPRNet: Hierarchical point regression for whole-body human pose estimation | 13
Real-time semantic segmentation with local spatial pixel adjustment | 13
A new perceptual hashing method for verification and identity classification of occluded faces | 13
Certifiable relative pose estimation | 13
ERF-YOLO: A YOLO algorithm compatible with fewer parameters and higher accuracy | 13
Digital video intrusion intelligent detection method based on narrowband Internet of Things and its application | 13
Dual-path CNN with Max Gated block for text-based person re-identification | 13
Multi-level refinement enriched feature pyramid network for object detection | 13
An automated hyperparameter tuned deep learning model enabled facial emotion recognition for autonomous vehicle drivers | 13
Feature based video stabilization based on boosted HAAR Cascade and representative point matching algorithm | 12
Cancelable Iris template generation by aggregating patch level ordinal relations with its holistically extended performance and security analysis | 12
Expression recognition with deep features extracted from holistic and part-based models | 12
Real-time semantic segmentation with weighted factorized-depthwise convolution | 12
From known to the unknown: Transferring knowledge to answer questions about novel visual and semantic concepts | 12
Fusion of iris and sclera using phase intensive rubbersheet mutual exclusion for periocular recognition | 12
MFC-Net : Multi-feature fusion cross neural network for salient object detection | 12
Variance-guided attention-based twin deep network for cross-spectral periocular recognition | 12
Intelligent multimodal pedestrian detection using hybrid metaheuristic optimization with deep learning model | 12
A calibration method of computer vision system based on dual attention mechanism | 12
Detection of anomaly in surveillance videos using quantum convolutional neural networks | 12
Dense open-set recognition based on training with noisy negative images | 12
A novel co-attention computation block for deep learning based image co-segmentation | 12
Multimodal assessment of apparent personality using feature attention and error consistency constraint | 12
Novel features for art movement classification of portrait paintings | 12
CrossFusion net: Deep 3D object detection based on RGB images and point clouds in autonomous driving | 12
I-SOCIAL-DB: A labeled database of images collected from websites and social media for Iris recognition | 12
Multi-level prediction Siamese network for real-time UAV visual tracking | 12
Explaining VQA predictions using visual grounding and a knowledge base | 12
Combining complementary trackers for enhanced long-term visual object tracking | 12
Double anchor embedding for accurate multi-person 2D pose estimation | 11
Dense graph convolutional neural networks on 3D meshes for 3D object segmentation and classification | 11
Using synthetic data for person tracking under adverse weather conditions | 11
A study on attention-based LSTM for abnormal behavior recognition with variable pooling | 11
Face mask detection using deep convolutional neural network and multi-stage image processing | 11
Spatiotemporal module for video saliency prediction based on self-attention | 11
Transformer models for enhancing AttnGAN based text to image generation | 10
Video prediction by efficient transformers | 10
Joint detection and tracking in videos with identification features | 10
Deep hybrid learning for facial expression binary classifications and predictions | 10
Image captioning via proximal policy optimization | 10
Viewpoint constrained and unconstrained Cricket stroke localization from untrimmed videos | 10
Multistage temporal convolution transformer for action segmentation | 10
A motion model based on recurrent neural networks for visual object tracking | 10
Gender based face aging with cycle-consistent adversarial networks | 10
Few-shot object detection via baby learning | 10
E2E-VSDL: End-to-end video surveillance-based deep learning model to detect and prevent criminal activities | 10
How robust are discriminatively trained zero-shot learning models? | 10
Interactive multi-scale feature representation enhancement for small object detection | 10
Co-occurrence of deep convolutional features for image search | 10
Detection of panoramic vision pedestrian based on deep learning | 10
Point cloud classification with deep normalized Reeb graph convolution | 10
Edge supervision and multi-scale cost volume for stereo matching | 10
Emotion detection and face recognition of drivers in autonomous vehicles in IoT platform | 9
Tracking fiducial markers with discriminative correlation filters | 9
Multimodal emotion recognition using cross modal audio-video fusion with attention and deep metric learning | 9
Pose-guided counterfactual inference for occluded person re-identification | 9
Towards generalized morphing attack detection by learning residuals | 9
E2E-V2SResNet: Deep residual convolutional neural networks for end-to-end video driven speech synthesis | 9
Composite recurrent network with internal denoising for facial alignment in still and video images in the wild | 9
Dual-branch adaptive attention transformer for occluded person re-identification | 9
Does explainable machine learning uncover the black box in vision applications? | 9
ASPset: An outdoor sports pose video dataset with 3D keypoint annotations | 9
Handcrafted localized phase features for human action recognition | 9
PDA: Proxy-based domain adaptation for few-shot image recognition | 9
Demographic classification through pupil analysis | 9
Single stage architecture for improved accuracy real-time object detection on mobile devices | 8
Generating facial expression adversarial examples based on saliency map | 8
Geometry consistency aware confidence evaluation for feature matching | 8
Cross-modal feature extraction and integration based RGBD saliency detection | 8
Adaptive weight based on overlapping blocks network for facial expression recognition | 8
Context-based image explanations for deep neural networks | 8
Short-term anchor linking and long-term self-guided attention for video object detection | 8
Tackling multiple object tracking with complicated motions — Re-designing the integration of motion and appearance | 8
MDCS with fully encoding the information of local shape description for 3D Rigid Data matching | 8
Activity guided multi-scales collaboration based on scaled-CNN for saliency prediction | 8
A pooling-based feature pyramid network for salient object detection | 8
Knowledge distillation methods for efficient unsupervised adaptation across multiple domains | 8
Omnidirectional stereo depth estimation based on spherical deep network | 8
Whether normalized or not? Towards more robust iris recognition using dynamic programming | 8
SiaTrans: Siamese transformer network for RGB-D salient object detection with depth image classification | 8
Edge-aware salient object detection network via context guidance | 8
Video-based person re-identification by intra-frame and inter-frame graph neural network | 8
Intelligent facial expression recognition and classification using optimal deep transfer learning model | 8
Lightweight boundary refinement module based on point supervision for semantic segmentation | 8
A Tibetan Thangka data set and relative tasks | 8
A novel micro-expression detection algorithm based on BERT and 3DCNN | 8
H-net: Unsupervised domain adaptation person re-identification network based on hierarchy | 8
Continual coarse-to-fine domain adaptation in semantic segmentation | 8
Real-time gait biometrics for surveillance applications: A review | 8
View knowledge transfer network for multi-view action recognition | 8
Boundary graph convolutional network for temporal action detection | 8
RAMT-GAN: Realistic and accurate makeup transfer with generative adversarial network | 8
Texture classification-based feature processing for violence-based anomaly detection in crowded environments | 8
Triangulate geometric constraint combined with visual-flow fusion network for accurate 6DoF pose estimation | 8
Crowd density detection method based on crowd gathering mode and multi-column convolutional neural network | 8
Clothing generation by multi-modal embedding: A compatibility matrix-regularized GAN model | 8
Camera pose estimation in multi-view environments: From virtual scenarios to the real world | 8
Improving eye movement biometrics in low frame rate eye-tracking devices using periocular and eye blinking features | 8
Adversarial sliced Wasserstein domain adaptation networks | 8