Machine Vision and Applications

Papers
(The median citation count of Machine Vision and Applications is 1. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-04-01 to 2025-04-01.)
ArticleCitations
Unsupervised domain adaptation by cross-domain consistency learning for CT body composition105
A hybrid overlapping group sparsity denoising model with fractional-order total variation and non-convex regularizer78
Self-supervised representation learning for robust fine-grained human hand action recognition in industrial assembly lines62
Gabor capsule network with preprocessing blocks for the recognition of complex images60
Predicting vehicle collisions using data collected from video games49
A robust information hiding algorithm based on lossless encryption and NSCT-HD-SVD44
FAE-GAN: facial attribute editing with multi-scale attention normalization35
Deep-plane sweep generative adversarial network for consistent multi-view depth estimation30
Image dataset creation and networks improvement method based on CAD model and edge operator for object detection in the manufacturing industry30
On the safety of vulnerable road users by cyclist detection and tracking28
Multiple feature-based contrast enhancement of ROI of backlit images25
GMC_FM : a grid and multi-density-based method for matching ancient Chinese architectural images24
Adversarial structured prediction for domain-adaptive semantic segmentation21
Demographic attribute estimation in face videos combining local information and quality assessment21
Improved two-stage image inpainting with perceptual color loss and modified region normalization20
Evolution algorithm of parametric active contour model based on Gaussian smoothing filter19
Graph convolutional networks and LSTM for first-person multimodal hand action recognition18
A method for high dynamic range 3D color modeling of objects through a color camera17
Correction to: Self-attention network for few-shot learning based on nearest-neighbor algorithm17
Deep 6-DoF camera relocalization in variable and dynamic scenes by multitask learning17
Improved deep depth estimation for environments with sparse visual cues16
Ising granularity image analysis on VAE–GAN16
Performance benchmark of deep learning human pose estimation for UAVs15
Text-driven object affordance for guiding grasp-type recognition in multimodal robot teaching15
Development of a robust cascaded architecture for intelligent robot grasping using limited labelled data14
Class-aware cross-domain target detection based on cityscape in fog13
Unsupervised anomaly detection via knowledge distillation with non-directly-coupled student block fusion13
Beyond a strong baseline: cross-modality contrastive learning for visible-infrared person re-identification13
Pixel representations, sampling, and label correction for semantic part detection12
Recent progress in sign language recognition: a review12
Obs-tackle: an obstacle detection system to assist navigation of visually impaired using smartphones11
Enhanced keypoint information and pose-weighted re-ID features for multi-person pose estimation and tracking11
A visual foreign object detection system for wireless charging of electric vehicles11
Volumetric medical image segmentation via scribble annotations and shape priors11
A gradient fusion-based image data augmentation method for reflective workpieces detection under small size datasets11
Automatic label assignment object detection mehtod on only one feature map11
Correction: Adversarial defence by learning differentiated feature representation in deep ensemble11
Multi-shot person re-identification based on appearance and spatial-temporal cues in a large camera network10
Entangled appearance and motion structures network for multi-object tracking and segmentation10
An efficient driving behavior prediction approach using physiological auxiliary and adaptive LSTM10
FERGCN: facial expression recognition based on graph convolution network10
Cross transformer for LiDAR-based loop closure detection10
ET-PointPillars: improved PointPillars for 3D object detection based on optimized voxel downsampling10
Real estate pricing prediction via textual and visual features10
Assessing 3D volumetric asymmetry in facial palsy patients via advanced multi-view landmarks and radial curves10
Triple attention and global reasoning Siamese networks for visual tracking9
Ssman: self-supervised masked adaptive network for 3D human pose estimation9
Paired-D++ GAN for image manipulation with text9
Tensor-guided learning for image denoising using anisotropic PDEs9
ECM: arbitrary style transfer via Enhanced-Channel Module8
A pixel and channel enhanced up-sampling module for biomedical image segmentation8
Cascading spatio-temporal attention network for real-time action detection8
A multilayer human motion prediction perceptron by aggregating repetitive motion8
Study on defect detection of metal castings based on supervised enhancement and attention distillation8
Medtransnet: advanced gating transformer network for medical image classification8
FESAR: SAR ship detection model based on local spatial relationship capture and fused convolutional enhancement8
Feature distribution statistics as a loss objective for robust white balance correction7
End-to-end unsupervised learning of latent-space clustering for image segmentation via fully dense-UNet and fuzzy C-means loss7
Occlusion recovery face recognition based on information reconstruction7
Bidirectional cascaded multimodal attention for multiple choice visual question answering7
Sparse representation with enhanced nonlocal self-similarity for image denoising7
Correction: BoostTrack: boosting the similarity measure and detection confidence for improved multiple object tracking7
Normalized margin loss for action unit detection7
Ipdm: identity preserving diffusion model for face sketch and photo synthesis7
Thermal infrared action recognition with two-stream shift Graph Convolutional Network7
A novel method for 3D knee anatomical landmark localization by combining global and local features7
Online camera auto-calibration appliable to road surveillance7
Identification of facial skin diseases from face phenotypes using FSDNet in uncontrolled environment7
Attention-enhanced feature mapping network for visible-infrared person re-identification7
Transformer with multi-level grid features and depth pooling for image captioning7
Randomized nonlinear two-dimensional principal component analysis network for object recognition7
Intercity rail platform abnormal action recognition based on a skeleton tracking and recognition framework6
IC solder joint inspection via generator-adversarial-network based template6
Teacher–student training and triplet loss to reduce the effect of drastic face occlusion6
Visible-infrared person re-identification model based on feature consistency and modal indistinguishability6
AP-TransNet: a polarized transformer based aerial human action recognition framework6
Transgaze: exploring plain vision transformers for gaze estimation6
Segmentation of photovoltaic module cells in uncalibrated electroluminescence images6
Multi-scale information fusion generative adversarial network for real-world noisy image denoising6
Fourier feature network for 3D vessel reconstruction from biplane angiograms6
Exploring filter placement in convolutional layer topologies based on ResNet for image classification6
Deep traffic sign detection and recognition without target domain real images6
Deep learning-based object recognition in multispectral satellite imagery for real-time applications6
RCA-IUnet: a residual cross-spatial attention-guided inception U-Net model for tumor segmentation in breast ultrasound imaging6
An insect vision-inspired neuromorphic vision systems in low-light obstacle avoidance for intelligent vehicles6
Lesion-aware attention with neural support vector machine for retinopathy diagnosis6
STARNet: spatio-temporal aware recurrent network for efficient video object detection on embedded devices5
Editor’s Note: Special Issue from Winter Conference on Applications of Computer Vision - WACV 20235
A motion direction detecting model for colored images based on the Hassenstein–Reichardt model5
UP-Net: unique keyPoint description and detection net5
Knowledge-based hybrid connectionist models for morphologic reasoning5
Actions as points: a simple and efficient detector for skeleton-based temporal action detection5
FDT − Dr2T: a unified Dense Radiology Report Generation Transformer framework for X-ray images5
Cross-layer attentive feature upsampling for low-latency semantic segmentation5
A review of recent techniques for person re-identification5
A novel multi-feature fusion deep neural network using HOG and VGG-Face for facial expression classification5
Enforced clustering for zero-to-one-shot texture anomaly detection5
Continual learning approaches to hand–eye calibration in robots5
Innovative surface roughness detection method based on white light interference images5
An ensemble approach for accelerated and noise-resilient parallel MRI reconstruction utilizing CycleGANs5
A stereo vision SLAM with moving vehicles tracking in outdoor environment5
Efficient abnormality detection using patch-based 3D convolution with recurrent model5
Plug-and-Play video reconstruction using sparse 3D transform-domain block matching4
Multiplicative noise removal and blind inpainting of ultrasound images based on a new variational framework4
Adaptive fast scale estimation, with accurate online model update based on kernelized correlation filter4
MLMT-CNN for object detection and segmentation in multi-layer and multi-spectral images4
Global-guided cross-reference network for co-salient object detection4
Kinematic calibration of a hexapod robot based on monocular vision4
AccNet: occluded scene text enhancing network with accretion blocks4
Contextual Guided Segmentation Framework for Semi-supervised Video Instance Segmentation4
Viewpoint placement for inspection planning4
A lightweight convolutional neural network for pose estimation of a planar model4
Correction to: A compressed matrix sequence method for solving normal equations of bundle adjustment4
Enhanced machine perception by a scalable fusion of RGB–NIR image pairs in diverse exposure environments4
A deep learning framework for finding illicit images/videos of children4
Parametric regularization loss in super-resolution reconstruction4
Consensus similarity learning based on tensor nuclear norm4
Single image dehazing based on multi-scale segmentation and deep learning4
Partitioned iterated function systems by regression models for head pose estimation4
LPI: learn postures for interactions4
Ensemble learning with advanced fast image filtering features for semi-global matching4
Designing effective power law-based loss function for faster and better bounding box regression4
Inflated 3D ConvNet context analysis for violence detection4
RAU-Net: U-Net network based on residual multi-scale fusion and attention skip layer for overall spine segmentation4
RGBD mapping solution for low-cost robot4
Two-stream lightweight sign language transformer4
Semi-supervised metric learning incorporating weighted triplet constraint and Riemannian manifold optimization for classification3
End-to-end optimized image compression with the frequency-oriented transform3
ConsInstancy: learning instance representations for semi-supervised panoptic segmentation of concrete aggregate particles3
CMNet: a novel model and design rationale based on comparison studies and synergy of CNN and MetaFormer3
A transformer-based neural ODE for dense prediction3
Automatic apraxia detection using deep convolutional neural networks and similarity methods3
Utilizing incremental branches on a one-stage object detection framework to avoid catastrophic forgetting3
Poly-cam: high resolution class activation map for convolutional neural networks3
Underwater image object detection based on multi-scale feature fusion3
Boosting few-shot learning via selective patch embedding by comprehensive sample analysis3
Environmental factors-aware two-stream GCN for skeleton-based behavior recognition3
Closing the gap in domain adaptation for semantic segmentation: a time-aware method3
Dual contrast discriminator with sharing attention for video anomaly detection3
Hyperspectral image dynamic range reconstruction using deep neural network-based denoising methods3
A fast anchor-based graph-regularized low-rank representation approach for large-scale subspace clustering3
Temporal superimposed crossover module for effective continuous sign language3
Boosting facial recognition capability for faces wearing masks using attention augmented residual model with quadruplet loss3
PerSnake: a real-time pedestrian instance segmentation network using contour representation3
Generating quality grasp rectangle using Pix2Pix GAN for intelligent robot grasping3
Motion-region annotation for complex videos via label propagation across occluders3
Distortion diminishing with vulnerability filters pruning3
BoostTrack: boosting the similarity measure and detection confidence for improved multiple object tracking3
A lightweight real-time detection method of small objects for home service robots3
Enhanced normal estimation of point clouds via fine-grained geometric information learning3
MYFED: a dataset of affective face videos for investigation of emotional facial dynamics as a soft biometric for person identification3
SGBGAN: minority class image generation for class-imbalanced datasets3
Addressing the generalization of 3D registration methods with a featureless baseline and an unbiased benchmark3
Simple and effective complementary label learning based on mean square error loss3
Neuro-augmented vision for evolutionary robotics3
Parametric loss-based super-resolution for scene text recognition3
Welding splash and arc noise reduction imaging model based on computationally efficient pairwise response serving welding process library3
A dual-path U-Net for pulmonary vessel segmentation method based on lightweight 3D attention3
Multi-person 3D pose estimation from unlabelled data3
Feature refinement with multi-level context for object detection3
SNFR: salient neighbor decoding and text feature refining for scene text recognition3
Joint patch clustering-based adaptive dictionary and sparse representation for multi-modality image fusion3
Material classification of polishing and convex surface objects based on photon accumulation point spread function (PAPSF) from imaging model of binocular pulsed time-of-flight camera2
Similarity contrastive estimation for image and video soft contrastive self-supervised learning2
Multimodal fine-grained grocery product recognition using image and OCR text2
Automated building and evaluation of 2D as-built floor plans2
MDUNet: deep-prior unrolling network with multi-parameter data integration for low-dose computed tomography reconstruction2
AFMCT: adaptive fusion module based on cross-modal transformer block for 3D object detection2
CGA-Net: channel-wise gated attention network for improved super-resolution in remote sensing imagery2
Residual shuffle attention network for image super-resolution2
SiamMMF: multi-modal multi-level fusion object tracking based on Siamese networks2
Rocnet: 3D robust registration of points clouds using deep learning2
Guest editorial: special issue on human pose estimation and its applications2
Saliency prediction based on multi-channel models of visual processing2
Pixel-wise confidence estimation for segmentation in Bayesian Convolutional Neural Networks2
Ubiquitous vision of transformers for person re-identification2
Correction: Unsupervised single-shot depth estimation using perceptual reconstruction2
Human–object interaction detection based on disentangled axial attention transformer2
A multi-modal framework for continuous and isolated hand gesture recognition utilizing movement epenthesis detection2
Squeezed fire binary segmentation model using convolutional neural network for outdoor images on embedded device2
WideCaps: a wide attention-based capsule network for image classification2
Modeling driving task-relevant attention for intelligent vehicles using triplet ranking2
Interpretable visual transmission lines inspections using pseudo-prototypical part network2
MFEMANet: an effective disaster image classification approach for practical risk assessment2
Motioninsights: real-time object tracking in streaming video2
AFC-Net: adjacent feature complementary for crowded pedestrian detection2
Tree-managed network ensembles for video prediction2
Multi-atlas subcortical segmentation: an orchestration of 3D fully convolutional network and generalized mixture function2
Cancelable face recognition using phase retrieval and complex principal component analysis network2
Wide-baseline multi-camera calibration from a room filled with people2
Generalized few-shot learning under large scope by using episode-wise regularizing imprinting2
Alternate guidance network for boundary-aware camouflaged object detection2
Deep multimodal-based finger spelling recognition for Thai sign language: a new benchmark and model composition2
A review of adaptable conventional image processing pipelines and deep learning on limited datasets2
A novel ship classification network with cascade deep features for line-of-sight sea data2
Saliency detection based on color descriptor and high-level prior2
Zero-shot action recognition by clustered representation with redundancy-free features2
Towards scanning electron microscopy image denoising: a state-of-the-art overview, benchmark, taxonomies, and future direction2
VGT-MOT: visibility-guided tracking for online multiple-object tracking2
The improvement of ground truth annotation in public datasets for human detection2
When dual contrastive learning meets disentangled features for unpaired image deraining2
Discriminative feature learning through feature distance loss2
Foreground enhancement network for object detection in sonar images2
Regional filtering distillation for object detection2
Swin transformer with part-level tokenization for occluded person re-identification1
Interpretability of fingerprint presentation attack detection systems: a look at the “representativeness” of samples against never-seen-before attacks1
Deep learning for unambiguous pose estimation of a non-cooperative fixed-wing UAV1
S-pad: self-learning padding mechanism1
Temporal teacher with masked transformers for semi-supervised action proposal generation1
Few-shot object detection via data augmentation and distribution calibration1
The overlapping effect and fusion protocols of data augmentation techniques in iris PAD1
A semi-supervised learning method for surface defect classification of magnetic tiles1
IAFPN: interlayer enhancement and multilayer fusion network for object detection1
Learning more discriminative local descriptors with parameter-free weighted attention for few-shot learning1
Axes-aligned non-linear optimized PnP algorithm1
YOLOMH: you only look once for multi-task driving perception with high efficiency1
Accurate IoU computation for rotated bounding boxes in $${\mathbb {R}}^2$$ and $${\mathbb {R}}^3$$1
Keyframe-based RGB-D dense visual SLAM fused semantic cues in dynamic scenes1
Text-to-face synthesis based on facial landmarks prediction1
Lunar ground segmentation using a modified U-net neural network1
Graph-based relational reasoning network for video question answering1
An empirical study of different machine learning techniques for brain tumor classification and subsequent segmentation using hybrid texture feature1
Synergizing LiDAR and Augmented Reality for precise real-time interior distance measurements for mobile devices1
Global attention guided multi-scale network for face image super-resolution1
LTM: efficient learning with triangular topology constraint for feature matching with heavy outliers1
Multi-view spectral clustering based on constrained Laplacian rank1
Cultural behaviors analysis in video sequences1
Convolutional neural network-based cross-corpus speech emotion recognition with data augmentation and features fusion1
Local region-learning modules for point cloud classification1
3D multi-object tracking based on parallel multimodal data association1
A stacked dense denoising–segmentation network for undersampled tomograms and knowledge transfer using synthetic tomograms1
Dynamic scene blind image deblurring based on local and non-local features1
Automatic high fidelity foot contact location and timing for elite sprinting1
Kernel based local matching network for video object segmentation1
Twinned attention network for occlusion-aware facial expression recognition1
Micro-concrete crack detection of underwater structures based on convolutional neural network1
Automatic exposure strategy network for robust visual odometry in environments with high dynamic range1
Variable exponent diffusion for image detexturing1
Position Puzzle Network and Augmentation: localizing human keypoints beyond the bounding box1
Learning to explore by reinforcement over high-level options1
Semantic convolutional features for face detection1
Multi-level receptive field feature reuse for multi-focus image fusion1
Research on 3D model reconstruction based on a sequence of cross-sectional images1
React: recognize every action everywhere all at once1
Future-proofing class-incremental learning1
Images denoising for COVID-19 chest X-ray based on multi-resolution parallel residual CNN1
The effect of camera settings on image noise and accuracy of subpixel image registration1
Two-stage structural information enhancement for source-free domain adaptation1
An unsupervised approach for thermal to visible image translation using autoencoder and generative adversarial network1
Trusted 3D self-supervised representation learning with cross-modal settings1
0.17031502723694