Machine Vision and Applications

Papers
(The TQCC of Machine Vision and Applications is 4. The table below lists the papers that meet or exceed that threshold based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2021-11-01 to 2025-11-01.)
Article | Citations
A method for high dynamic range 3D color modeling of objects through a color camera | 122
Text-driven object affordance for guiding grasp-type recognition in multimodal robot teaching | 68
Class-aware cross-domain target detection based on cityscape in fog | 67
Real estate pricing prediction via textual and visual features | 57
ECM: arbitrary style transfer via Enhanced-Channel Module | 42
Non-contact SpO2 monitoring via multi-channel pulse signals from facial videos using machine learning | 39
DMU-Net: a dual stream multi-scale U-Net for image splicing forgery localization | 36
Obs-tackle: an obstacle detection system to assist navigation of visually impaired using smartphones | 36
StyleDemorpher: high-quality face demorphing via StyleGAN2’s latent space | 34
Triple attention and global reasoning Siamese networks for visual tracking | 33
A hybrid overlapping group sparsity denoising model with fractional-order total variation and non-convex regularizer | 31
Development of a robust cascaded architecture for intelligent robot grasping using limited labelled data | 29
End-to-end unsupervised learning of latent-space clustering for image segmentation via fully dense-UNet and fuzzy C-means loss | 28
A motion direction detecting model for colored images based on the Hassenstein–Reichardt model | 18
Global-guided cross-reference network for co-salient object detection | 18
Medtransnet: advanced gating transformer network for medical image classification | 18
Editing implicit and explicit representations of radiance fields: a survey | 18
Using breast density for hybrid region and pixel-level loss function | 18
Enforced clustering for zero-to-one-shot texture anomaly detection | 17
Motion-region annotation for complex videos via label propagation across occluders | 17
LOID: Lane Occlusion Inpainting and Detection for Enhanced Autonomous Driving Systems | 16
MSPKD: multi spatial projectors for knowledge distillation in semantic segmentation | 16
Innovative surface roughness detection method based on white light interference images | 16
A stereo vision SLAM with moving vehicles tracking in outdoor environment | 16
CGA-Net: channel-wise gated attention network for improved super-resolution in remote sensing imagery | 15
L-VAE: variational auto-encoder with learnable beta for disentangled representation | 14
Ubiquitous vision of transformers for person re-identification | 14
Specular Surface Detection with Deep Static Specular Flow and Highlight | 14
A multi-modal framework for continuous and isolated hand gesture recognition utilizing movement epenthesis detection | 13
Modeling driving task-relevant attention for intelligent vehicles using triplet ranking | 12
AFC-Net: adjacent feature complementary for crowded pedestrian detection | 12
Real-World super-resolution under the guidance of optimal transport | 12
Alternate guidance network for boundary-aware camouflaged object detection | 12
Correction: Unsupervised single-shot depth estimation using perceptual reconstruction | 11
Generation of realistic synthetic cable images to train deep learning segmentation models | 11
Axes-aligned non-linear optimized PnP algorithm | 11
Discriminant distance template matching for image recognition | 11
Generalized few-shot learning under large scope by using episode-wise regularizing imprinting | 11
Kernel based local matching network for video object segmentation | 11
Two-stage structural information enhancement for source-free domain adaptation | 10
Novel Cauchy mixture modeling combined with the Sparse-RCNN architecture for enhanced multi-person pose estimation | 10
3D face parsing based on 2D CPFNet: conformal parameterized face parsing network | 10
LDNet: low-light image enhancement with joint lighting and denoising | 10
RPIM-net: residual channel prior-driven interaction multi-scale network for stereo image deraining | 10
Traversing the subspace of adversarial patches | 10
A dual progressive strategy for long-tailed visual recognition | 10
CAMTrack: a combined appearance-motion method for multiple-object tracking | 9
Twinned attention network for occlusion-aware facial expression recognition | 9
Generating comprehensive scene graphs with integrated multiple attribute detection | 9
Benchmarking large and small MLLMs | 9
Enhanced hyperspectral image reconstruction via parallel 2D/3D convolution with global layer purification and multiscale pooling fusion | 9
Improving knowledge distillation via pseudo-multi-teacher network | 9
Adversarial imitation learning-based network for category-level 6D object pose estimation | 9
Camera-based mapping in search-and-rescue via flying and ground robot teams | 9
Online continual learning with saliency-guided experience replay using tiny episodic memory | 9
Shape related unknown object one-shot learning grasping | 9
IoU-aware feature fusion R-CNN for dense object detection | 8
OmniGlasses: an optical aid for stereo vision CNNs to enable omnidirectional image processing | 8
Audio-visual localization based on spatial relative sound order | 8
Pakistan sign language recognition: leveraging deep learning models with limited dataset | 8
Thin section analysis for ceramic petrography using motion analysis and segmentation techniques | 8
Fusing bilinear multi-channel gated vector for fine-grained classification | 8
Correction: Real estate pricing prediction via textual and visual features | 8
Explainable interactive projections of images | 8
X-Align++: cross-modal cross-view alignment for Bird’s-eye-view segmentation | 8
A comprehensive survey on SLAM and machine learning approaches for indoor autonomous navigation of mobile robots | 8
Shape description losses for medical image segmentation | 8
GOA-net: generic occlusion aware networks for visual tracking | 8
A camera style-invariant learning and channel interaction enhancement fusion network for visible-infrared person re-identification | 8
An anisotropic non-local attention network for image segmentation | 7
Cross-validation of a semantic segmentation network for natural history collection specimens | 7
Real-time pedestrian pose estimation, tracking and localization for social distancing | 7
YG-SLAM: dynamic environment-based geometric constraint point-line fusion visual SLAM system | 7
Automatic cables segmentation from a substation device based on 3D point cloud | 7
An efficient ground segmentation approach for LiDAR point cloud utilizing adjacent grids | 7
MÆIDM: multi-scale anomaly embedding inpainting and discrimination for surface anomaly detection | 7
Multi-scale convolution underwater image restoration network | 7
Chfnet: a coarse-to-fine hierarchical refinement model for monocular depth estimation | 7
Integrating visual-semantic relational reasoning for fake news detection on video platforms | 7
DisRot: boosting the generalization capability of few-shot learning via knowledge distillation and self-supervised learning | 7
Clarity method of fog and dust image in fully mechanized mining face | 7
An adaptive interpolation and 3D reconstruction algorithm for underwater images | 7
EAF-Net: an enhancement and aggregation–feedback network for RGB-T salient object detection | 7
Pattern recognition methodologies for pollen grain image classification: a survey | 7
Attention-based global context network for driving maneuvers prediction | 7
Identification of facial skin diseases from face phenotypes using FSDNet in uncontrolled environment | 7
Tree-managed network ensembles for video prediction | 6
Semi-supervised metric learning incorporating weighted triplet constraint and Riemannian manifold optimization for classification | 6
Visual-inertial SLAM with line segment merging and efficient feature tracking method | 6
Parametric loss-based super-resolution for scene text recognition | 6
Environmental factors-aware two-stream GCN for skeleton-based behavior recognition | 6
A novel multi-feature fusion deep neural network using HOG and VGG-Face for facial expression classification | 6
ConsInstancy: learning instance representations for semi-supervised panoptic segmentation of concrete aggregate particles | 6
Distortion diminishing with vulnerability filters pruning | 6
Mobgazenet: robust gaze estimation mobile network based on progressive attention mechanisms | 6
Deep-plane sweep generative adversarial network for consistent multi-view depth estimation | 6
Evolution algorithm of parametric active contour model based on Gaussian smoothing filter | 6
Kinematic calibration of a hexapod robot based on monocular vision | 6
Enhanced normal estimation of point clouds via fine-grained geometric information learning | 6
A dual-path U-Net for pulmonary vessel segmentation method based on lightweight 3D attention | 6
Meta-learning enhanced global–local feature fusion for image quality assessment | 6
Actions as points: a simple and efficient detector for skeleton-based temporal action detection | 6
Tensor-guided learning for image denoising using anisotropic PDEs | 6
Boosting few-shot learning via selective patch embedding by comprehensive sample analysis | 6
Welding splash and arc noise reduction imaging model based on computationally efficient pairwise response serving welding process library | 6
Block-recurrent visual transformer for enhanced human detection in thermal imaging | 5
Multi-planar geometry and latent image recovery from a single motion-blurred image | 5
Regional filtering distillation for object detection | 5
Accelerated fixed-point iterations for image deblurring and defiltering | 5
VGT-MOT: visibility-guided tracking for online multiple-object tracking | 5
Guest editorial: special issue on human pose estimation and its applications | 5
Quality assessment of synthetic images via spatial distortion recognition | 5
Multiple object tracking using weighted graph convolutional neural networks | 5
Cascaded attention-guided multi-granularity feature learning for person re-identification | 5
Residual shuffle attention network for image super-resolution | 5
PTDS CenterTrack: pedestrian tracking in dense scenes with re-identification and feature enhancement | 5
TFF-temporal fusion framework for advancing video retrieval through long-range dependencies and multi-modal intent | 5
Delaunay walk for fast nearest neighbor: accelerating correspondence matching for ICP | 5
A collaborative SLAM method for dual payload-carrying UAVs in denied environments | 5
Text-to-face synthesis based on facial landmarks prediction | 5
A review of adaptable conventional image processing pipelines and deep learning on limited datasets | 5
Logit scaling for out-of-distribution detection | 5
Robust semantic segmentation method of urban scenes in snowy environment | 5
Beyond Kalman filters: deep learning-based filters for improved object tracking | 4
Fine-grained 3D vehicle shape manipulation via latent space editing | 4
Self-attention network for few-shot learning based on nearest-neighbor algorithm | 4
CCTV-Calib: a toolbox to calibrate surveillance cameras around the globe | 4
The general framework for few-shot learning by kernel HyperNetworks | 4
ViCap-AD: video caption-based weakly supervised video anomaly detection | 4
Wavelet and PCA-based glaucoma classification through novel methodological enhanced retinal images | 4
Real-time 3D reconstruction using point-dependent pose graph optimization framework | 4
Removing cloud shadows from ground-based solar imagery | 4
Superpixel-based foreground-preserving image stitching | 4
Symmetry-induced ambiguity in orientation estimation from RGB images | 4
Human pose estimation based on lightweight basicblock | 4
Unsupervised single-shot depth estimation using perceptual reconstruction | 4
FLAVR: flow-free architecture for fast video frame interpolation | 4
BiTransformer: augmenting semantic context in video captioning via bidirectional decoder | 4
SiamCAR-Kal: anti-occlusion tracking algorithm for infrared ground targets based on SiamCAR and Kalman filter | 4
A robust vehicle tracking in low-altitude UAV videos | 4
Spatial-temporal graph-guided global attention network for video-based person re-identification | 4
Gait recognition using free-area transformer networks | 4
An image quality assessment method based on edge extraction and singular value for blurriness | 4
Swin transformer with part-level tokenization for occluded person re-identification | 4
Supervised contrastive learning with multi-scale interaction and integrity learning for salient object detection | 4
Toward phytoplankton parasite detection using autoencoders | 4
Local region-learning modules for point cloud classification | 4
YOLOMH: you only look once for multi-task driving perception with high efficiency | 4
Naturally constrained reject option classification | 4
Optimized hand pose estimation CrossInfoNet-based architecture for embedded devices | 4
Structure–texture decomposition-based dehazing of a single image with large sky area | 4
Multimodal dance style transfer | 4
A deep Retinex network for underwater low-light image enhancement | 4
Hierarchical contrastive adaptation for cross-domain object detection | 4
React: recognize every action everywhere all at once | 4
Mitigating adversarial perturbations via weakly supervised object location and regions recombination | 4
Residual feature learning with hierarchical calibration for gaze estimation | 4
Personvit: large-scale self-supervised vision transformer for person re-identification | 4