Machine Vision and Applications

Papers
(The TQCC of Machine Vision and Applications is 4. The table below lists the papers at or above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2021-06-01 to 2025-06-01.)
Article | Citations
End-to-end unsupervised learning of latent-space clustering for image segmentation via fully dense-UNet and fuzzy C-means loss | 121
Real estate pricing prediction via textual and visual features | 90
ECM: arbitrary style transfer via Enhanced-Channel Module | 60
A method for high dynamic range 3D color modeling of objects through a color camera | 55
Text-driven object affordance for guiding grasp-type recognition in multimodal robot teaching | 49
Development of a robust cascaded architecture for intelligent robot grasping using limited labelled data | 33
Class-aware cross-domain target detection based on cityscape in fog | 30
Medtransnet: advanced gating transformer network for medical image classification | 29
Triple attention and global reasoning Siamese networks for visual tracking | 29
Obs-tackle: an obstacle detection system to assist navigation of visually impaired using smartphones | 26
A hybrid overlapping group sparsity denoising model with fractional-order total variation and non-convex regularizer | 25
Global-guided cross-reference network for co-salient object detection | 24
Motion-region annotation for complex videos via label propagation across occluders | 23
Using breast density for hybrid region and pixel-level loss function | 21
Enforced clustering for zero-to-one-shot texture anomaly detection | 21
A stereo vision SLAM with moving vehicles tracking in outdoor environment | 20
A motion direction detecting model for colored images based on the Hassenstein–Reichardt model | 20
Innovative surface roughness detection method based on white light interference images | 18
Specular Surface Detection with Deep Static Specular Flow and Highlight | 17
A multi-modal framework for continuous and isolated hand gesture recognition utilizing movement epenthesis detection | 17
Ubiquitous vision of transformers for person re-identification | 16
An unsupervised approach for thermal to visible image translation using autoencoder and generative adversarial network | 16
Generalized few-shot learning under large scope by using episode-wise regularizing imprinting | 14
Modeling driving task-relevant attention for intelligent vehicles using triplet ranking | 14
AFC-Net: adjacent feature complementary for crowded pedestrian detection | 14
Real-World super-resolution under the guidance of optimal transport | 14
Generation of realistic synthetic cable images to train deep learning segmentation models | 14
Correction: Unsupervised single-shot depth estimation using perceptual reconstruction | 14
Discriminant distance template matching for image recognition | 13
Axes-aligned non-linear optimized PnP algorithm | 13
CGA-Net: channel-wise gated attention network for improved super-resolution in remote sensing imagery | 13
Kernel based local matching network for video object segmentation | 12
Alternate guidance network for boundary-aware camouflaged object detection | 12
3D face parsing based on 2D CPFNet: conformal parameterized face parsing network | 12
Two-stage structural information enhancement for source-free domain adaptation | 11
A dual progressive strategy for long-tailed visual recognition | 11
Online continual learning with saliency-guided experience replay using tiny episodic memory | 11
Improving knowledge distillation via pseudo-multi-teacher network | 10
Camera-based mapping in search-and-rescue via flying and ground robot teams | 10
Twinned attention network for occlusion-aware facial expression recognition | 10
LDNet: low-light image enhancement with joint lighting and denoising | 10
Generating comprehensive scene graphs with integrated multiple attribute detection | 10
Traversing the subspace of adversarial patches | 10
Images denoising for COVID-19 chest X-ray based on multi-resolution parallel residual CNN | 10
CAMTrack: a combined appearance-motion method for multiple-object tracking | 10
Fusing bilinear multi-channel gated vector for fine-grained classification | 9
Cross-validation of a semantic segmentation network for natural history collection specimens | 9
Chfnet: a coarse-to-fine hierarchical refinement model for monocular depth estimation | 9
X-Align++: cross-modal cross-view alignment for Bird’s-eye-view segmentation | 9
Thin section analysis for ceramic petrography using motion analysis and segmentation techniques | 9
Explainable interactive projections of images | 9
GOA-net: generic occlusion aware networks for visual tracking | 9
Shape description losses for medical image segmentation | 9
EAF-Net: an enhancement and aggregation–feedback network for RGB-T salient object detection | 9
Adversarial imitation learning-based network for category-level 6D object pose estimation | 9
MÆIDM: multi-scale anomaly embedding inpainting and discrimination for surface anomaly detection | 8
DisRot: boosting the generalization capability of few-shot learning via knowledge distillation and self-supervised learning | 8
Correction: Real estate pricing prediction via textual and visual features | 8
Multi-scale convolution underwater image restoration network | 8
An anisotropic non-local attention network for image segmentation | 8
IoU-aware feature fusion R-CNN for dense object detection | 8
Shape related unknown object one-shot learning grasping | 8
A camera style-invariant learning and channel interaction enhancement fusion network for visible-infrared person re-identification | 8
A comprehensive survey on SLAM and machine learning approaches for indoor autonomous navigation of mobile robots | 8
OmniGlasses: an optical aid for stereo vision CNNs to enable omnidirectional image processing | 8
Pakistan sign language recognition: leveraging deep learning models with limited dataset | 8
Clarity method of fog and dust image in fully mechanized mining face | 7
YG-SLAM: dynamic environment-based geometric constraint point-line fusion visual SLAM system | 7
Attention-based global context network for driving maneuvers prediction | 7
On the safety of vulnerable road users by cyclist detection and tracking | 7
Pattern recognition methodologies for pollen grain image classification: a survey | 7
Automatic cables segmentation from a substation device based on 3D point cloud | 7
Real-time pedestrian pose estimation, tracking and localization for social distancing | 7
Sparse representation with enhanced nonlocal self-similarity for image denoising | 7
Deep-plane sweep generative adversarial network for consistent multi-view depth estimation | 7
An efficient ground segmentation approach for LiDAR point cloud utilizing adjacent grids | 7
An adaptive interpolation and 3D reconstruction algorithm for underwater images | 7
Identification of facial skin diseases from face phenotypes using FSDNet in uncontrolled environment | 7
Evolution algorithm of parametric active contour model based on Gaussian smoothing filter | 7
A robust information hiding algorithm based on lossless encryption and NSCT-HD-SVD | 6
Tensor-guided learning for image denoising using anisotropic PDEs | 6
Lesion-aware attention with neural support vector machine for retinopathy diagnosis | 6
Parametric loss-based super-resolution for scene text recognition | 6
Mobgazenet: robust gaze estimation mobile network based on progressive attention mechanisms | 6
Actions as points: a simple and efficient detector for skeleton-based temporal action detection | 6
Environmental factors-aware two-stream GCN for skeleton-based behavior recognition | 6
Distortion diminishing with vulnerability filters pruning | 6
Meta-learning enhanced global–local feature fusion for image quality assessment | 6
Boosting few-shot learning via selective patch embedding by comprehensive sample analysis | 6
Welding splash and arc noise reduction imaging model based on computationally efficient pairwise response serving welding process library | 6
Enhanced normal estimation of point clouds via fine-grained geometric information learning | 6
VGT-MOT: visibility-guided tracking for online multiple-object tracking | 5
Kinematic calibration of a hexapod robot based on monocular vision | 5
Guest editorial: special issue on human pose estimation and its applications | 5
Regional filtering distillation for object detection | 5
Text-to-face synthesis based on facial landmarks prediction | 5
TFF-temporal fusion framework for advancing video retrieval through long-range dependencies and multi-modal intent | 5
Delaunay walk for fast nearest neighbor: accelerating correspondence matching for ICP | 5
Semi-supervised metric learning incorporating weighted triplet constraint and Riemannian manifold optimization for classification | 5
ConsInstancy: learning instance representations for semi-supervised panoptic segmentation of concrete aggregate particles | 5
Tree-managed network ensembles for video prediction | 5
A dual-path U-Net for pulmonary vessel segmentation method based on lightweight 3D attention | 5
The effect of camera settings on image noise and accuracy of subpixel image registration | 5
Automatic high fidelity foot contact location and timing for elite sprinting | 5
PTDS CenterTrack: pedestrian tracking in dense scenes with re-identification and feature enhancement | 5
A novel multi-feature fusion deep neural network using HOG and VGG-Face for facial expression classification | 5
A review of adaptable conventional image processing pipelines and deep learning on limited datasets | 5
Saliency detection based on color descriptor and high-level prior | 5
Robust semantic segmentation method of urban scenes in snowy environment | 5
Multi-planar geometry and latent image recovery from a single motion-blurred image | 5
Cascaded attention-guided multi-granularity feature learning for person re-identification | 5
Residual feature learning with hierarchical calibration for gaze estimation | 4
Depthwise grouped convolution for object detection | 4
Superpixel-based foreground-preserving image stitching | 4
Symmetry-induced ambiguity in orientation estimation from RGB images | 4
Swin transformer with part-level tokenization for occluded person re-identification | 4
Naturally constrained reject option classification | 4
Personvit: large-scale self-supervised vision transformer for person re-identification | 4
Local region-learning modules for point cloud classification | 4
Human pose estimation based on lightweight basicblock | 4
Self-attention network for few-shot learning based on nearest-neighbor algorithm | 4
Unsupervised single-shot depth estimation using perceptual reconstruction | 4
Structure–texture decomposition-based dehazing of a single image with large sky area | 4
Residual shuffle attention network for image super-resolution | 4
FLAVR: flow-free architecture for fast video frame interpolation | 4
A collaborative SLAM method for dual payload-carrying UAVs in denied environments | 4
React: recognize every action everywhere all at once | 4
CCTV-Calib: a toolbox to calibrate surveillance cameras around the globe | 4
Optimized hand pose estimation CrossInfoNet-based architecture for embedded devices | 4
Removing cloud shadows from ground-based solar imagery | 4
Mitigating adversarial perturbations via weakly supervised object location and regions recombination | 4
BiTransformer: augmenting semantic context in video captioning via bidirectional decoder | 4
Beyond Kalman filters: deep learning-based filters for improved object tracking | 4
Toward phytoplankton parasite detection using autoencoders | 4
YOLOMH: you only look once for multi-task driving perception with high efficiency | 4