OOIR: Observatory of International Research

Papers

(The TQCC of Machine Vision and Applications is 4. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-11-01 to 2025-11-01.)

Article	Citations
A method for high dynamic range 3D color modeling of objects through a color camera	122
Text-driven object affordance for guiding grasp-type recognition in multimodal robot teaching	68
Class-aware cross-domain target detection based on cityscape in fog	67
Real estate pricing prediction via textual and visual features	57
ECM: arbitrary style transfer via Enhanced-Channel Module	42
Non-contact SpO2 monitoring via multi-channel pulse signals from facial videos using machine learning	39
DMU-Net: a dual stream multi-scale U-Net for image splicing forgery localization	36
Obs-tackle: an obstacle detection system to assist navigation of visually impaired using smartphones	36
StyleDemorpher: high-quality face demorphing via StyleGAN2’s latent space	34
Triple attention and global reasoning Siamese networks for visual tracking	33
A hybrid overlapping group sparsity denoising model with fractional-order total variation and non-convex regularizer	31
Development of a robust cascaded architecture for intelligent robot grasping using limited labelled data	29
End-to-end unsupervised learning of latent-space clustering for image segmentation via fully dense-UNet and fuzzy C-means loss	28
A motion direction detecting model for colored images based on the Hassenstein–Reichardt model	18
Global-guided cross-reference network for co-salient object detection	18
Medtransnet: advanced gating transformer network for medical image classification	18
Editing implicit and explicit representations of radiance fields: a survey	18
Using breast density for hybrid region and pixel-level loss function	18
Enforced clustering for zero-to-one-shot texture anomaly detection	17
Motion-region annotation for complex videos via label propagation across occluders	17
LOID: Lane Occlusion Inpainting and Detection for Enhanced Autonomous Driving Systems	16
MSPKD: multi spatial projectors for knowledge distillation in semantic segmentation	16
Innovative surface roughness detection method based on white light interference images	16
A stereo vision SLAM with moving vehicles tracking in outdoor environment	16
CGA-Net: channel-wise gated attention network for improved super-resolution in remote sensing imagery	15

L-VAE: variational auto-encoder with learnable beta for disentangled representation	14
Ubiquitous vision of transformers for person re-identification	14
Specular Surface Detection with Deep Static Specular Flow and Highlight	14
A multi-modal framework for continuous and isolated hand gesture recognition utilizing movement epenthesis detection	13
Modeling driving task-relevant attention for intelligent vehicles using triplet ranking	12
AFC-Net: adjacent feature complementary for crowded pedestrian detection	12
Real-World super-resolution under the guidance of optimal transport	12
Alternate guidance network for boundary-aware camouflaged object detection	12
Correction: Unsupervised single-shot depth estimation using perceptual reconstruction	11
Generation of realistic synthetic cable images to train deep learning segmentation models	11
Axes-aligned non-linear optimized PnP algorithm	11
Discriminant distance template matching for image recognition	11
Generalized few-shot learning under large scope by using episode-wise regularizing imprinting	11
Kernel based local matching network for video object segmentation	11
Two-stage structural information enhancement for source-free domain adaptation	10
Novel Cauchy mixture modeling combined with the Sparse-RCNN architecture for enhanced multi-person pose estimation	10
3D face parsing based on 2D CPFNet: conformal parameterized face parsing network	10
LDNet: low-light image enhancement with joint lighting and denoising	10
RPIM-net: residual channel prior-driven interaction multi-scale network for stereo image deraining	10
Traversing the subspace of adversarial patches	10
A dual progressive strategy for long-tailed visual recognition	10
CAMTrack: a combined appearance-motion method for multiple-object tracking	9
Twinned attention network for occlusion-aware facial expression recognition	9
Generating comprehensive scene graphs with integrated multiple attribute detection	9
Benchmarking large and small MLLMs	9
Enhanced hyperspectral image reconstruction via parallel 2D/3D convolution with global layer purification and multiscale pooling fusion	9
Improving knowledge distillation via pseudo-multi-teacher network	9
Adversarial imitation learning-based network for category-level 6D object pose estimation	9
Camera-based mapping in search-and-rescue via flying and ground robot teams	9
Online continual learning with saliency-guided experience replay using tiny episodic memory	9
Shape related unknown object one-shot learning grasping	9
IoU-aware feature fusion R-CNN for dense object detection	8
OmniGlasses: an optical aid for stereo vision CNNs to enable omnidirectional image processing	8
Audio-visual localization based on spatial relative sound order	8
Pakistan sign language recognition: leveraging deep learning models with limited dataset	8
Thin section analysis for ceramic petrography using motion analysis and segmentation techniques	8
Fusing bilinear multi-channel gated vector for fine-grained classification	8
Correction: Real estate pricing prediction via textual and visual features	8
Explainable interactive projections of images	8
X-Align++: cross-modal cross-view alignment for Bird’s-eye-view segmentation	8
A comprehensive survey on SLAM and machine learning approaches for indoor autonomous navigation of mobile robots	8
Shape description losses for medical image segmentation	8
GOA-net: generic occlusion aware networks for visual tracking	8
A camera style-invariant learning and channel interaction enhancement fusion network for visible-infrared person re-identification	8
An anisotropic non-local attention network for image segmentation	7
Cross-validation of a semantic segmentation network for natural history collection specimens	7
Real-time pedestrian pose estimation, tracking and localization for social distancing	7
YG-SLAM: dynamic environment-based geometric constraint point-line fusion visual SLAM system	7
Automatic cables segmentation from a substation device based on 3D point cloud	7
An efficient ground segmentation approach for LiDAR point cloud utilizing adjacent grids	7

MÆIDM: multi-scale anomaly embedding inpainting and discrimination for surface anomaly detection	7
Multi-scale convolution underwater image restoration network	7
Chfnet: a coarse-to-fine hierarchical refinement model for monocular depth estimation	7
Integrating visual-semantic relational reasoning for fake news detection on video platforms	7
DisRot: boosting the generalization capability of few-shot learning via knowledge distillation and self-supervised learning	7
Clarity method of fog and dust image in fully mechanized mining face	7
An adaptive interpolation and 3D reconstruction algorithm for underwater images	7
EAF-Net: an enhancement and aggregation–feedback network for RGB-T salient object detection	7
Pattern recognition methodologies for pollen grain image classification: a survey	7
Attention-based global context network for driving maneuvers prediction	7
Identification of facial skin diseases from face phenotypes using FSDNet in uncontrolled environment	7
Tree-managed network ensembles for video prediction	6
Semi-supervised metric learning incorporating weighted triplet constraint and Riemannian manifold optimization for classification	6
Visual-inertial SLAM with line segment merging and efficient feature tracking method	6
Parametric loss-based super-resolution for scene text recognition	6
Environmental factors-aware two-stream GCN for skeleton-based behavior recognition	6
A novel multi-feature fusion deep neural network using HOG and VGG-Face for facial expression classification	6
ConsInstancy: learning instance representations for semi-supervised panoptic segmentation of concrete aggregate particles	6
Distortion diminishing with vulnerability filters pruning	6
Mobgazenet: robust gaze estimation mobile network based on progressive attention mechanisms	6
Deep-plane sweep generative adversarial network for consistent multi-view depth estimation	6
Evolution algorithm of parametric active contour model based on Gaussian smoothing filter	6
Kinematic calibration of a hexapod robot based on monocular vision	6
Enhanced normal estimation of point clouds via fine-grained geometric information learning	6
A dual-path U-Net for pulmonary vessel segmentation method based on lightweight 3D attention	6
Meta-learning enhanced global–local feature fusion for image quality assessment	6
Actions as points: a simple and efficient detector for skeleton-based temporal action detection	6
Tensor-guided learning for image denoising using anisotropic PDEs	6
Boosting few-shot learning via selective patch embedding by comprehensive sample analysis	6
Welding splash and arc noise reduction imaging model based on computationally efficient pairwise response serving welding process library	6
Block-recurrent visual transformer for enhanced human detection in thermal imaging	5
Multi-planar geometry and latent image recovery from a single motion-blurred image	5
Regional filtering distillation for object detection	5
Accelerated fixed-point iterations for image deblurring and defiltering	5
VGT-MOT: visibility-guided tracking for online multiple-object tracking	5
Guest editorial: special issue on human pose estimation and its applications	5
Quality assessment of synthetic images via spatial distortion recognition	5
Multiple object tracking using weighted graph convolutional neural networks	5
Cascaded attention-guided multi-granularity feature learning for person re-identification	5
Residual shuffle attention network for image super-resolution	5
PTDS CenterTrack: pedestrian tracking in dense scenes with re-identification and feature enhancement	5
TFF-temporal fusion framework for advancing video retrieval through long-range dependencies and multi-modal intent	5
Delaunay walk for fast nearest neighbor: accelerating correspondence matching for ICP	5
A collaborative SLAM method for dual payload-carrying UAVs in denied environments	5
Text-to-face synthesis based on facial landmarks prediction	5
A review of adaptable conventional image processing pipelines and deep learning on limited datasets	5
Logit scaling for out-of-distribution detection	5
Robust semantic segmentation method of urban scenes in snowy environment	5
Beyond Kalman filters: deep learning-based filters for improved object tracking	4
Fine-grained 3D vehicle shape manipulation via latent space editing	4
Self-attention network for few-shot learning based on nearest-neighbor algorithm	4
CCTV-Calib: a toolbox to calibrate surveillance cameras around the globe	4
The general framework for few-shot learning by kernel HyperNetworks	4
ViCap-AD: video caption-based weakly supervised video anomaly detection	4
Wavelet and PCA-based glaucoma classification through novel methodological enhanced retinal images	4
Real-time 3D reconstruction using point-dependent pose graph optimization framework	4
Removing cloud shadows from ground-based solar imagery	4
Superpixel-based foreground-preserving image stitching	4
Symmetry-induced ambiguity in orientation estimation from RGB images	4
Human pose estimation based on lightweight basicblock	4
Unsupervised single-shot depth estimation using perceptual reconstruction	4
FLAVR: flow-free architecture for fast video frame interpolation	4
BiTransformer: augmenting semantic context in video captioning via bidirectional decoder	4
SiamCAR-Kal: anti-occlusion tracking algorithm for infrared ground targets based on SiamCAR and Kalman filter	4
A robust vehicle tracking in low-altitude UAV videos	4
Spatial-temporal graph-guided global attention network for video-based person re-identification	4
Gait recognition using free-area transformer networks	4
An image quality assessment method based on edge extraction and singular value for blurriness	4
Swin transformer with part-level tokenization for occluded person re-identification	4
Supervised contrastive learning with multi-scale interaction and integrity learning for salient object detection	4
Toward phytoplankton parasite detection using autoencoders	4
Local region-learning modules for point cloud classification	4
YOLOMH: you only look once for multi-task driving perception with high efficiency	4
Naturally constrained reject option classification	4
Optimized hand pose estimation CrossInfoNet-based architecture for embedded devices	4
Structure–texture decomposition-based dehazing of a single image with large sky area	4
Multimodal dance style transfer	4
A deep Retinex network for underwater low-light image enhancement	4
Hierarchical contrastive adaptation for cross-domain object detection	4
React: recognize every action everywhere all at once	4

Mitigating adversarial perturbations via weakly supervised object location and regions recombination	4
Residual feature learning with hierarchical calibration for gaze estimation	4
Personvit: large-scale self-supervised vision transformer for person re-identification	4