OOIR: Observatory of International Research

Papers

(The TQCC of Image and Vision Computing is 6. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-07-01 to 2025-07-01.)

Article	Citations
Learning diverse and deep clues for person reidentification	160
RGB-T tracking by modality difference reduction and feature re-selection	156
Alignment and fusion for adaptive domain nighttime semantic segmentation	127
Active domain adaptation for semantic segmentation via dynamically balancing domainness and uncertainty	122
Modeling content-attribute preference for personalized image esthetics assessment	118
HPD-Depth: High performance decoding network for self-supervised monocular depth estimation	101
Multi-information guided camouflaged object detection	76
Editorial Board	73
ABC: Aligning binary centers for single-stage monocular 3D object detection	60
Cross-scale global attention feature pyramid network for person search	59
BF3D: Bi-directional fusion 3D detector with semantic sampling and geometric mapping	57
Feature decoupling and interaction network for defending against adversarial examples	56
Hourglass cascaded recurrent stereo matching network	50
G-TRACE: Grouped temporal recalibration for video object segmentation	44
PU-GACNet: Graph Attention Convolution Network for Point Cloud Upsampling	43
Few-shot classification with multisemantic information fusion network	43
Synthetic lidar point cloud generation using deep generative models for improved driving scene object recognition	41
Privacy-preserving explainable AI enable federated learning-based denoising fingerprint recognition model	40
Accurate and efficient salient object detection via position prior attention	39
GAN-BodyPose: Real-time 3D human body pose data key point detection and quality assessment assisted by generative adversarial network	37
Background debiased class incremental learning for video action recognition	35
AI-powered trustable and explainable fall detection system using transfer learning	34
SRMA-KD: Structured relational multi-scale attention knowledge distillation for effective lightweight cardiac image segmentation	34
DeepArUco++: Improved detection of square fiducial markers in challenging lighting conditions	34
Learning an augmentation strategy for sparse datasets	32

FMD-Yolo: An efficient face mask detection method for COVID-19 prevention and control in public	32
Single stage architecture for improved accuracy real-time object detection on mobile devices	30
1D kernel distillation network for efficient image super-resolution	29
MVPCC-Net: Multi-View Based Point Cloud Completion Network for MLS data	29
ST-VTON: Self-supervised vision transformer for image-based virtual try-on	29
Utilizing Inherent Bias for Memory Efficient Continual Learning: A Simple and Robust Baseline	28
Learning accurate monocular 3D voxel representation via bilateral voxel transformer	28
SAFENet: Semantic-Aware Feature Enhancement Network for unsupervised cross-domain road scene segmentation	28
Spatial likelihood voting with self-knowledge distillation for weakly supervised object detection	28
Multi-view dynamic facial action unit detection	28
Two-stream transformer tracking with messengers	27
Memory-MambaNav: Enhancing object-goal navigation through integration of spatial–temporal scanning with state space models	27
Self-supervised Vision Transformers for 3D pose estimation of novel objects	26
Deep learning with adaptive convolutions for classification of retinal diseases via optical coherence tomography	26
Frequency and content dual stream network for image dehazing	26
SAGNet: Synergistic Attention-Graph Network For video salient object detection	26
A Point-2s reinforcement learning biomimetic model for estimating and analyzing human 3D motion posture	25
Enhanced residual network for burst image super-resolution using simple base frame guidance	24
FSBI: Deepfake detection with frequency enhanced self-blended images	24
Visionary vigilance: Optimized YOLOV8 for fallen person detection with large-scale benchmark dataset	24
CMS-net: Edge-aware multimodal MRI feature fusion for brain tumor segmentation	24
Underwater bubble plume image generative model based on noise prior and multi conditional labels	24
Flow guided mutual attention for person re-identification	24
Depth assisted novel view synthesis using few images	24
Recent advances in deterministic human motion prediction: A review	23
Dual subspace clustering for spectral-spatial hyperspectral image clustering	23
Intelligent deep learning based ethnicity recognition and classification using facial images	22
Multi-view self-supervised learning for 3D facial texture reconstruction from single image	22
Editorial Board	22
Underwater image restoration based on light attenuation prior and color-contrast adaptive correction	22
Object tracking based on temporal and spatial context information	21
Dual-branch adaptive attention transformer for occluded person re-identification	21
STAFFormer: Spatio-temporal adaptive fusion transformer for efficient 3D human pose estimation	21
SDMNet: Spatially dilated multi-scale network for object detection for drone aerial imagery	21
TransMix: Crafting highly transferable adversarial examples to evade face recognition models	20
PAGML: Precise Alignment Guided Metric Learning for sketch-based 3D shape retrieval	20
FastNet: Fast high-resolution network for human pose estimation	20
Intelligent facial expression recognition and classification using optimal deep transfer learning model	19
NPVForensics: Learning VA correlations in non-critical phoneme–viseme regions for deepfake detection	19
Mixup Mask Adaptation: Bridging the gap between input saliency and representations via attention mechanism in feature mixup	19
Feature alignment via mutual mapping for few-shot fine-grained visual classification	19
A multi-branch dual attention segmentation network for epiphyte drone images	19
RFSC-net: Re-parameterization forward semantic compensation network in low-light environments	19
EMA-GS: Improving sparse point cloud rendering with EMA gradient and anchor upsampling	19
Enhancing consistency in virtual try-on: A novel diffusion-based approach	18
SADGFeat: Learning local features with layer spatial attention and domain generalization	18
Landmark-in-facial-component: Towards occlusion-robust facial landmark localization	18
Editorial Board	18
A new multi-picture architecture for learned video deinterlacing and demosaicing with parallel deformable convolution and self-attention blocks	18
Robust visual tracking via modified Harris hawks optimization	17

A spatial-frequency domain multi-branch decoder method for real-time semantic segmentation	17
Contrast enhancement of region of interest of backlit image for surveillance systems based on multi-illumination fusion	17
Social robot in service of the cognitive therapy of elderly people: Exploring robot acceptance in a real-world scenario	17
Mitigating human fall injuries: A novel system utilizing 3D 4-stream convolutional neural networks and image fusion	17
AGSAM-Net: UAV route planning and visual guidance model for bridge surface defect detection	17
PatchMixer: Rethinking network design to boost generalization for 3D point cloud understanding	17
CNN and Transformer-based deep learning models for automated white blood cell detection	17
Enhancing brain tumor classification in MRI images: A deep learning-based approach for accurate diagnosis	17
A deep-shallow and global–local multi-feature fusion network for photometric stereo	17
Detection of anomaly in surveillance videos using quantum convolutional neural networks	17
Matte anything: Interactive natural image matting with segment anything model	17
Face deidentification with controllable privacy protection	17
Anchor-based discriminative dual distribution calibration for transductive zero-shot learning	16
Real-time human-centric segmentation for complex video scenes	16
Adaptive and fast image superpixel segmentation approach	16
TQRFormer: Tubelet query recollection transformer for action detection	16
Class-discriminative domain generalization for semantic segmentation	16
AHA-track: Aggregating hierarchical awareness features for single	16
CollaborativeBEV: Collaborative bird eye view for reconstructing crowded environment	16
Editorial Board	16
CRFormer: A cross-region transformer for shadow removal	15
Adaptive scale matching for remote sensing object detection based on aerial images	15
Self-trained prediction model and novel anomaly score mechanism for video anomaly detection	15
DFG-HCEN: A distinctive-feature guided and hierarchical channel enhanced network-based infrared and visible image fusion	15
Source domain prior-assisted segment anything model for single domain generalization in medical image segmentation	15
Video anomaly detection based on a multi-layer reconstruction autoencoder with a variance attention strategy	15
PW-NeRF: Progressive wavelet-mask guided neural radiance fields view synthesis	15
OFACD: An end-to-end change detection network for small UAVs remote sensing with viewpoint differences	15
M2VAD: Multiview multi	15
Online multi-object tracking with δ-GLMB filter based on occlusion and identity switch handling	15
WPE: Weighted prototype estimation for few-shot learning	15
Multi-axis interactive multidimensional attention network for vehicle re-identification	14
Deep learning-based efficient diagnosis of periapical diseases with dental X-rays	14
Real-time gait biometrics for surveillance applications: A review	14
SAMNet: Adapting segment anything model for accurate light field salient object detection	14
Self-knowledge distillation based on knowledge transfer from soft to hard examples	14
Attentive spatial-temporal contrastive learning for self-supervised video representation	14
FgbCNN: A unified bilinear architecture for learning a fine-grained feature representation in facial expression recognition	14
A novel facial expression recognition model based on harnessing complementary features in multi-scale network with attention fusion	14
Editorial Board	13
Face and body-shape integration model for cloth-changing person re-identification	13
An edge-aware high-resolution framework for camouflaged object detection	13
Bridging efficiency and interpretability: Explainable AI for multi-classification of pulmonary diseases utilizing modified lightweight CNNs	13
Corrigendum to “A novel framework for diverse video generation from a single video using frame-conditioned denoising diffusion probabilistic model and ConvNeXt-V2” [Image and Vision Computing 154 (202	13
Incremental human action recognition with dual memory	13
Semantic-aware for point cloud domain adaptation with self-distillation learning	13
Stacked graph bone region U-net with bone representation for hand pose estimation and semi-supervised training	13
Data-driven 2D-EWT based diabetic retinopathy identification using hybrid neural network	13
H-net: Unsupervised domain adaptation person re-identification network based on hierarchy	13
Enhancing small object tracking with reversible rescaling networks	13
Adaptive graph reasoning network for object detection	13
Dynamic semantic prototype perception for text–video retrieval	13
Optimal deep transfer learning based ethnicity recognition on face images	13
Learning auto-scale representations for person re-identification	12
Few-shot class incremental learning via prompt transfer and knowledge distillation	12
Enhancing single-view 3D mesh reconstruction with the aid of implicit surface learning	12
Exploiting spatial and temporal context for online tracking with improved transformer	12
Perceiving local relative motion and global correlations for weakly supervised group activity recognition	12
Resource-aware strategies for real-time multi-person pose estimation	12
Editorial Board	12
CVAD-GAN: Constrained video anomaly detection via generative adversarial network	12
Guest Editorial : Learning with Manifolds in Computer Vision	12
Editorial Board	11
An analytical proof on suitability of Cauchy-Schwarz Divergence as the aggregation criterion in Region Growing Algorithm	11
A decision support system for acute lymphoblastic leukemia detection based on explainable artificial intelligence	11
Deep hybrid learning for facial expression binary classifications and predictions	11
GFFT: Global-local feature fusion transformers for facial expression recognition in the wild	11
Synthetic multi-view clustering with missing relationships and instances	11
SDE-RAE:CLIP-based realistic image reconstruction and editing network using stochastic differential diffusion	11
Monocular contextual constraint for stereo matching with adaptive weights assignment	11
Qualitative failures of image generation models and their application in detecting deepfakes	11
Self-distillation guided Semantic Knowledge Feedback network for infrared–visible image fusion	11
Black-box reversible adversarial examples with invertible neural network	11
Flexible multi-objective particle swarm optimization clustering with game theory to address human activity discovery fully unsupervised	11
Geometric feature statistics histogram for both real-valued and binary feature representations of 3D local shape	11
External knowledge-assisted Transformer for image captioning	11
Multi-object tracking with adaptive measurement noise and information fusion	11
Multi-granularity for knowledge distillation	11
Effective hybrid attention network based on pseudo-color enhancement in ultrasound image segmentation	10
A dual-channel network based on occlusion feature compensation for human pose estimation	10

Learning language to symbol and language to vision mapping for visual grounding	10
OCUCFormer: An Over-Complete Under-Complete Transformer Network for accelerated MRI reconstruction	10
Intelligent video anomaly detection and classification using faster RCNN with deep reinforcement learning model	10
Weather-degraded image semantic segmentation with multi-task knowledge distillation	10
Speaker independent VSR: A systematic review and futuristic applications	10
Fuzzy set-based Bernoulli Random Noise Weighted Loss for unsupervised person re-identification	10
Robust ensemble person reidentification via orthogonal fusion with occlusion handling	10
Editorial Board	10
ECT: Fine-grained edge detection with learned cause tokens	10
Transformer-based feature interactor for person re-identification with margin self-punishment loss	10
Twin relaxed least squares regression with classwise mean constraint for image classification	10
Semantic scene graph generation based on an edge dual scene graph and message passing neural network	10
Video object segmentation by multi-scale attention using bidirectional strategy	10
Unified Volumetric Avatar: Enabling flexible editing and rendering of neural human representations	10
Contrastive learning based facial action unit detection in children with hearing impairment for a socially assistive robot platform	10
Editorial Board	10
A dedicated benchmark for contour-based corner detection evaluation	10
Gait recognition via View-aware Part-wise Attention and Multi-scale Dilated Temporal Extractor	10
UIR-ES: An unsupervised underwater image restoration framework with equivariance and stein unbiased risk estimator	10
RGB road scene material segmentation	10
Feature extraction and fusion algorithm for infrared visible light images based on residual and generative adversarial network	10
Drone-NeRF: Efficient NeRF based 3D scene reconstruction for large-scale drone survey	9
Editorial Board	9
DiPS: Discriminative pseudo-label sampling with self-supervised transformers for weakly supervised object localization	9
Parameter efficient finetuning of text-to-image models with trainable self-attention layer	9
Mobile-friendly and multi-feature aggregation via transformer for human pose estimation	9
Hierarchical spatiotemporal Feature Interaction Network for video saliency prediction	9
Multiscale parallel deep CNN (mpdCNN) architecture for the real low-resolution face recognition for surveillance	9
A supervised approach for the detection of AM-FM signals’ interference regions in spectrogram images	9
Continual coarse-to-fine domain adaptation in semantic segmentation	9
Cross-modal hybrid architectures for gastrointestinal tract image analysis: A systematic review and futuristic applications	9
Boosting semi-supervised face recognition with raw faces	9
LELD: Learn enhancement by learning degradation	9
Improving defocus blur detection via adaptive supervision prior-tokens	9
Federated learning based nonlinear two-stage framework for full-reference image quality assessment: An application for biometric	9
Combining complementary trackers for enhanced long-term visual object tracking	9
AES-Net: An adapter and enhanced self-attention guided network for multi-stage glaucoma classification using fundus images	9
Does explainable machine learning uncover the black box in vision applications?	9
Efficient masked feature and group attention network for stereo image super-resolution	9
Knowledge graph construction in hyperbolic space for automatic image annotation	9
ASF-YOLO: A novel YOLO model with attentional scale sequence fusion for cell instance segmentation	9
Depth awakens: A depth-perceptual attention fusion network for RGB-D camouflaged object detection	9
Universal domain adaptation from multiple black-box sources	9
CF-SOLT: Real-time and accurate traffic accident detection using correlation filter-based tracking	9
Three dimensional tracking of rigid objects in motion using 2D optical flows	9
EFDCNet: Encoding fusion and decoding correction network for RGB-D indoor semantic segmentation	8
Noisy label facial expression recognition via face-specific label distribution learning	8
FRoundation: Are foundation models ready for face recognition?	8
Transferable dual multi-granularity semantic excavating for partially relevant video retrieval	8
Image–text feature learning for unsupervised visible–infrared person re-identification	8
Multi-level feature disentanglement network for cross-dataset face forgery detection	8
SAKD: Sparse attention knowledge distillation	8
RBGAN: Realistic-generation and balanced-utility GAN for face de-identification	8
GW-net: An efficient grad-CAM consistency neural network with weakening of random erasing features for semi-supervised person re-identification	8
Robust visual tracking based on modified mayfly optimization algorithm	8
SinWaveFusion: Learning a single image diffusion model in wavelet domain	8
A lightweight hash-directed global perception and self-calibrated multiscale fusion network for image super-resolution	8
Editorial Board	8
Rethinking the sample relations for few-shot classification	8
Machine learning applications in breast cancer prediction using mammography	8
Video prediction by efficient transformers	8
Attention guided multi-level feature aggregation network for camouflaged object detection	8
Generative feature-driven image replay for continual learning	8
EMNet: Edge-guided multi-level network for salient object detection in low-light images	8
Dense open-set recognition based on training with noisy negative images	8
Part-aware distillation and aggregation network for human parsing	8
Text-augmented Multi-Modality contrastive learning for unsupervised visible-infrared person re-identification	8
Person re-identification: A taxonomic survey and the path ahead	8
Learning to disentangle scenes for person re-identification	8
Improving multi-focus image fusion through Noisy image and feature difference network	7
Deep Isometric Maps	7
An end-to-end anti-shaking multi-focus image fusion approach	7
Ricci curvature based volumetric segmentation	7
Language conditioned multi-scale visual attention networks for visual grounding	7
Markerless multi-view 3D human pose estimation: A survey	7
Editorial Board	7
Grassmann manifold based framework for automated fall detection from a camera	7
A semi-parallel CNN-transformer fusion network for semantic change detection	7
Wave-based cross-phase representation for weakly supervised classification	7
DeepNet: Protection of deepfake images with aid of deep learning networks	7
Human activity recognition from UAV videos using a novel DMLC-CNN model	7
LP-GAN: Learning perturbations based on generative adversarial networks for point cloud adversarial attacks	7
Advances in deep learning-based image recognition of product packaging	7
An Active Transfer Learning framework for image classification based on Maximum Differentiation Classifier	7
Editorial to special issue on novel insights on ocular biometrics	7
Corrigendum to “STAFFormer: Spatio-temporal adaptive fusion transformer for efficient 3D human pose estimation” [Journal of Image and Vision Computing volume 149 (2024) 105142]	7
MODE: Monocular omnidirectional depth estimation via consistent depth fusion	7
EatSense: Human centric, action recognition and localization dataset for understanding eating behaviors and quality of motion assessment	7
VQA as a factoid question answering problem: A novel approach for knowledge-aware and explainable visual question answering	7
A novel framework for diverse video generation from a single video using frame-conditioned denoising diffusion probabilistic model and ConvNeXt-V2	7
POSER: POsed vs Spontaneous Emotion Recognition using fractal encoding	7
Editorial Board	7
IRPE: Instance-level reconstruction-based 6D pose estimator	7
BPMB: BayesCNNs with perturbed multi-branch structure for robust facial expression recognition	6
Cross channel weight sharing for image classification	6