Journal of Visual Communication and Image Representation

Papers
(The TQCC of Journal of Visual Communication and Image Representation is 6. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-05-01 to 2026-05-01.)
ArticleCitations
Register assisted aggregation for visual place recognition329
A robust coverless image-synthesized video steganography based on asymmetric structure120
DBCFNet: Underwater image enhancement network based on dual branch convolution and cross level feature fusion109
FlareDiffusion: Conditional diffusion model for nighttime flare removal89
Distance distributions and runtime analysis of perceptual hashing algorithms80
Corner-to-Center long-range context model for efficient learned image compression77
Edge-aware object pixel-level representation tracking74
SIM-MFR: Spatial interactions mechanisms based multi-feature representation for background modeling60
Dense-sparse representation matters: A point-based method for volumetric medical image segmentation54
FormerPose: An efficient multi-scale fusion Transformer network based on RGB-D for 6D pose estimation50
Heterogeneity constrained color ellipsoid prior image dehazing algorithm50
Faster-slow network fused with enhanced fine-grained features for action recognition49
A no-reference panoramic image quality assessment with hierarchical perception and color features48
Real-world image dehazing with improved joint enhancement and exposure fusion45
Capsule network with using shifted windows for 3D human pose estimation45
SICNet: Learning selective inter-slice context via Mask-Guided Self-knowledge distillation for NPC segmentation42
DB-TASNet for disease diagnosis and lesion segmentation in medical images41
Multi-image super-resolution based low complexity deep network for image compressive sensing reconstruction39
DDFusion: An efficient multi-exposure fusion network with dense pyramidal convolution and de-correlation fusion38
PTR-CNN for in-loop filtering in video coding37
Fast HEVC inter-frame coding based on LSTM neural network technology34
Inter-image Token Relation Learning for weakly supervised semantic segmentation34
U-TPE: A universal approximate thumbnail-preserving encryption method for lossless recovery33
A fast intra CU partition algorithm in Versatile Video Coding for 360-degree video33
Advancing white balance correction through deep feature statistics and feature distribution matching32
Learning informative and discriminative semantic features for robust facial expression recognition31
Reversible data hiding based on automatic contrast enhancement using histogram expansion29
3D human mesh recovery: Comparative review, models, and prospects29
Aligning computational and human perceptions of image complexity: A dual-task framework for prediction and localization29
High-capacity reversible data hiding in encrypted images based on adaptive block coding selection28
All-in-focus image fusion using graph wavelet transform for multi-modal light field28
DA4NeRF: Depth-aware Augmentation technique for Neural Radiance Fields27
Neural Style Transfer for image within images and conditional GANs for destylization27
PRA-TPE: Perfectly Recoverable Approximate Thumbnail-Preserving Image Encryption26
Editorial Board26
Towards fast and effective low-light image enhancement via adaptive Gamma correction and detail refinement25
Personality modeling from image aesthetic attribute-aware graph representation learning25
Locality-constraint Representation with Minkowski distance metric for an effective Face Hallucination25
Pedestrian trajectory prediction using multi-cue transformer25
GLST-Net: Global and local spatio-temporal feature fusion network for skeleton-based action recognition25
AI-assisted deepfake detection using adaptive blind image watermarking24
Masked latent transformer with random masking ratio to advance the diagnosis of dental fluorosis24
A hierarchical multi-modal cross-attention model for face anti-spoofing24
Action density based frame sampling for human action recognition in videos23
HD-YOLO: Using radius-aware loss function for head detection in top-view fisheye images22
Detection of HEVC double compression based on boundary effect of TU and non-zero DCT coefficient distribution22
TransGANomaly: Transformer based Generative Adversarial Network for Video Anomaly Detection22
MSTG: Multi-Scale Transformer with Gradient for joint spatio-temporal enhancement22
Learning-based JNCD prediction for quality-wise perceptual quantization in HEVC21
Editorial Board21
Exploring training data-free video generation from a single image via a stable diffusion model21
End-to-end wavelet block feature purification network for efficient and effective UAV object tracking21
Zero-CSC: Low-light image enhancement with zero-reference color self-calibration21
Lightweight JPEG image steganalysis using dilated blind-spot network21
DetailCaptureYOLO: Accurately Detecting Small Targets in UAV Aerial Images19
Blind deblurring with fractional-order calculus and local minimal pixel prior19
DiffEEGBooth: A diffusion-based EEG generation framework for motor imagery with temporal consistency and neurophysiological constraint19
Person re-identification based on improved attention mechanism and global pooling method19
Semantic similarity guided contrastive hashing for unsupervised cross-modal retrieval19
Robust text watermarking based on average skeleton mass of characters against cross-media attacks18
Multi-task learning for video anomaly detection18
Editorial Board18
Dual-Branch Wavelet Diffusion models with Dual-Prior Refinement for Underwater Image Enhancement18
Multiple transformation function estimation for image enhancement18
Global–local dual-branch network with local feature enhancement for visual tracking18
An active contour model based on Jeffreys divergence and clustering technology for image segmentation18
EMCFN: Edge-based Multi-scale Cross Fusion Network for video frame interpolation17
OODNet: A deep blind JPEG image compression deblocking network using out-of-distribution detection16
Opinion-unaware blind quality assessment of AI-generated omnidirectional images based on deep feature statistics16
ADPNet: Attention based dual path network for lane detection16
EERCA-ViT: Enhanced Effective Region and Context-Aware Vision Transformers for image sentiment analysis16
Multi-scale convolutional neural networks and saliency weight maps for infrared and visible image fusion16
SR4KVQA: Video quality assessment database and metric for 4K super-resolution16
Multi-modal semantic embedding network for 3D shape recognition and retrieval16
Bi-READ: Bi-Residual AutoEncoder based feature enhancement for video anomaly detection15
A non-extended 3D mesh secret sharing scheme adapted for FPGA processing15
Multiscale residual gradient attention for face anti-spoofing15
AMCFNet: Asymmetric multiscale and crossmodal fusion network for RGB-D semantic segmentation in indoor service robots15
SRI-Net: Similarity retrieval-based inference network for light field salient object detection15
Texture-aware fast mode decision and complexity allocation for VVC based point cloud compression15
A novel and efficient image dehazing technique for Advanced Driver Assistance Systems15
WAGAN: Bi-orthogonal Wavelet-Guided Attention Network for image and video dehazing15
Dictionary-based histogram packing technique for lossless image compression15
LRHW-AP: Using ranking-based metric as loss for Person Re-Identification15
Lite transformer with medium self attention for efficient traffic sign recognition15
Stacked deformable convolution network with weighted non-local attention and branch residual connection for image quality assessment15
Virtualized three-dimensional reference tables for efficient data embedding14
PVT2DNet: Polyp segmentation with vision transformer and dual decoder refinement strategy14
Part-attentive kinematic chain-based regressor for 3D human modeling14
A super-resolution-based license plate recognition method for remote surveillance14
Efficient image dehazing algorithm using multiple priors constraints14
Green learning: Introduction, examples and outlook14
Progressive enhancement network with pseudo labels for weakly supervised temporal action localization14
CCNet: CNN model with channel attention and convolutional pooling mechanism for spatial image steganalysis14
Depth error points optimization for 3D Gaussian Splatting in few-shot synthesis14
Multiple integration model for single-source domain generalizable person re-identification14
Locality sensitive hashing scheme based on online-learning14
GSD-YOLOX: Lightweight and more accurate object detection models13
Iterative decoupling deconvolution network for image restoration13
Aethra-net: Single image and video dehazing using autoencoder13
A Transformer-based invertible neural network for robust image watermarking13
Action recognition method based on lightweight network and rough-fine keyframe extraction13
ADcFNet-deep learning based facial expression identification using FER vision transformer13
Human gait recognition using joint spatiotemporal modulation in deep convolutional neural networks13
Face reconstruction with detailed skin features via three selfie images13
Robust reversible image watermarking scheme based on spread spectrum13
Time series analysis using memory enhanced liquid neural network13
High-capacity multi-MSB predictive reversible data hiding in encrypted domain for triangular mesh models13
Corrigendum to “Generative detect for occlusion object based on occlusion generation and feature completing” [J. Visual Commun. Image Represent. 78 (2021) 103189]13
Knowledge-guided quantization-aware training for EEG-based emotion recognition13
Scientific mapping and bibliometric analysis of research advancements in underwater image enhancement13
Survey: 3D watermarking techniques12
Improved threat item detection in baggage X-ray imagery through image projection12
UnifiedTT: Visual tracking with unified transformer12
Deep chroma prediction of Wyner–Ziv frames in distributed video coding of wireless capsule endoscopy video12
Editorial Board12
Salient object detection enhanced pseudo-labels for weakly supervised semantic segmentation12
Contrastive Deep Supervision Meets self-knowledge distillation12
P-NOC: Adversarial training of CAM generating networks for robust weakly supervised semantic segmentation priors12
Context-dependent emotion recognition12
Neighbor2Global: Self-supervised image denoising for Poisson-Gaussian noise12
Joint strong edge and multi-stream adaptive fusion network for non-uniform image deblurring12
MIEI:A KID-based quality assessment metric for grayscale industrial equipment images12
An efficient optimization of measurement matrix for compressive sensing12
Image cropping based on order learning12
SemMatcher: Semantic-aware feature matching with neighborhood consensus12
Decomposition and replacement: Spatial knowledge distillation for monocular depth estimation12
Image downscaling via co-occurrence learning12
Decomposing style, content, and motion for videos12
DRC: Chromatic aberration intensity priors for underwater image enhancement12
MemFlow-AD: An anomaly detection and localization model based on memory module and normalizing flow12
Transferable targeted adversarial attack via multi-source perturbation generation and integration11
Dual-channel prior-based deep unfolding with contrastive learning for underwater image enhancement11
A two-step enhanced tensor denoising framework based on noise position prior and adaptive ring rank11
Compressive Spectral Video Sensing using the Convolutional Sparse Coding framework CSC4D11
Multi-scale features and attention guided for brain tumor segmentation11
Low-complexity 11
CC-SMC: Chain coding-based segmentation map lossless compression11
Improved inter-view correlations for low complexity MV-HEVC11
SiamMBFAN: Siamese tracker with multi-branch feature aggregation network11
CPA-YOLOv7: Contextual and pyramid attention-based improvement of YOLOv7 for drones scene target detection11
MG-SSAF: An advanced vision Transformer11
Retrieval augmented generation for smart calorie estimation in complex food scenarios11
Image copy-move forgery detection using three-stage matching with constraints11
Perceptually diverse visual saliency prediction with global context attention11
Image watermarking using DNST-PHFMs magnitude domain vector AGGM-HMT11
Intermediate deep feature coding for human–machine vision collaboration11
Residual spatiotemporal convolutional networks for face anti-spoofing11
Multiple correlation filters with gaussian constraint for fast online tracking11
Res2former: A multi-scale fusion based transformer feature extraction method11
Dual-branch manifold information consistency for unsupervised visible–infrared person re-identification10
Research on a face recognition algorithm based on 3D face data and 2D face image matching10
Human object interaction detection based on feature optimization and key human-object enhancement10
Screen-shooting resistant image watermarking based on lightweight neural network in frequency domain10
Gradient degradation-aware rate control for VVC using Nash equilibrium10
Detecting Water in Visual Image Streams from UAV with Flight Constraints10
A dual-task region-boundary aware neural network for accurate pulmonary nodule segmentation10
Multi-scale Superpixel based Hierarchical Attention model for brain CT classification10
Vision-language tracking with attention-based optimization10
Multi-scale and multi-patch transformer for sandstorm image enhancement10
A novel high-fidelity reversible data hiding scheme based on multi-classification pixel value ordering10
NCC-FDM: Frequency-domain diffusion model driven by non-physical-domain color correction for underwater image enhancement10
Document forgery detection based on spatial-frequency and multi-scale feature network10
Corrigendum to “Lightweight macro-pixel quality enhancement network for light field images compressed by versatile video coding” [J. Vis. Commun. Image Represent. 105 (2024) 104329]10
Object semantic-guided graph attention feature fusion network for Siamese visual tracking10
RQVR: A multi-exposure image fusion network that optimizes rendering quality and visual realism10
Machine learning and transformers for thyroid carcinoma diagnosis10
A simple transformer-based baseline for crowd tracking with Sequential Feature Aggregation and Hybrid Group Training10
Incremental pseudo-labeling for black-box unsupervised domain adaptation9
Self2Channel: Self-supervised denoising of different regions using coalition game based channel mask9
Editorial Board9
Multi-dimensional human preference assessment for AI-generated images with supervised contrastive learning9
Joint multi-scale transformers and pose equivalence constraints for 3D human pose estimation9
BAO: Background-aware activation map optimization for weakly supervised semantic segmentation without background threshold9
Gesture image recognition method based on DC-Res2Net and a feature fusion attention module9
Applying usability assessment method for surveillance video anomaly detection with multiple distortion9
LFSimCC: Spatial fusion lightweight network for human pose estimation9
Weakly supervised semantic segmentation based on superpixel affinity9
Cell tracking-by-detection using elliptical bounding boxes9
Chosen plaintext attack on JPEG image encryption with adaptive key and run consistency9
Unknown Sample Selection and Discriminative Classifier Learning for Generalized Category Discovery9
DCPNet: Deformable Control Point Network for image enhancement9
Towards real-world haze removal with uncorrelated graph model9
A robust and adaptive framework with space–time memory networks for Visual Object Tracking9
KF-GS: Kalman filter-guided Gaussian splatting for real-time high-quality dynamic scene reconstruction9
A channel-wise contextual module for learned intra video compression9
TD3Net: A temporal densely connected multi-dilated convolutional network for lipreading8
Enhancement-suppression driven lightweight fine-grained micro-expression recognition8
An illumination-guided dual-domain network for image exposure correction8
Multi-branch Segmentation-guided Attention Network for crowd counting8
Category-based depth incorporation for salient object ranking8
Transformer-based weakly supervised 3D human pose estimation8
Densely aggregated U-net with spatial-spectral interaction transformer for hyperspectral compressed imaging reconstruction8
On the multi-level embedding of crypto-image reversible data hiding8
HEVC’s intra mode process expedited using Histogram of Oriented Gradients8
Infrared small UAV target detection via depthwise separable residual dense attention network8
Reversible data hiding in encrypted 3D mesh models via ripple prediction8
3D hand reconstruction via aggregating intra and inter graphs guided by prior knowledge for hand-object interaction scenario8
Quality assessment of windowed 6DoF video with viewpoint switching8
Improving small objects detection using transformer8
Correlation-attention guided regression network for efficient crowd counting8
Effective sparse tracking with convolution-based discriminative sparse appearance model8
Accumulated micro-motion representations for lightweight online action detection in real-time8
Multi-stream feature refinement network for human object interaction detection8
Efficient object tracking on edge devices with MobileTrack8
Enhanced monocular depth estimation using novel scale-invariant Error Structure Similarity Index measure optimization in Convolutional Neural network architecture8
A no-reference perceptual image quality assessment database for learned image codecs8
Low-complexity content-aware encoding optimization of batch video8
Reversible data hiding for color images based on prediction-error value ordering and adaptive embedding8
3D human model guided pose transfer via progressive flow prediction network8
Dynamic gesture recognition using 3D central difference separable residual LSTM coordinate attention networks8
Offline writer identification approach using moment features and high-order correlation functions8
Editorial Board8
Blind quality assessment of light field image based on view and focus stacks8
LaDeL: Lane detection via multimodal large language model with visual instruction tuning7
ThermalDiff: A diffusion architecture for thermal image synthesis7
Texture-aware and color-consistent learning for underwater image enhancement7
DFF-Matcher: Robust cross-source registration with density-fused feature and bidirectional consensus matching7
Multiscale Global-Aware Channel Attention for Person Re-identification7
Multi-scale sampling and feature fusion for dynamic human rendering7
Contour enhanced image super-resolution7
Multiscale spatial temporal attention graph convolution network for skeleton-based anomaly behavior detection7
SecureDL: A privacy preserving deep learning model for image recognition over cloud7
Editorial Board7
Unbiased feature generating for generalized zero-shot learning7
SCPNet: Self-constrained parallelism network for keypoint-based lightweight object detection7
Lightweight whole-body mesh recovery with joints and depth aware hand detail optimization7
Hierarchical boundary feature alignment network for video salient object detection7
HySaM: An improved hybrid SAM and Mask R-CNN for underwater instance segmentation7
Multi-stage feature-fusion dense network for motion deblurring7
DAGNet: Depth-aware Glass-like objects segmentation via cross-modal attention7
SAFA: Lifelong Person Re-Identification learning by statistics-aware feature alignment7
Infrared dim and small target detection based on U-Transformer7
Information entropy induced graph convolutional network for semantic segmentation7
JPEG image encryption with grouping coefficients based on entropy coding7
Similarity-aware generative adversarial network for facial expression image translation7
Night vision self-supervised Reflectance-Aware Depth Estimation based on reflectance7
Non-local feature aggregation quaternion network for single image deraining7
Deep semantic image compression via cooperative network pruning7
Adaptive smoothness evaluation and multiple asymmetric histogram modification for reversible data hiding7
Exploring a Non-Parametric Uncertain Adaptive training method for facial expression recognition7
A no-reference underwater image quality evaluator via quality-aware features7
From synthetic to natural — single natural image dehazing deep networks using synthetic dataset domain randomization7
Depth from focus using directional spherical difference filter and vector to scalar fusion7
Knowledge NeRF: Few-shot novel view synthesis for dynamic articulated objects7
Copy Move Forgery detection and localisation robust to rotation using block based Discrete Cosine Transform and eigenvalues7
DCAM: Disturbed class activation maps for weakly supervised semantic segmentation7
A multi-stage spatio-temporal adaptive network for video super-resolution7
Learn decision trees with deep visual primitives7
Editorial Board7
0.13178205490112