Computational Visual Media

Papers
(The TQCC of Computational Visual Media is 8. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-05-01 to 2024-05-01.)
ArticleCitations
Attention mechanisms in computer vision: A survey750
PCT: Point cloud transformer732
PVT v2: Improved baselines with Pyramid Vision Transformer573
RGB-D salient object detection: A survey156
A survey of visual analytics techniques for machine learning125
Visual attention network111
A survey on deep geometry learning: From a representation perspective66
Transformers in computational visual media: A survey64
View planning in robot active vision: A survey of systems, algorithms, and applications43
A survey of recent interactive image segmentation methods40
High-quality indoor scene 3D reconstruction with RGB-D cameras: A brief review36
A survey on deep learning-based Monte Carlo denoising31
EfficientPose: Efficient human pose estimation with neural architecture search27
DualFace: Two-stage drawing guidance for freehand portrait sketching24
Light field salient object detection: A review and benchmark24
Learning conditional photometric stereo with high-resolution features21
Saliency-based image correction for colorblind patients21
iSeeBetter: Spatio-temporal video super-resolution using recurrent generative back-projection networks19
Deep image synthesis from intuitive user input: A review and perspectives18
Improved fuzzy clustering for image segmentation based on a low-rank prior17
S4Net: Single stage salient-instance segmentation17
A new dataset of dog breed images and a benchmark for finegrained classification17
Inversion-free geometric mapping construction: A survey17
Detecting human—object interaction with multi-level pairwise feature network16
Scene text removal via cascaded text stroke detection and erasing16
Kernel-blending connection approximated by a neural network for image classification15
A survey of urban visual analytics: Advances and future directions15
Image resizing by reconstruction from deep features15
Image smoothing based on global sparsity decomposition and a variable parameter14
An end-to-end convolutional network for joint detecting and denoising adversarial perturbations in vehicle classification14
Progressive edge-sensing dynamic scene deblurring14
Learning to assess visual aesthetics of food images13
Low and non-uniform illumination color image enhancement using weighted guided image filtering12
Reference-guided structure-aware deep sketch colorization for cartoons11
Joint specular highlight detection and removal in single images via Unet-Transformer11
Jittor-GAN: A fast-training generative adversarial network model zoo based on Jittor11
Can attention enable MLPs to catch up with CNNs?10
Foveated rendering: A state-of-the-art survey9
WGI-Net: A weighted group integration network for RGB-D salient object detection9
Joint 3D facial shape reconstruction and texture completion from a single image9
ClusterSLAM: A SLAM backend for simultaneous rigid body clustering and motion estimation8
Point cloud completion via structured feature maps using a feedback network8
Multispectral image denoising using sparse and graph Laplacian Tucker decomposition8
Mask-aware photorealistic facial attribute manipulation8
Self-supervised coarse-to-fine monocular depth estimation using a lightweight attention module8
Multi-scale joint feature network for micro-expression recognition8
0.019265174865723