Computational Visual Media

Papers
(The median citation count of Computational Visual Media is 2. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-09-01 to 2025-09-01.)
ArticleCitations
Front Cover1678
Geometry-aware 3D pose transfer using transformer autoencoder1484
Towards harmonized regional style transfer and manipulation for facial images574
Heuristic weakly supervised 3D human pose estimation105
3D Indoor Scene Geometry Estimation from a Single Omnidirectional Image: A Comprehensive Survey68
MA2Net: Multi-Scale Adaptive Mixed Attention Network for Image Demoiréing61
3D face recognition: A comprehensive survey in 202257
Controllable multi-domain semantic artwork synthesis45
A Biophysical-Based Skin Model for Heterogeneous Volume Rendering41
Neighborhood co-occurrence modeling in 3D point cloud segmentation36
FRNeRF: Fusion and regularization fields for dynamic view synthesis31
Temporal vectorized visibility for direct illumination of animated models29
Recent advances in glinty appearance rendering29
Practical construction of globally injective parameterizations with positional constraints27
A causal convolutional neural network for multi-subject motion modeling and generation27
Front cover25
Image-guided color mapping for categorical data visualization23
Central similarity consistency hashing for asymmetric image retrieval23
Multi-granularity sequence generation for hierarchical image classification22
Real-time distance field acceleration based free-viewpoint video synthesis for large sports fields20
Towards robustness and generalization of point cloud representation: A geometry coding method and a large-scale object-level dataset20
Anchor-Regularized GAN Priors20
MusicFace: Music-driven expressive singing face synthesis19
ARM3D: Attention-based relation module for indoor 3D object detection18
IIDM: Image-to-Image Diffusion Model for Semantic Image Synthesis18
Let's all dance: Enhancing amateur dance motions16
D2ANet: Difference-aware attention network for multi-level change detection from satellite imagery16
Multi3D: 3D-aware multimodal image synthesis14
Sphere face model: A 3D morphable model with hypersphere manifold latent space using joint 2D/3D training14
NeuS-PIR: Learning relightable neural surface using pre-integrated rendering13
Watertight surface reconstruction method for CAD models based on optimal transport13
Reference-guided structure-aware deep sketch colorization for cartoons13
A survey of urban visual analytics: Advances and future directions13
DepthGAN: GAN-based depth generation from semantic layouts13
Constructing self-supporting surfaces with planar quadrilateral elements12
Global video object segmentation with spatial constraint module12
MMRelief: Modeling Multi-Human Relief from a Single Photograph11
Addressing Missing Modality Challenges in MRI Images: A Comprehensive Review11
Prediction of scene plausibility11
Sem-iNeRF: Camera Pose Refinement by Inverting Neural Radiance Fields with Semantic Feature Consistency11
Co-occurrence based texture synthesis11
Mindstorms in Natural Language-Based Societies of Mind11
MDFP-Net: A model-driven deep neural network for Fourier ptychography10
Benchmarking visual SLAM methods in mirror environments10
Deep unfolding multi-scale regularizer network for image denoising9
Exploring Contextual Priors for Real-World Image Super-Resolution9
Hybrid Mesh-Neural Representation for 3D Transparent Object Reconstruction8
Deep panoramic depth prediction and completion for indoor scenes8
Deep image synthesis from intuitive user input: A review and perspectives8
Front cover8
Front cover8
A Voronoi diagram approach for detecting defects in 3D printed fiber-reinforced polymers from microscope images8
EFECL: Feature encoding enhancement with contrastive learning for indoor 3D object detection8
Front cover8
SGformer: Boosting transformers for indoor lighting estimation from a single image7
PuzzleSorter: Certainty-aware visual restoration of multiple cultural artifacts7
Dynamic ocean inverse modeling based on differentiable rendering7
Point cloud completion via structured feature maps using a feedback network7
FCDFusion: A Fast, Low Color Deviation Method for Fusing Visible and Infrared Image Pairs7
A two-step surface-based 3D deep learning pipeline for segmentation of intracranial aneurysms7
Message from the editor-in-chief7
Visual perception driven collage synthesis6
Non-dominated sorting based multi-page photo collage6
Immersive analytics meets artificial intelligence: A systematic review6
Front cover6
A survey on facial image deblurring6
Towards uniform point distribution in feature-preserving point cloud filtering6
Joint specular highlight detection and removal in single images via Unet-Transformer6
Focusing on your subject: Deep subject-aware image composition recommendation networks6
Erroneous pixel prediction for semantic image segmentation6
A Simple and Effective Filtering Scheme for Improving Neural Fields5
Message from the editor-in-chief5
A visual modeling method for spatiotemporal and multidimensional features in epidemiological analysis: Applied COVID-19 aggregated datasets5
An efficient algorithm for approximate Voronoi diagram construction on triangulated surfaces5
FilterGNN: Image feature matching with cascaded outlier filters and linear attention5
An anisotropic Chebyshev descriptor and its optimization for deformable shape correspondence5
Super-resolution reconstruction of single image for latent features4
Attention mechanisms in computer vision: A survey4
Front cover4
Message from the editor-in-chief4
MagicTalk: Implicit and explicit correlation learning for diffusion-based emotional talking face generation4
PVT v2: Improved baselines with pyramid vision transformer4
Polygonal finite element-based content-aware image warping4
Cross-modal learning using privileged information for long-tailed image classification4
Continual few-shot patch-based learning for anime-style colorization4
NPRportrait 1.0: A three-level benchmark for non-photorealistic rendering of portraits4
Autocompletion of repetitive stroking with image guidance4
Learning physically based material and lighting decompositions for face editing3
Noise4Denoise: Leveraging noise for unsupervised point cloud denoising3
LucIE: Language-Guided Local Image Editing for Fashion Images3
Multi-modal visual tracking: Review and experimental comparison3
BLNet: Bidirectional learning network for point clouds3
A survey on rendering homogeneous participating media3
Lossless Intrinsic Image Decomposition via Learning Shading Feature Filtering3
SAM-driven MAE pre-training and background-aware meta-learning for unsupervised vehicle re-identification3
AR assistance for efficient dynamic target search3
Emotion Amplification of Facial Videos Using a Fine-Tuned StyleGAN3
PMSSC: Parallelizable multi-subset based self-expressive model for subspace clustering3
Angle-uniform parallel coordinates3
3D corrective nose reconstruction from a single image3
JNeRF: An efficient heterogeneous NeRF model zoo based on Jittor3
Point Mask Transformer for Outdoor Point Cloud Semantic Segmentation3
Full-duplex strategy for video object segmentation3
CLIP-SP: Vision-language model with adaptive prompting for scene parsing3
Message from the best paper award committee3
Remote sensing tuning: A survey2
Audio-guided implicit neural representation for local image stylization2
Swin3D++: Effective Multi-Source Pretraining for 3D Indoor Scene Understanding2
Swin3D: A Pretrained Transformer Backbone for 3D Indoor Scene Understanding2
Progressive edge-sensing dynamic scene deblurring2
Contents2
Transformers in computational visual media: A survey2
ARNet: Attribute Artifact Reduction for G-PCC Compressed Point Clouds2
Foundation models meet visualizations: Challenges and opportunities2
Self-supervised coarse-to-fine monocular depth estimation using a lightweight attention module2
FastMAE: Efficient Masked Autoencoder with Offline Tokenizer2
Class-conditional domain adaptation for semantic segmentation2
LDSwap: A semantic-related latent code disentangling method in StyleSpace towards high-resolution face swapping2
Recent advances in 3D Gaussian splatting2
RecStitchNet: Learning to stitch images with rectangular boundaries2
Taming diffusion model for exemplar-based image translation2
0.076802968978882