Multimedia Systems

Papers
(The H4-Index of Multimedia Systems is 25. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-06-01 to 2025-06-01.)
ArticleCitations
LMFE-RDD: a road damage detector with a lightweight multi-feature extraction network96
Unsupervised deep metric learning algorithm for crop disease images based on knowledge distillation networks81
360° video quality assessment based on saliency-guided viewport extraction81
A research for sound event localization and detection based on local–global adaptive fusion and temporal importance network79
Pseudo-global strategy-based visual comfort assessment considering attention mechanism79
A visual question answering model based on image captioning76
SS-CMT: a label independent cross-modal transferable adversarial video attack with sparse strategy61
Correction: STASiamRPN: visual tracking based on spatiotemporal and attention54
CAPNet: tomato leaf disease detection network based on adaptive feature fusion and convolutional enhancement53
Automatic lymph node segmentation using deep parallel squeeze & excitation and attention Unet51
Towards domain adaptation underwater image enhancement and restoration48
User authentication method based on keystroke dynamics and mouse dynamics using HDA42
Deep Learning-based forgery detection and localization for compressed images using a hybrid optimization model42
Recent advancement in haze removal approaches41
SFRA: spatial fusion regression augmentation network for facial landmark detection40
Segmentation-aware image super-resolution with generative adversarial networks39
A comparative study of color quantization methods using various image quality assessment indices39
Multi-level sentiment-aware clustering for denoising in multimodal sentiment analysis with ASR errors37
Dual-branch spectral–spatial feature extraction network for multispectral image compression36
SEMNet: a simple and efficient MLP-based network for 3D Face point clouds landmarks localization36
Multi-view Isolated sign language recognition based on cross-view and multi-level transformer34
Dual convolutional neural network with attention for image blind denoising33
Feature fusion and optimization integrated refined deep residual network for diabetic retinopathy severity classification using fundus image32
Generalizing sentence-level lipreading to unseen speakers: a two-stream end-to-end approach30
The segmented UEC Food-100 dataset with benchmark experiment on food detection25
Improving text-image cross-modal retrieval with contrastive loss25
Point cloud inpainting with normal-based feature matching25
0.068725109100342