Multimedia Systems

Papers
(The TQCC of Multimedia Systems is 3. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-04-01 to 2025-04-01.)
ArticleCitations
Correction: Parentheses insertion based sentence-level text adversarial attack144
Personalized decision-making for agents in face-to-face interaction in virtual reality94
High-resolution network-based multi-feature fusion for generalized forgery detection87
SS-YOLOv8: small-size object detection algorithm based on improved YOLOv8 for UAV imagery82
Spatial-temporal transformer network for protecting person-of-interest from deepfaking76
SEMNet: a simple and efficient MLP-based network for 3D Face point clouds landmarks localization75
Multi-scale attention and loss penalty mechanism for multi-view clustering75
Constraint embedding for prompt tuning in vision-language pre-trained model72
Binary classification for imbalanced data using data conformity mechanism70
Image channel and spatial information integrated method for fall detection69
An efficient federated learning method based on enhanced classification-GAN for medical image classification55
RG-YOLO: multi-scale feature learning for underwater target detection51
SR-DAYOLOv8: cross-domain adaptive object detection based on super-resolution domain classifier45
ENet: event based highlight generation network for broadcast sports videos45
More accurate heatmap generation method for human pose estimation43
A multi-scale channel attention network with federated learning for magnetic resonance image super-resolution38
A cross-texture haptic model based on tactile feature fusion38
Multi-level pyramid fusion for efficient stereo matching38
Social bot detection on Twitter: robustness evaluation and improvement36
Propagating prior information with transformer for robust visual object tracking35
Adaptive B-spline curve fitting with minimal control points using an improved sparrow search algorithm for geometric modeling of aero-engine blades34
Physical-prior-guided single image dehazing network via unpaired contrastive learning33
MSADRCN: meta-learning based joint super-resolution fusion of infrared and visible images32
Unsupervised deep metric learning algorithm for crop disease images based on knowledge distillation networks29
TEST-Net: transformer-enhanced Spatio-temporal network for infectious disease prediction28
Mutual-weighted feature disentanglement for unsupervised domain adaptation28
Quality assessment of identity inpainting based on multidimensional discrimination27
Implicit neural representation steganography by neuron pruning27
Exploring multi-dimensional interests for session-based recommendation26
Channel modulus normalization for CNN image classification26
Exploring the impact of volumetric graphics on the engagement of broadcast media professionals24
A research for sound event localization and detection based on local–global adaptive fusion and temporal importance network23
Online spatio-temporal action detection with adaptive sampling and hierarchical modulation23
Irregular feature enhancer for low-dose CT denoising21
CAFIN: cross-attention based face image repair network21
LLR-MVSNet: a lightweight network for low-texture scene reconstruction21
Pseudo-global strategy-based visual comfort assessment considering attention mechanism20
Model-based portrait video compression with spatial constraint and adaptive pose processing20
ParallelNet: multiple backbone network for detection tasks on thigh bone fracture20
Camouflage design, assessment and breaking techniques: a survey19
A framework of generative adversarial networks with novel loss for JPEG restoration and anti-forensics19
Correction to: Deep reconstruction of 1D ISOMAP representations18
Audio steganography with less modification to the optimal matching CNV-QIM path with the minimal hamming distance expected value to a secret18
Modeling large-scale live video streaming client behavior17
Watermarking techniques for three-dimensional (3D) mesh models: a survey17
3D human pose estimation with multi-scale graph convolution and hierarchical body pooling17
Structural smoothness low-rank matrix recovery via outlier estimation for image denoising17
Objective image fusion evaluation method for target recognition based on target quality factor16
Spatial–temporal correlations learning and action-background jointed attention for weakly-supervised temporal action localization16
Point cloud denoising algorithm with geometric feature preserving16
A novel image encryption cryptosystem based on true random numbers and chaotic systems16
Multi-scale feature balance enhancement network for pedestrian detection16
A novel SPLIT-SIM approach for efficient image retrieval16
Code generation from a graphical user interface via attention-based encoder–decoder model16
Point cloud inpainting with normal-based feature matching16
Local feature fusion and SRC-based decision fusion for ear recognition16
Improving text-image cross-modal retrieval with contrastive loss15
Robust 3D face modeling and tracking from RGB-D images15
Towards a multimodal human activity dataset for healthcare15
BCMask: a finer leaf instance segmentation with bilayer convolution mask14
Rescue decision via Earthquake Disaster Knowledge Graph reasoning14
An improved contrast enhancement for dark images with non-uniform illumination based on edge preservation14
Correction: STASiamRPN: visual tracking based on spatiotemporal and attention14
Hierarchical cross-modal contextual attention network for visual grounding13
A review of micro-expression spotting: methods and challenges13
View-target relation-guided unsupervised 2D image-based 3D model retrieval via transformer13
SMPC: boosting social media popularity prediction with caption13
Image-text matching using multi-subspace joint representation13
Micro-expression spotting network based on attention and one-dimensional convolutional sliding window13
Cascaded deep residual learning network for single image dehazing13
A social-aware video sharing solution using demand prediction of epidemic-based propagation in wireless networks13
BENet: bi-directional enhanced network for image captioning12
Dual-branch spectral–spatial feature extraction network for multispectral image compression12
Generalizing sentence-level lipreading to unseen speakers: a two-stream end-to-end approach12
Assessing the adoption of the Yavuz Battleship application in the mixed reality environment using the technology acceptance model12
Improving the application performance of Loki via algorithm optimization12
GHCL: Gaussian heuristic curriculum learning for Brain CT report generation12
Spatial attention-guided deformable fusion network for salient object detection12
Facial expression intensity estimation using label-distribution-learning-enhanced ordinal regression12
VMSG: a video caption network based on multimodal semantic grouping and semantic attention12
DSTC-Net: differential spatio-temporal correlation network for similar action recognition11
Virtual human pose estimation in a fire education system for children with autism spectrum disorders11
Personalized time-sync comment generation based on a multimodal transformer11
G-UNeXt: a lightweight MLP-based network for reducing semantic gap in medical image segmentation11
360° video quality assessment based on saliency-guided viewport extraction11
A comparative study of color quantization methods using various image quality assessment indices11
An adaptive Bagging algorithm based on lightweight transformer for multi-class imbalance recognition10
SMA-GCN: a fall detection method based on spatio-temporal relationship10
Lite general network and MagFace CNN for micro-expression spotting in long videos10
The segmented UEC Food-100 dataset with benchmark experiment on food detection10
Multiscale geometric window transformer for orthodontic teeth point cloud registration10
Image captioning for cultural artworks: a case study on ceramics10
Layer-wise enhanced transformer with multi-modal fusion for image caption10
An overview of deep learning techniques for COVID-19 detection: methods, challenges, and future works10
A lightweight algorithm for pedestrian detection in overhead images9
An efficient black widow optimization-based faster R-CNN for classification of COVID-19 from CT images9
Synchronous composition and semantic line detection based on cross-attention9
Recent advancements of deep learning in detecting breast cancer: a survey9
Auto ROI & mask R-CNN model for QR code beautification (ARM-QR)9
GVA: guided visual attention approach for automatic image caption generation9
A cross-view geo-localization method guided by relation-aware global attention9
Full reference image quality assessment based on dual-space multi-feature fusion9
Dual graph-structured semantics multi-subspace learning for cross-modal retrieval9
Real emotion seeker: recalibrating annotation for facial expression recognition9
PS-YOLO: a small object detector based on efficient convolution and multi-scale feature fusion9
Deep Learning-based forgery detection and localization for compressed images using a hybrid optimization model9
Real-walk modelling: deep learning model for user mobility in virtual reality9
Wacml: based on graph neural network for imbalanced node classification algorithm9
An object detection-based few-shot learning approach for multimedia quality assessment8
Multimodal heterogeneous graph convolutional network for image recommendation8
Multi-object tracking with scale-aware transformer and enhanced association strategy8
Multi-level sentiment-aware clustering for denoising in multimodal sentiment analysis with ASR errors8
FedFV: federated face verification via equivalent class embeddings8
Kronecker-factored Approximate Curvature with adaptive learning rate for optimizing model-agnostic meta-learning8
Deepphysio: detecting deepFake with non-personalized feature of physiological signal8
Domain-adaptive person re-identification via domain alignment and mutual pseudo-label refinement8
IOPCNet: inner and outer point classification based low overlap rate local-to-global point cloud registration8
A defensive attention mechanism to detect deepfake content across multiple modalities8
Optimize brain tumor multiclass classification with manta ray foraging and improved residual block techniques8
Frequency disentangled residual network8
Inception-like Large Kernel network for lightweight image super-resolution8
Multi-object tracking based on graph neural networks8
Authenticable medical image-sharing scheme based on embedded small shadow QR code and blockchain framework8
Segmentation-aware image super-resolution with generative adversarial networks8
LCFormer: linear complexity transformer for efficient image super-resolution7
Global adaptive histogram feature network for automatic segmentation of infection regions in CT images7
Motion synthesis via distilled absorbing discrete diffusion model7
DPNet: a dual-attention patching network for breast tumor segmentation in an ultrasound image7
Electric vehicle routing optimization under 3D electric energy modeling7
Gated feature aggregate and alignment network for real-time semantic segmentation of street scenes7
Automatic lymph node segmentation using deep parallel squeeze & excitation and attention Unet7
TFEN: a two-dimensional feature extraction network for single image super-resolution7
LET-Net: locally enhanced transformer network for medical image segmentation7
SenseMLP: a parallel MLP architecture for sensor-based human activity recognition7
SS-CMT: a label independent cross-modal transferable adversarial video attack with sparse strategy7
A visual question answering model based on image captioning7
Map modeling for full body gesture using flex sensor and machine learning algorithms6
Generalizing to unseen domains via PatchMix6
TrafficTrack: rethinking the motion and appearance cue for multi-vehicle tracking in traffic monitoring6
An insight into topological, machine and Deep Learning-based approaches for influential node identification in social media networks: a systematic review6
Composite makeup transfer model based on generative adversarial networks6
A gated multi-hierarchical feature fusion network for recognizing steel plate surface defects6
Adaptafood: an intelligent system to adapt recipes to specialised diets and healthy lifestyles6
ATMKD: adaptive temperature guided multi-teacher knowledge distillation6
ITrans: generative image inpainting with transformers6
Towards domain adaptation underwater image enhancement and restoration6
A feature pyramid network with adaptive fusion strategy and enhanced semantic information6
SFRA: spatial fusion regression augmentation network for facial landmark detection6
SA-MDRAD: sample-adaptive multi-teacher dynamic rectification adversarial distillation6
Development of outdoor swimmers detection system with small object detection method based on deep learning6
Unsupervised single image dehazing with generative adversarial network6
An entropy-weighted local intensity clustering-based model for segmenting intensity inhomogeneous images6
LMFE-RDD: a road damage detector with a lightweight multi-feature extraction network6
Role of deep learning models and analytics in industrial multimedia environment6
Correction to: Cellular automata-based CMF detection under single and multiple post-processing attacks6
Image and audio caps: automated captioning of background sounds and images using deep learning6
Medical image encryption and compression by adaptive sigma filterized synorr certificateless signcryptive Levenshtein entropy-coding-based deep neural learning6
FSformer: fusing frequency and spatial domain transformer network for underwater image enhancement6
Recent advancement in haze removal approaches6
Sat-DehazeGAN: an efficient dehazing model in water-sky background for river-sea transport5
RGB-Net: transformer-based lightweight low-light image enhancement network via RGB channel separation5
RCENet: an efficient pose estimation network based on regression correction5
Underwater small and occlusion object detection with feature fusion and global context decoupling head-based YOLO5
Dual-branch aggregation and edge refinement network for few shot semantic segmentation5
Dual convolutional neural network with attention for image blind denoising5
Hierarchical multiples self-attention mechanism for multi-modal analysis5
A content-style control network with style contrastive learning for underwater image enhancement5
A strong benchmark for yoga action recognition based on lightweight pose estimation model5
MSCA-Sp R-CNN: a segmentation algorithm for pneumonia small lesions integrating multi-scale channel attention and sub-pixel upsampling5
Video and image quality enhancement using an enhanced lower bound on transmission map dehazing technique5
Personalized music recommendation algorithm based on machine learning5
Feature fusion and optimization integrated refined deep residual network for diabetic retinopathy severity classification using fundus image5
A two-stage attention augmented fully convolutional network-based dynamic video summarization5
Image compression with learned lifting-based DWT and learned tree-based entropy models5
Exploring granularity-associated invariance features for text-to-image person re-identification5
GameScript: a simplified scripting language for video game development5
Food nutrition estimation with RGB-D fusion module and bidirectional feature pyramid network5
Dental radiology: a convolutional neural network-based approach to detect dental disorders from dental images in a real-time environment5
User authentication method based on keystroke dynamics and mouse dynamics using HDA5
An explainable stacked ensemble of deep learning models for improved melanoma skin cancer detection5
A review of computer vision-based approaches for physical rehabilitation and assessment5
Special issue on low complexity methods for multimedia security5
HDR-DANet: single HDR image reconstruction via dual attention5
Generative adversarial defense via conditional diffusion model5
Combating multimodal fake news on social media: methods, datasets, and future perspective4
An olfactory display for virtual reality glasses4
Low-parameter GAN inversion framework based on hypernetwork4
Multi-cue multi-hypothesis tracking with re-identification for multi-object tracking4
Prediction model using SMOTE, genetic algorithm and decision tree (PMSGD) for classification of diabetes mellitus4
CaDaCa: a new caching strategy in NDN using data categorization4
Blind quality evaluator for multi-exposure fusion image via joint sparse features and complex-wavelet statistical characteristics4
Model-based person identification in multi-gait scenario using hybrid classifier4
Research on multi-context aware recommendation methods based on tensor factorization4
Deep learning in multimedia healthcare applications: a review4
Visual transductive learning via iterative label correction4
An effective retrieval model for home textile images based on deep feature extraction4
Research on passengers behavior recognition method in public transport vehicles based on efficient 3D CNN4
Exploring coherence from heterogeneous representations for OCR image captioning4
A two-stage forgery detection and localization framework based on feature classification and similarity metric4
Context-aware and ethics-first crowd mobility portraits over massive smart card data4
Overcoming the practical restrictions in H.266/VVC-based video communication systems by a PI bit rate controller4
Special issue on deep learning methods for cyberbullying detection in multimodal social data4
Segmentation and recognition of filed sweet pepper based on improved self-attention convolutional neural networks4
Unbiased feature enhancement framework for cross-modality person re-identification4
Fusion of AI techniques to tackle COVID-19 pandemic: models, incidence rates, and future trends4
BCRA: bidirectional cross-modal implicit relation reasoning and aligning for text-to-image person retrieval4
KN-VLM: KNowledge-guided Vision-and-Language Model for visual abductive reasoning4
Multiple forgeries identification in digital video based on correlation consistency between entropy coded frames4
Closed-loop reasoning with graph-aware dense interaction for visual dialog4
Exploiting local detail in single image super-resolution via hypergraph convolution4
IGINet: integrating geometric information to enhance inter-modal interaction for fine-grained image captioning4
Unsupervised domain adaptation of dynamic extension networks based on class decision boundaries4
A deep learning-based framework for detecting COVID-19 patients using chest X-rays4
Fast bilateral filter with spatial subsampling4
Dual-focus: person search from Coarse-Grained Focus to Fine-Grained Focus4
An improvement for PDF417 code authentication on mobile phone terminals based on code feature analysis and watermarking4
A LiDAR point cloud registration method combining linear feature extraction and TrICP algorithm4
Cross-domain collaborative recommendation without overlapping entities based on domain adaptation4
Attention-guided LiDAR segmentation and odometry using image-to-point cloud saliency transfer4
Special issue deep learning for multimedia healthcare4
CR-DM: A novel craniofacial reconstruction framework based on diffusion model4
Spatial enhanced multi-level alignment learning for text-image person re-identification with coupled noisy labels4
Self-supervised graph clustering via attention auto-encoder with distribution specificity4
Editorial note for few-shot learning for intelligent multimedia systems4
Optimizing codebook training through control chart analysis4
CSLSEP: an ensemble pruning algorithm based on clustering soft label and sorting for facial expression recognition4
Image inpainting method based on AU-GAN4
A novel multiagent system for cervical motor control evaluation and individualized therapy: integrating gamification and portable solutions3
Comprehensive systematic review on virtual reality for cultural heritage practices: coherent taxonomy and motivations3
Contrastive graph clustering via enhanced hard sample mining and cluster-guiding3
Dynamic hand gesture recognition using combination of two-level tracker and trajectory-guided features3
NasmamSR: a fast image super-resolution network based on neural architecture search and multiple attention mechanism3
Self-expressive induced clustered attention for video-text retrieval3
DMFNet: deep matrix factorization network for image compressed sensing3
DFGPD: a new distillation framework with global and positional distillation3
High-strength synergic-calibration attention system in YOLO for underwater object detection application3
New performance measures for object tracking under complex environments3
Non-uniform circular-structured loss inspired by psychology for image emotion recognition3
Complementary spatiotemporal network for video question answering3
Deep learning and evolutionary intelligence with fusion-based feature extraction for detection of COVID-19 from chest X-ray images3
Hybrid features and semantic reinforcement network for image forgery detection3
A survey on deep learning-based camouflaged object detection3
Multi-scale motion contrastive learning for self-supervised skeleton-based action recognition3
Dark knowledge association guided hashing for unsupervised cross-modal retrieval3
Hierarchical bi-directional conceptual interaction for text-video retrieval3
Radar target recognition based on few-shot learning3
Scale-aware attention-based multi-resolution representation for multi-person pose estimation3
Correction to: Abusive language detection from social media comments using conventional machine learning and deep learning approaches3
Multi-modal cyber-aggression detection with feature optimization by firefly algorithm3
Dynamical semantic enhancement network for continuous sign language recognition3
0.072964191436768