International Journal of Multimedia Information Retrieval

Papers
(The TQCC of International Journal of Multimedia Information Retrieval is 6. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-04-01 to 2025-04-01.)
ArticleCitations
A voting-based novel spatio-temporal fusion framework for video saliency using transfer learning mechanism329
DAABNet: depth-wise asymmetric attention bottleneck for real-time semantic segmentation87
Editorial: web of science and scopus impact in IJMIR44
How can users’ comments posted on social media videos be a source of effective tags?44
Detecting abnormal behavior in megastore for crime prevention using a deep neural architecture37
Multimodal music datasets? Challenges and future goals in music processing25
VERITE: a Robust benchmark for multimodal misinformation detection accounting for unimodal bias24
Style-aware adversarial pairwise ranking for image recommendation systems21
Enhancing the performance of 3D auto-correlation gradient features in depth action classification21
Stratified Graph Indexing for efficient search in deep descriptor databases17
Mual: enhancing multimodal sentiment analysis with cross-modal attention and difference loss15
End-to-end residual learning-based deep neural network model deployment for human activity recognition14
Visual and semantic ensemble for scene text recognition with gated dual mutual attention13
Similar interior coordination image retrieval with multi-view features12
Reinforcement learning applied to machine vision: state of the art11
Towards a high robust neural network via feature matching11
Correction to: Different techniques for Alzheimer’s disease classification using brain images: a study10
Your heart rate betrays you: multimodal learning with spatio-temporal fusion networks for micro-expression recognition10
Gender classification from face images using central difference convolutional networks9
An interactive attribute-preserving fashion recommendation with 3D image-based virtual try-on9
LG-MLFormer: local and global MLP for image captioning9
How does a kernel based on gradients of infinite-width neural networks come to be widely used: a review of the neural tangent kernel9
Improving skeleton-based action recognition with interactive object information8
Recent trends in recommender systems: a survey8
Advancements in machine learning techniques for threat item detection in X-ray images: a comprehensive survey8
CAMIR: fine-tuning CLIP and multi-head cross-attention mechanism for multimodal image retrieval with sketch and text features8
RGBD deep multi-scale network for background subtraction8
Optimized RT-DETR for accurate and efficient video object detection via decoupled feature aggregation8
Neural style transfer generative adversarial network (NST-GAN) for facial expression recognition8
Generative adversarial networks and its applications in the biomedical image segmentation: a comprehensive survey8
Ornament image retrieval using few-shot learning7
Caption TLSTMs: combining transformer with LSTMs for image captioning7
Maximizing mutual information inside intra- and inter-modality for audio-visual event retrieval7
Multi-modal emotion recognition using tensor decomposition fusion and self-supervised multi-tasking7
Video anomaly detection with memory-guided multilevel embedding7
A review on deep learning in medical image analysis7
State of art and emerging trends on group recommender system: a comprehensive review6
Dual-matrix guided reconstruction hashing for unsupervised cross-modal retrieval6
Multiple feedback based adversarial collaborative filtering with aesthetics6
3D skeleton-based human motion prediction using spatial–temporal graph convolutional network6
0.045791149139404