IEEE Transactions on Multimedia

Papers
(The H4-Index of IEEE Transactions on Multimedia is 62. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-05-01 to 2024-05-01.)
ArticleCitations
A Strong Baseline and Batch Normalization Neck for Deep Person Re-Identification334
Human Memory Update Strategy: A Multi-Layer Template Update Mechanism for Remote Visual Monitoring216
Low-Light Image Enhancement With Semi-Decoupled Decomposition187
Extended Feature Pyramid Network for Small Object Detection167
StrongSORT: Make DeepSORT Great Again162
AttentionFGAN: Infrared and Visible Image Fusion Using Attention-Based Generative Adversarial Networks157
MFDNet: Collaborative Poses Perception and Matrix Fisher Distribution for Head Pose Estimation147
3D Room Layout Estimation From a Single RGB Image136
Coarse-to-Fine CNN for Image Super-Resolution131
DSLR: Deep Stacked Laplacian Restorer for Low-Light Image Enhancement129
Automated Colorization of a Grayscale Image With Seed Points Propagation116
Beyond Triplet Loss: Person Re-Identification With Fine-Grained Difference-Aware Pairwise Loss115
Image-to-Image Translation: Methods and Applications114
Parameter Sharing Exploration and Hetero-Center Triplet Loss for Visible-Thermal Person Re-Identification108
SPA-GAN: Spatial Attention GAN for Image-to-Image Translation107
Learning Deep Multi-Level Similarity for Thermal Infrared Object Tracking103
Spatio-Temporal Attention Networks for Action Recognition and Detection100
Consensus Graph Learning for Multi-View Clustering100
Geometric Back-Projection Network for Point Cloud Classification99
Adaptive Graph Completion Based Incomplete Multi-View Clustering96
A Dilated Inception Network for Visual Saliency Prediction94
TBEFN: A Two-Branch Exposure-Fusion Network for Low-Light Image Enhancement92
Spatial-Temporal Cascade Autoencoder for Video Anomaly Detection in Crowded Scenes91
VehicleNet: Learning Robust Visual Representation for Vehicle Re-Identification88
CCAFNet: Crossflow and Cross-Scale Adaptive Fusion Network for Detecting Salient Objects in RGB-D Images83
STNReID: Deep Convolutional Networks With Pairwise Spatial Transformer Networks for Partial Person Re-Identification82
Jointly Learning Kernel Representation Tensor and Affinity Matrix for Multi-View Clustering81
Deep Multi-View Subspace Clustering With Unified and Discriminative Learning80
Food Recommendation: Framework, Existing Solutions, and Challenges79
Kernelized Multiview Subspace Analysis By Self-Weighted Learning79
Low-Rank Pairwise Alignment Bilinear Network For Few-Shot Fine-Grained Image Classification78
Image-Text Multimodal Emotion Classification via Multi-View Attentional Network78
An Improved Reversible Data Hiding in Encrypted Images Using Parametric Binary Tree Labeling76
SiamCorners: Siamese Corner Networks for Visual Tracking76
Real-Time and Accurate UAV Pedestrian Detection for Social Distancing Monitoring in COVID-19 Pandemic76
Stacked U-Shape Network With Channel-Wise Attention for Salient Object Detection76
ATMFN: Adaptive-Threshold-Based Multi-Model Fusion Network for Compressed Face Hallucination75
Multi-Level Policy and Reward-Based Deep Reinforcement Learning Framework for Image Captioning74
Multi-Channel Deep Networks for Block-Based Image Compressive Sensing72
Predicting the Perceptual Quality of Point Cloud: A 3D-to-2D Projection-Based Exploration72
Exploiting Temporal Contexts With Strided Transformer for 3D Human Pose Estimation71
Deep-IRTarget: An Automatic Target Detector in Infrared Imagery Using Dual-Domain Feature Extraction and Allocation70
Multi-View Multi-Label Learning With Sparse Feature Selection for Image Annotation70
YDTR: Infrared and Visible Image Fusion via Y-Shape Dynamic Transformer69
EHPE: Skeleton Cues-based Gaussian Coordinate Encoding for Efficient Human Pose Estimation69
Anti-Forensics for Face Swapping Videos via Adversarial Training68
Aggregation-Based Graph Convolutional Hashing for Unsupervised Cross-Modal Retrieval68
Multi-Level Correlation Adversarial Hashing for Cross-Modal Retrieval67
Learning Disentangled Representation Implicitly Via Transformer for Occluded Person Re-Identification66
Luminance-Aware Pyramid Network for Low-Light Image Enhancement66
An Automated and Robust Image Watermarking Scheme Based on Deep Neural Networks65
PointHop: An Explainable Machine Learning Method for Point Cloud Classification65
Deep Fusion Feature Representation Learning With Hard Mining Center-Triplet Loss for Person Re-Identification65
3D Face Reconstruction From A Single Image Assisted by 2D Face Images in the Wild65
A Recursive Reversible Data Hiding in Encrypted Images Method With a Very High Payload64
Interact as You Intend: Intention-Driven Human-Object Interaction Detection64
PixelRL: Fully Convolutional Network With Reinforcement Learning for Image Processing64
WSCNet: Weakly Supervised Coupled Networks for Visual Sentiment Classification and Detection63
MFFENet: Multiscale Feature Fusion and Enhancement Network For RGB–Thermal Urban Road Scene Parsing63
A Serial Image Copy-Move Forgery Localization Scheme With Source/Target Distinguishment62
Fast Intra Mode Decision Algorithm for Versatile Video Coding62
Illumination-Adaptive Person Re-Identification62
Uncertainty-Aware Unsupervised Domain Adaptation in Object Detection62
0.037294149398804