IEEE Transactions on Circuits and Systems for Video Technology

Papers
(The H4-Index of IEEE Transactions on Circuits and Systems for Video Technology is 88. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-01-01 to 2026-01-01.)
ArticleCitations
2022 Index IEEE Transactions on Circuits and Systems for Video Technology Vol. 32404
IEEE Transactions on Circuits and Systems for Video Technology Publication Information379
Table of Contents336
IEEE Transactions on Circuits and Systems for Video Technology publication information318
IEEE Transactions on Circuits and Systems for Video Technology publication information306
USVTrack: A Benchmark for Multi-Object Tracking in Complex Water Surface Scenes297
Unsupervised Action Segmentation via Multi-scale Temporal-interaction Enhancement296
Pose-Guided Transformer for Fine-Grained Action Quality Assessment279
Scene Prior Constrained Self-Paced Learning for Unsupervised Satellite Video Vehicle Detection252
Multi-Modal Multi-Grained Embedding Learning for Generalized Zero-Shot Video Classification249
Dual Difficulty-Aware Adaptive Pseudo Labeling for Semi-Supervised CNV Segmentation243
SpiReco: Fast and Efficient Recognition of High-Speed Moving Objects With Spike Camera228
Deep Affine Motion Compensation Network for Inter Prediction in VVC225
Representation Robustness and Feature Expansion for Exemplar-Free Class-Incremental Learning215
Highly-Parallel Hardwired Deep Convolutional Neural Network for 1-ms Dual-Hand Tracking211
Draw Like an Artist: Complex Scene Generation with Diffusion Model via Composition, Painting, and Retouching208
DS2VP: Dynamically-Selected Spatially Visual Prompting195
Table of Contents185
IEEE Circuits and Systems Society Information184
Guest Editorial Introduction to the Special Issue on Label-Efficient Learning on Video Data183
MEF-GD: Multimodal Enhancement and Fusion Network for Garment Designer174
A Format Compliant Framework for HEVC Selective Encryption After Encoding172
Push-and-Pull: A General Training Framework With Differential Augmentor for Domain Generalized Point Cloud Classification172
Toward Meta-Shape-Based Multi-View 3D Point Cloud Registration: An Evaluation170
Multi-Stage Cross-Modality Feature Interaction for RGB-Thermal Multi-Object Tracking165
Frequency Generation for Real-World Image Super-Resolution163
Filtering-and-Alternating-Calibration: Spatiotemporal Context Alternating Fusion for Event-based Monocular Depth Estimation162
Cross-Level Multi-Modal Features Learning With Transformer for RGB-D Object Recognition161
Scalable and Robust Tensor Ring Decomposition for Large-Scale Data With Missing Data and Outliers158
VDTR: Video Deblurring With Transformer152
UDTCWT-PHFMs Domain Statistical Image Watermarking Using Vector BW-Type R Distribution149
SARGAN: Spatial Attention-Based Residuals for Facial Expression Manipulation148
DMRFlow: 4D Radar Scene Flow Estimation With Decoupled Matching and Refinement147
FastAL: Fast Evaluation Module for Efficient Dynamic Deep Active Learning Using Broad Learning System145
FoV Prediction-Based Adaptive Bitrate Streaming with On-Demand Transcoding for 360-Degree Videos145
RT3DHVC: A Real-Time Human Holographic Video Conferencing System With a Consumer RGB-D Camera Array139
Block Diagonal Graph Embedded Discriminative Regression for Image Representation133
Convolutional Neural Networks for Omnidirectional Image Quality Assessment: A Benchmark132
CRP2-VCS: Contrast-Oriented Region-Based Progressive Probabilistic Visual Cryptography Schemes131
Dependability Feature Learning based on Sample Generation for Unsupervised Text-to-Image Person Re-identification130
Semantic-Aware Late-Stage Supervised Contrastive Learning for Fine-Grained Action Recognition130
Stochastic Gradient Perturbation: An Implicit Regularizer for Person Re-Identification129
Multi-Level Feature Fusion Network for Shadow Removal Detection127
Uni3DA: Universal 3D Domain Adaptation for Object Recognition125
Learning Spatio-Temporal Sharpness Map for Video Deblurring124
MCCE-REC: MLLM-Driven Cross-Modal Contrastive Entropy Model for Zero-Shot Referring Expression Comprehension120
Crowd-Powered Photo Enhancement Featuring an Active Learning Based Local Filter119
A Clinically Guided Graph Convolutional Network for Assessment of Parkinsonian Pronation-Supination Movements of Hands117
Efficient Single-Object Tracker Based on Local-Global Feature Fusion116
Negative Class Guided Spatial Consistency Network for Sparsely Supervised Semantic Segmentation of Remote Sensing Images114
Fully Unsupervised Domain-Agnostic Image Retrieval113
Harmony: An Eco-Friendly Adaptive Rate Control Scheme for Video-on-Demand in Low Earth Orbit Satellite Internet113
Lightweight Neural Network for Enhancing Imaging Performance of Under-Display Camera113
Joint Learning of Image Deblurring and Depth Estimation Through Adversarial Multi-Task Network113
PPIFuse: Physical Priors Injected Infrared and Visible Image Fusion112
Few-Shot Temporal Sentence Grounding via Memory-Guided Semantic Learning112
EIFNet: An Explicit and Implicit Feature Fusion Network for Finger Vein Verification109
SMART: Semantic Matching Contrastive Learning for Partially View-Aligned Clustering109
TPCM-SegNet: A Text-Prompted Dual-Path Convolution-Mamba Network for Anomaly Segmentation109
CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation108
Multi-Modal Attribute Prompting for Vision-Language Models108
Spatial Attention-Guided Light Field Salient Object Detection Network With Implicit Neural Representation105
Relation-Aware Multi-Pass Comparison Deconfounded Network for Change Captioning103
Semi-Supervised Crowd Counting via Multi-Task Pseudo-Label Self-Correction Strategy103
DSC3D: Deformable Sampling Constraints in Stereo 3D Object Detection for Autonomous Driving102
Edge and Skeleton Guidance Network for Salient Object Detection in Optical Remote Sensing Images102
VPA: Multi-Modal Virtual Point Augmentation for 3D Object Detection101
Subjective and Objective Quality Assessment of Display Content Videos101
Viewport Prediction for Volumetric Video Streaming by Exploring Video Saliency and User Trajectory Information101
Synergistic Fusion Network of Microscopic Hyperspectral and RGB Images for Multi-Perspective Segmentation101
Single Image Haze Removal With Haze Map Optimization for Various Haze Concentrations100
Enhancing Representation Learning With Spatial Transformation and Early Convolution for Reinforcement Learning-Based Small Object Detection99
Plausible Proxy Mining With Credibility for Unsupervised Person Re-Identification99
Projected Generative Adversarial Network for Point Cloud Completion98
Iterative Self-Guided Image Filtering97
Deep and Low-Rank Quaternion Priors for Color Image Processing97
Video Understanding with Large Language Models: A Survey96
Exploring and Exploiting High-Order Spatial–Temporal Dynamics for Long-Term Frame Prediction95
Graph-Guided Unsupervised Multiview Representation Learning94
Active Spatial Positions Based Hierarchical Relation Inference for Group Activity Recognition91
Towards Video Anomaly Detection in the Real World: A Binarization Embedded Weakly-Supervised Network90
Instance-Incremental Scene Graph Generation From Real-World Point Clouds via Normalizing Flows89
Adversarial Dual-Student With Differentiable Spatial Warping for Semi-Supervised Semantic Segmentation89
Reversible Data Hiding Over Encrypted Images via Preprocessing-Free Matrix Secret Sharing89
Truncated Robust Natural Watermarking With Hungarian Optimization88
ASCFormer: An Adaptive Strucure-aware Cascaded Transformer for 3D Object Detection88
Image Super-Resolution With Self-Similarity Prior Guided Network and Sample-Discriminating Learning88
AirSOD: A Lightweight Network for RGB-D Salient Object Detection88
Exploring Explicitly Disentangled Features for Domain Generalization88
0.14108300209045