IEEE Transactions on Circuits and Systems for Video Technology

Papers
(The TQCC of IEEE Transactions on Circuits and Systems for Video Technology is 16. The table below lists the papers above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2021-04-01 to 2025-04-01.)
Article | Citations
IEEE Circuits and Systems Society Information | 1025
Table of Contents | 232
E2BA: Environment Exploration and Backtracking Agent for Visual Language Object Navigation | 231
Facial Depression Estimation via Multi-Cue Contrastive Learning | 227
Semantic Concept Perception Network With Interactive Prompting for Cross-view Image Geo-localization | 222
RMGNet: The Progressive Relationship-Mining Graph Neural Network for Text-to-image Person Re-identification | 210
CRP2-VCS: Contrast-Oriented Region-based Progressive Probabilistic Visual Cryptography Schemes | 205
Graph Convolutional Mixture-of-Experts Learner Network for Long-Tailed Domain Generalization | 190
IEEE Transactions on Circuits and Systems for Video Technology Publication Information | 188
Fusion-then-Distillation: Toward Cross-modal Positive Distillation for Domain Adaptive 3D Semantic Segmentation | 186
Learning Comprehensive Representation via Selective Activation and Dual-Level Orthogonality for Pedestrian Attribute Recognition | 181
Task-aware Attentional Dynamic Alignment for Few-Shot Compressed Video Classification | 171
A Robust and Efficient Boundary Point Detection Method by Measuring Local Direction Dispersion | 169
DMRFlow: 4D Radar Scene Flow Estimation with Decoupled Matching and Refinement | 168
A Discrete Index Graph Diffusion Model for 3D Meshes Synthesis | 168
Kernel Reformulation with Deep Constrained Least Squares for Blind Image Super-Resolution | 168
Pose-Guided Transformer for Fine-Grained Action Quality Assessment | 163
RS+rPPG: Robust Strongly Self-Supervised Learning for rPPG | 155
Heterogeneous Spatial Quality for Omnidirectional Video | 152
Table of Contents | 145
DSC3D: Deformable Sampling Constraints in Stereo 3D Object Detection for Autonomous Driving | 140
Zero-Shot Object Counting With Vision-Language Prior Guidance Network | 138
High Efficiency Image Compression for Large Visual-Language Models | 136
Collaborative Aware Bidirectional Semantic Reasoning for Video Question Answering | 130
Visual-Semantic Alignment Temporal Parsing for Action Quality Assessment | 129
Mirror-Based Full-View Finger Vein Authentication With Illumination Adaptation | 128
Unifying Motion and Appearance Cues for Visual Tracking via Shared Queries | 126
Multi-Scale Spatial-Temporal Transformer for Meteorological Variable Forecasting | 126
Scene Prior Constrained Self-Paced Learning for Unsupervised Satellite Video Vehicle Detection | 125
Bootstrapping Audio-Visual Video Segmentation by Strengthening Audio Cues | 125
An End-to-End Network for Rotary Motion Deblurring in the Polar Coordinate System | 124
PAS-SLAM: A Visual SLAM System for Planar-Ambiguous Scenes | 123
Dual Degradation Representation for Joint Deraining and Low-Light Enhancement in the Dark | 120
Monocular Visual Pose Measurement for Autonomous Landing in Unknown Environments | 119
WaveFusion: A Novel Wavelet Vision Transformer with Saliency-Guided Enhancement for Multimodal Image Fusion | 115
IEEE Transactions on Circuits and Systems for Video Technology publication information | 114
Improving Misaligned Multi-Modality Image Fusion With One-Stage Progressive Dense Registration | 113
Stereoscopic Image Retargeting Based on Deep Convolutional Neural Network | 111
ReMarNet: Conjoint Relation and Margin Learning for Small-Sample Image Classification | 110
Domain-Invariant Prototypes for Semantic Segmentation | 107
Joint Contextual Representation Model-Informed Interpretable Network With Dictionary Aligning for Hyperspectral and LiDAR Classification | 106
Visual Anomaly Detection via Partition Memory Bank Module and Error Estimation | 105
Robust Unpaired Image Dehazing via Adversarial Deformation Constraint | 104
IEEE Transactions on Circuits and Systems for Video Technology publication information | 104
IEEE Transactions on Circuits and Systems for Video Technology publication information | 103
Intra Prediction and Mode Coding in VVC | 103
Revisiting Feature Fusion for RGB-T Salient Object Detection | 103
RGB-T Semantic Segmentation With Location, Activation, and Sharpening | 102
Multi-Relations Aware Network for In-the-Wild Facial Expression Recognition | 102
DRNet: Disentanglement and Recombination Network for Few-Shot Semantic Segmentation | 101
Reliable Entropy-induced Anchor Learning for Incomplete Multi-view Subspace Clustering | 99
SGFormer: Spherical Geometry Transformer for 360° Depth Estimation | 97
Content-Aware Dynamic In-loop Filter with Adjustable Complexity for VVC Intra Coding | 95
Tackling Real-world Complexity: Hierarchical Modeling and Dynamic Prompting for Multimodal Long Document Classification | 94
Blind Stereoscopic Omnidirectional Image Quality Assessment via a Binocular Viewport Hypergraph Convolutional Network | 94
IEEE Circuits and Systems Society Information | 94
Table of Contents | 93
IEEE Transactions on Circuits and Systems for Video Technology Publication Information | 91
A No-Reference Quality Assessment Model for Screen Content Videos via Hierarchical Spatiotemporal Perception | 90
Devil in Shadow: Attacking NIR-VIS Heterogeneous Face Recognition via Adversarial Shadow | 90
No-Reference Image Quality Assessment: Obtain MOS From Image Quality Score Distribution | 90
CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation | 90
Open World Object Detection: A Survey | 90
Learning Disentangled Representation for Multi-View 3D Object Recognition | 89
DFF-VIO: A General Dynamic Feature Fused Monocular Visual-Inertial Odometry | 88
2022 Index IEEE Transactions on Circuits and Systems for Video Technology Vol. 32 | 88
Reversible Data Hiding-Based Local Contrast Enhancement With Nonuniform Superpixel Blocks for Medical Images | 88
Table of Contents | 86
LMQFormer: A Laplace-Prior-Guided Mask Query Transformer for Lightweight Snow Removal | 85
HR-Net: A Landmark Based High Realistic Face Reenactment Network | 85
Learn by Oneself: Exploiting Weight-Sharing Potential in Knowledge Distillation Guided Ensemble Network | 85
Image Encryption via Complementary Embedding Algorithm and New Spatiotemporal Chaotic System | 84
AO2-DETR: Arbitrary-Oriented Object Detection Transformer | 83
Monocular Robust 3D Human Localization by Global and Body-Parts Depth Awareness | 82
HFF6D: Hierarchical Feature Fusion Network for Robust 6D Object Pose Tracking | 81
Zoom Transformer for Skeleton-Based Group Activity Recognition | 79
SpiReco: Fast and Efficient Recognition of High-Speed Moving Objects With Spike Camera | 78
Non-Uniform Illumination Underwater Image Restoration via Illumination Channel Sparsity Prior | 78
Local Attention Transformer-Based Full-View Finger-Vein Identification | 78
Hierarchical Dynamic Programming Module for Human Pose Refinement | 78
Reversible Data Hiding in Encrypted Images With Secret Sharing and Hybrid Coding | 78
FreqGAN: Infrared and Visible Image Fusion via Unified Frequency Adversarial Learning | 78
TPE for JPEG images with Dynamic M-ary Decomposition and Adaptive Threshold Constraints | 77
The Farther the Better: Balanced Stereo Matching via Depth-Based Sampling and Adaptive Feature Refinement | 77
InfoStyler: Disentanglement Information Bottleneck for Artistic Style Transfer | 77
MMMNet: An End-to-End Multi-Task Deep Convolution Neural Network With Multi-Scale and Multi-Hierarchy Fusion for Blind Image Quality Assessment | 77
CBASH: Combined Backbone and Advanced Selection Heads With Object Semantic Proposals for Weakly Supervised Object Detection | 77
Concept-Level Semantic Transfer and Context-Level Distribution Modeling for Few-Shot Segmentation | 77
Semi-Supervised Image Deraining Using Knowledge Distillation | 77
Table of contents | 76
IEEE Transactions on Circuits and Systems for Video Technology publication information | 76
Table of Contents | 75
IEEE Transactions on Circuits and Systems for Video Technology publication information | 74
IEEE Transactions on Circuits and Systems for Video Technology publication information | 74
2021 Index IEEE Transactions on Circuits and Systems for Video Technology Vol. 31 | 74
Table of contents | 73
IEEE Transactions on Circuits and Systems for Video Technology publication information | 71
Call for IEEE T-CSVT Associate Editors Nomination | 71
IEEE Transactions on Circuits and Systems for Video Technology publication information | 70
IEEE Transactions on Circuits and Systems for Video Technology publication information | 70
IEEE Transactions on Circuits and Systems for Video Technology publication information | 70
Table of Contents | 69
Table of Contents | 69
Table of Contents | 69
Table of Contents | 68
Towards Quality of Experience for AI-generated Video | 68
Asynchronous Multimodal Video Sequence Fusion via Learning Modality-Exclusive and -Agnostic Representations | 66
Generative Probabilistic Entropy Modeling With Conditional Diffusion for Learned Image Compression | 66
IEEE Circuits and Systems Society Information | 66
Table of Contents | 66
Foreground-Background Parallel Compression With Residual Encoding for Surveillance Video | 66
Learning Depth-Density Priors for Fourier-Based Unpaired Image Restoration | 65
Self-Supervised Exclusive-Inclusive Interactive Learning for Multi-Label Facial Expression Recognition in the Wild | 65
Hierarchical Multi-Modal Attention Network for Time-Sync Comment Video Recommendation | 65
Concept Parser With Multimodal Graph Learning for Video Captioning | 64
Cross-Domain Transfer Hashing for Efficient Cross-Modal Retrieval | 64
Learning Appearance-Motion Synergy via Memory-Guided Event Prediction for Video Anomaly Detection | 64
PLOVAD: Prompting Vision-Language Models for Open Vocabulary Video Anomaly Detection | 63
Guest Editorial Introduction to the Special Issue on Label-Efficient Learning on Video Data | 63
High Dynamic Range Imaging for Dynamic Scenes Based on Multi-Level Spike Camera | 62
Semantic-Aware Late-Stage Supervised Contrastive Learning for Fine-Grained Action Recognition | 62
ConPCAC: Conditional Lossless Point Cloud Attribute Compression via Spatial Decomposition | 61
Frame-by-Frame Multi-object Tracking-Guided Video Captioning | 61
Deep Discriminative Multi-view Clustering | 61
ResiComp: Loss-Resilient Image Compression via Dual-Functional Masked Visual Token Modeling | 61
A Unified Open Adapter for Open-World Noisy Label Learning: Data-Centric and Learning-Based Insights | 60
Deepface-based chaotic image encryption using key optimization and semi-tensor product theory | 60
Scene-Modulated High-Order Statistical Representation Learning for No-Reference Super-Resolution Image Quality Assessment | 60
Atmospheric Scattered Light Field Sampling for Improving Reconstruction Efficiency | 60
High Efficiency Wiener Filter-based Point Cloud Quality Enhancement for MPEG G-PCC | 59
Distributed Learning for Privacy-Preserving Semi-Supervised Video Anomaly Detection | 59
PRFormer: Matching Proposal and Reference Masks by Semantic and Spatial Similarity for Few-Shot Semantic Segmentation | 59
A Novel Dense Object Detector with Scale Balanced Sample Assignment and Refinement | 58
Partial Alignment for Object Detection in the Wild | 58
Table of Contents | 57
Spectral–Spatial Feature Extraction With Dual Graph Autoencoder for Hyperspectral Image Clustering | 57
RAPT360: Reinforcement Learning-Based Rate Adaptation for 360-Degree Video Streaming With Adaptive Prediction and Tiling | 57
Quality of Experience Oriented Cross-Layer Optimization for Real-Time XR Video Transmission | 57
An Interpretable Fusion Siamese Network for Multi-Modality Remote Sensing Ship Image Retrieval | 57
Tensorial Multi-View Clustering via Low-Rank Constrained High-Order Graph Learning | 56
Weakly Supervised Learning for Raindrop Removal on a Single Image | 56
Enhancing Cross-View Geo-Localization With Domain Alignment and Scene Consistency | 56
Table of Contents | 56
NLIC: Non-Uniform Quantization-Based Learned Image Compression | 56
Self-Attention Memory-Augmented Wavelet-CNN for Anomaly Detection | 55
Overview and Efficiency of Decoder-Side Depth Estimation in MPEG Immersive Video | 55
Ownership Verification of DNN Architectures via Hardware Cache Side Channels | 55
Large-Scale Crowdsourced Subjective Assessment of Picturewise Just Noticeable Difference | 55
DRAKE: Deep Pair-Wise Relation Alignment for Knowledge-Enhanced Multimodal Scene Graph Generation in Social Media Posts | 55
IEEE Circuits and Systems Society Information | 55
A High Voltage Driving Chiplet in Standard 0.18-μm CMOS for Micro-Pixelated LED Displays Integrated With LTPS TFTs | 54
Extending Momentum Contrast With Cross Similarity Consistency Regularization | 54
SNP-S3: Shared Network Pre-Training and Significant Semantic Strengthening for Various Video-Text Tasks | 54
Learning Transferable and Discriminative Representations for 2D Image-Based 3D Model Retrieval | 54
Auto-Perceiving Correlation Filter for UAV Tracking | 54
Guest Editorial Special Section on Learning With Multimodal Data for Biomedical Informatics | 54
Multibranch Adversarial Regression for Domain Adaptative Hand Pose Estimation | 54
AAGAN: Accuracy-Aware Generative Adversarial Network for Supervised Tasks | 54
Implicit Motion-Compensated Network for Unsupervised Video Object Segmentation | 54
Variational Hyperparameter Inference for Few-Shot Learning Across Domains | 53
Discrete Joint Semantic Alignment Hashing for Cross-Modal Image-Text Search | 52
Vector-Based Efficient Data Hiding in Encrypted Images via Multi-MSB Replacement | 52
Convolutional Neural Networks for Omnidirectional Image Quality Assessment: A Benchmark | 52
RSDet++: Point-Based Modulated Loss for More Accurate Rotated Object Detection | 51
Pseudo Multi-Port SRAM Circuit for Image Processing in Display Drivers | 51
Joint Sample Enhancement and Instance-Sensitive Feature Learning for Efficient Person Search | 51
Accelerated PALM for Nonconvex Low-Rank Matrix Recovery With Theoretical Analysis | 51
MPAI-EEV: Standardization Efforts of Artificial Intelligence Based End-to-End Video Coding | 50
DS-Depth: Dynamic and Static Depth Estimation via a Fusion Cost Volume | 50
Reference-Guided Landmark Image Inpainting With Deep Feature Matching | 50
Improving the Post-Training Neural Network Quantization by Prepositive Feature Quantization | 50
Boosting Robust Multi-Focus Image Fusion With Frequency Mask and Hyperdimensional Computing | 50
Deep Metric Learning on the SPD Manifold for Image Set Classification | 50
Multi-View Maximum Margin Clustering With Privileged Information Learning | 50
Learning Hadamard-Product-Propagation for Image Dehazing and Beyond | 50
VDTR: Video Deblurring With Transformer | 49
Coherent Visual Storytelling via Parallel Top-Down Visual and Topic Attention | 49
Uni3DA: Universal 3D Domain Adaptation for Object Recognition | 49
Graph-Guided Unsupervised Multiview Representation Learning | 49
Learnable Spatial-Spectral Transform-Based Tensor Nuclear Norm for Multi-Dimensional Visual Data Recovery | 49
Appearance-and-Dynamic Learning With Bifurcated Convolution Neural Network for Action Recognition | 49
A High-Performance Robust Reversible Data Hiding Algorithm Based on Polar Harmonic Fourier Moments | 49
Camera Pose-Based Background Modeling for Video Coding in Moving Cameras | 48
Fine-Grained Instance-Level Sketch-Based Video Retrieval | 48
Self-Supervised Blind Image Deconvolution via Deep Generative Ensemble Learning | 48
Graph Regularized and Feature Aware Matrix Factorization for Robust Incomplete Multi-View Clustering | 48
Closed-Form Solution of Principal Line for Camera Calibration Based on Orthogonal Vanishing Points | 48
Comment-Guided Semantics-Aware Image Aesthetics Assessment | 48
A Robust Quality Enhancement Method Based on Joint Spatial-Temporal Priors for Video Coding | 47
Cryptanalysis of Image Ciphers With Permutation-Substitution Network and Chaos | 47
Scale-Balanced Real-Time Object Detection With Varying Input-Image Resolution | 47
Quality Assessment of UGC Videos Based on Decomposition and Recomposition | 47
Diverse Feature Learning Network With Attention Suppression and Part Level Background Suppression for Person Re-Identification | 47
Structured and Consistent Multi-Layer Multi-Kernel Subtask Correction Filter Tracker | 47
A Clinically Guided Graph Convolutional Network for Assessment of Parkinsonian Pronation-Supination Movements of Hands | 47
Consistent Intra-Video Contrastive Learning With Asynchronous Long-Term Memory Bank | 46
Break the Bias: Delving Semantic Transform Invariance for Few-Shot Segmentation | 46
Food and Ingredient Joint Learning for Fine-Grained Recognition | 46
Filter Clustering for Compressing CNN Model With Better Feature Diversity | 46
Diagonal XOR-Based FEC Method to Improve Burst-Loss Tolerance for 4K/8K UHDTV Transmission | 46
A Detail-Aware Transformer to Generalisable Face Forgery Detection | 45
Towards Long Video Understanding via Fine-detailed Video Story Generation | 45
Correlation Filters for UAV Online Tracking Based on Complementary Appearance Model and Reversibility Reasoning | 45
A Format Compliant Framework for HEVC Selective Encryption After Encoding | 45
COCAS+: Large-Scale Clothes-Changing Person Re-Identification With Clothes Templates | 45
Adaptive Context Reading Network for Movie Scene Detection | 45
Fast Adapting Without Forgetting for Face Recognition | 45
Pro-Tuning: Unified Prompt Tuning for Vision Tasks | 45
Multi-Camera Color Correction via Hybrid Histogram Matching | 45
TCTL-Net: Template-Free Color Transfer Learning for Self-Attention Driven Underwater Image Enhancement | 45
Cross Time-Frequency Transformer for Temporal Action Localization | 44
Harmony: An Eco-friendly Adaptive Rate Control Scheme for Video-on-Demand in Low Earth Orbit Satellite Internet | 44
Improving Video Moment Retrieval by Auxiliary Moment-Query Pairs with Hyper-Interaction | 44
Efficient Neural Image Decoding via Fixed-Point Inference | 44
Improving Cross-Modal Image-Text Retrieval With Teacher-Student Learning | 44
StegMamba: Distortion-free Immune-Cover for Multi-Image Steganography with State Space Model | 44
Uncertainty-Aware Label Refinement on Hypergraphs for Personalized Federated Facial Expression Recognition | 44
Temporal Consistency Learning of Inter-Frames for Video Super-Resolution | 44
Augmented Queue-Based Transmission and Transcoding Optimization for Livecast Services Based on Cloud-Edge-Crowd Integration | 43
Simultaneous Learning Intensity and Optical Flow from High-speed Spike Stream | 43
Edge-Aware Correlation Learning for Unsupervised Progressive Homography Estimation | 43
Tri-AFLLM: Resource-Efficient Adaptive Asynchronous Accelerated Federated LLMs | 43
BAFN: Bi-Direction Attention Based Fusion Network for Multimodal Sentiment Analysis | 43
Hierarchical Frequency-based Upsampling and Refining for HEVC Compressed Video Enhancement | 43
Fully Unsupervised Domain-Agnostic Image Retrieval | 43
DGECN++: A Depth-Guided Edge Convolutional Network for End-to-End 6D Pose Estimation via Attention Mechanism | 43
Rules for Expectation: Learning to Generate Rules via Social Environment Modeling | 42
Quality-Driven Variable Frame-Rate for Green Video Coding in Broadcast Applications | 42
Semantic Scene Completion via Semantic-aware Guidance and Interactive Refinement Transformer | 42
Adversarial Analysis for Source Camera Identification | 42
Heterogeneous Generative Tokens and Distance-aware Recovery Network for Occluded Person Re-identification | 42
Motion Compression Using Structurally Connected Neural Network | 42
Repeatable Data Hiding: Towards the Reusability of Digital Images | 41
Texture Brush for Fashion Inspiration Transfer: A Generative Adversarial Network With Heatmap-Guided Semantic Disentanglement | 41
Ct-LVI: A Framework Toward Continuous-Time Laser-Visual-Inertial Odometry and Mapping | 41
Jointly-Learnt Networks for Future Action Anticipation via Self-Knowledge Distillation and Cycle Consistency | 41
RT3DHVC: A Real-time Human Holographic Video Conferencing System with a Consumer RGB-D Camera Array | 41
Plausible Proxy Mining With Credibility for Unsupervised Person Re-Identification | 41
Spatial Quality Oriented Rate Control for Volumetric Video Streaming via Deep Reinforcement Learning | 41
Guest Editorial Introduction to the Special Issue on Advanced Machine Learning Methodologies for Large-Scale Video Object Segmentation and Detection | 41
Few-Shot Temporal Sentence Grounding via Memory-Guided Semantic Learning | 40
Bridging the Gap Between Voltage Over-Scaling and Joint Hardware Accelerator-Algorithm Closed-Loop | 40
Multimodal Emotional Talking Face Generation based on Action Units | 40
Bi-Directional and Triangular Circulation Fusion Neural Networks for Small Object Detection | 40
Robust Monocular Pose Tracking of Less-Distinct Objects Based on Contour-Part Model | 40
On Understanding of Spatiotemporal Prediction Model | 40
Information Gap Narrowing for Point Cloud Few-Shot Segmentation | 40
Sparse Point Clouds Assisted Learned Image Compression | 40
Rate Control for Predictive Transform Screen Content Video Coding Based on RANSAC | 40
Multi-Source Video Domain Adaptation With Temporal Attentive Moment Alignment Network | 39