IEEE Transactions on Circuits and Systems for Video Technology

Papers
(The TQCC of IEEE Transactions on Circuits and Systems for Video Technology is 17. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-08-01 to 2025-08-01.)
ArticleCitations
Table of Contents1170
IEEE Transactions on Circuits and Systems for Video Technology publication information287
IEEE Transactions on Circuits and Systems for Video Technology publication information271
2022 Index IEEE Transactions on Circuits and Systems for Video Technology Vol. 32267
Negative Class Guided Spatial Consistency Network for Sparsely Supervised Semantic Segmentation of Remote Sensing Images263
Active Spatial Positions Based Hierarchical Relation Inference for Group Activity Recognition262
Crowd-Powered Photo Enhancement Featuring an Active Learning Based Local Filter251
Plausible Proxy Mining With Credibility for Unsupervised Person Re-Identification222
MCCE-REC: MLLM-Driven Cross-Modal Contrastive Entropy Model for Zero-Shot Referring Expression Comprehension218
IEEE Transactions on Circuits and Systems for Video Technology Publication Information217
Highly-Parallel Hardwired Deep Convolutional Neural Network for 1-ms Dual-Hand Tracking215
DMRFlow: 4D Radar Scene Flow Estimation with Decoupled Matching and Refinement215
Pose-Guided Transformer for Fine-Grained Action Quality Assessment213
Learning Depth-Density Priors for Fourier-Based Unpaired Image Restoration209
Semi-Supervised Crowd Counting via Multi-Task Pseudo-Label Self-Correction Strategy190
CRP2-VCS: Contrast-Oriented Region-Based Progressive Probabilistic Visual Cryptography Schemes190
VPA: Multi-modal Virtual Point Augmentation for 3D Object Detection190
Robust Monocular Pose Tracking of Less-Distinct Objects Based on Contour-Part Model170
Scene Prior Constrained Self-Paced Learning for Unsupervised Satellite Video Vehicle Detection167
Guest Editorial Introduction to the Special Issue on Label-Efficient Learning on Video Data160
DSC3D: Deformable Sampling Constraints in Stereo 3D Object Detection for Autonomous Driving160
Table of Contents158
IEEE Circuits and Systems Society Information157
Representing Boundary-Ambiguous Scene Online With Scale-Encoded Cascaded Grids and Radiance Field Deblurring152
A Clinically Guided Graph Convolutional Network for Assessment of Parkinsonian Pronation-Supination Movements of Hands152
MMI-Det: Exploring Multi-Modal Integration for Visible and Infrared Object Detection151
Frequency Generation for Real-World Image Super-Resolution150
Relative Comparison-Based Consensus Learning for Multi-View Subspace Clustering148
PhyDAA: Physiological Dataset Assessing Attention144
Multi-Level Feature Fusion Network for Shadow Removal Detection140
EIFNet: An Explicit and Implicit Feature Fusion Network for Finger Vein Verification139
Stochastic Gradient Perturbation: An Implicit Regularizer for Person Re-Identification136
Dual-Stream Transformer With Distribution Alignment for Visible-Infrared Person Re-Identification134
Learning to Capture the Query Distribution for Few-Shot Learning133
Fuzzified Contrast Enhancement for Nearly Invisible Images132
Iterative Self-Guided Image Filtering130
VDTR: Video Deblurring With Transformer130
A Format Compliant Framework for HEVC Selective Encryption After Encoding130
UDTCWT-PHFMs Domain Statistical Image Watermarking Using Vector BW-Type R Distribution130
Pro-Tuning: Unified Prompt Tuning for Vision Tasks128
Spectral–Spatial Feature Extraction With Dual Graph Autoencoder for Hyperspectral Image Clustering124
Reversible Data Hiding in Encrypted Image via Secret Sharing Based on GF(p) and GF(2⁸)124
Instance-Incremental Scene Graph Generation From Real-World Point Clouds via Normalizing Flows123
Toward Meta-Shape-Based Multi-View 3D Point Cloud Registration: An Evaluation121
Cross-Level Multi-Modal Features Learning With Transformer for RGB-D Object Recognition118
Harmony: An Eco-Friendly Adaptive Rate Control Scheme for Video-on-Demand in Low Earth Orbit Satellite Internet118
Projected Generative Adversarial Network for Point Cloud Completion117
SARGAN: Spatial Attention-Based Residuals for Facial Expression Manipulation114
Uni3DA: Universal 3D Domain Adaptation for Object Recognition111
Convolutional Neural Networks for Omnidirectional Image Quality Assessment: A Benchmark110
Block Diagonal Graph Embedded Discriminative Regression for Image Representation108
Hierarchical Dynamic Programming Module for Human Pose Refinement106
Representation Robustness and Feature Expansion for Exemplar-Free Class-Incremental Learning105
Future Feature-Based Supervised Contrastive Learning for Streaming Perception104
Semantic-Aware Late-Stage Supervised Contrastive Learning for Fine-Grained Action Recognition102
RT3DHVC: A Real-Time Human Holographic Video Conferencing System With a Consumer RGB-D Camera Array102
Ct-LVI: A Framework Toward Continuous-Time Laser-Visual-Inertial Odometry and Mapping100
Learning Spatio-Temporal Sharpness Map for Video Deblurring100
Towards Video Anomaly Detection in the Real World: A Binarization Embedded Weakly-Supervised Network99
Fully Unsupervised Domain-Agnostic Image Retrieval99
Efficient Single-Object Tracker Based on Local-Global Feature Fusion99
Synergistic Fusion Network of Microscopic Hyperspectral and RGB Images for Multi-perspective Segmentation98
Viewport Prediction for Volumetric Video Streaming by Exploring Video Saliency and User Trajectory Information98
Relation-Aware Multi-Pass Comparison Deconfounded Network for Change Captioning97
Truncated Robust Natural Watermarking With Hungarian Optimization97
FastAL: Fast Evaluation Module for Efficient Dynamic Deep Active Learning Using Broad Learning System96
Push-and-Pull: A General Training Framework With Differential Augmentor for Domain Generalized Point Cloud Classification95
Scalable and Robust Tensor Ring Decomposition for Large-Scale Data With Missing Data and Outliers92
Dual Difficulty-aware Adaptive Pseudo Labeling for Semi-supervised CNV Segmentation92
Deep Affine Motion Compensation Network for Inter Prediction in VVC92
Robust Image Watermarking With Synchronization Using Template Enhanced-Extracted Network91
Exploring and Exploiting High-Order Spatial–Temporal Dynamics for Long-Term Frame Prediction90
Single Image Haze Removal With Haze Map Optimization for Various Haze Concentrations90
Joint Learning of Image Deblurring and Depth Estimation Through Adversarial Multi-Task Network90
Local Attention Transformer-Based Full-View Finger-Vein Identification90
SpiReco: Fast and Efficient Recognition of High-Speed Moving Objects With Spike Camera89
Video Understanding with Large Language Models: A Survey88
UAMD-Net: A Unified Adaptive Multimodal Neural Network for Dense Depth Completion87
Adversarial Dual-Student With Differentiable Spatial Warping for Semi-Supervised Semantic Segmentation86
Deep and Low-Rank Quaternion Priors for Color Image Processing86
Exploring Explicitly Disentangled Features for Domain Generalization85
Graph-Guided Unsupervised Multiview Representation Learning84
Learning Appearance-Motion Synergy via Memory-Guided Event Prediction for Video Anomaly Detection84
Multi-Modal Multi-Grained Embedding Learning for Generalized Zero-Shot Video Classification84
Reversible Data Hiding Over Encrypted Images via Preprocessing-Free Matrix Secret Sharing83
Image Super-Resolution With Self-Similarity Prior Guided Network and Sample-Discriminating Learning83
Edge and Skeleton Guidance Network for Salient Object Detection in Optical Remote Sensing Images83
Progressive Point Cloud Upsampling via Differentiable Rendering83
D3C2-Net: Dual-Domain Deep Convolutional Coding Network for Compressive Sensing82
Multi-Modal Attribute Prompting for Vision-Language Models82
Few-Shot Temporal Sentence Grounding via Memory-Guided Semantic Learning82
Equity in Unsupervised Domain Adaptation by Nuclear Norm Maximization81
Key Role Guided Transformer for Group Activity Recognition81
AirSOD: A Lightweight Network for RGB-D Salient Object Detection78
Lightweight Neural Network for Enhancing Imaging Performance of Under-Display Camera77
Enhancing Representation Learning With Spatial Transformation and Early Convolution for Reinforcement Learning-Based Small Object Detection77
CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation77
Low-Light Image Enhancement via Progressive-Recursive Network76
Spatial Attention-Guided Light Field Salient Object Detection Network With Implicit Neural Representation76
Reliable Entropy-Induced Anchor Learning for Incomplete Multi-View Subspace Clustering75
Efficient Non-Blind Image Deblurring with Discriminative Shrinkage Deep Networks75
IEEE Transactions on Circuits and Systems for Video Technology publication information74
IEEE Transactions on Circuits and Systems for Video Technology publication information73
IEEE Circuits and Systems Society Information73
Touchless Finger Vein and Fingerprint Verification via Exploiting Attention-Based Cross-Domain Fusion72
StreetSurfGS: Scalable Urban Street Surface Reconstruction with Planar-based Gaussian Splatting72
Online Unsupervised Video Object Segmentation via Contrastive Motion Clustering71
Flow-Edge Guided Unsupervised Video Object Segmentation70
Interlayer Restoration Deep Neural Network for Scalable High Efficiency Video Coding70
Surveillance Video-and-Language Understanding: From Small to Large Multimodal Models70
Texture-Aware Spherical Rotation for High Efficiency Omnidirectional Intra Video Coding69
MSGA-Net: Progressive Feature Matching via Multi-Layer Sparse Graph Attention69
A Novel Video Coding Strategy in HEVC for Object Detection69
Diverse Batch Steganography Using Model-Based Selection and Double-Layered Payload Assignment69
Efficient Selective Context Network for Accurate Object Detection68
Efficiently Exploiting Spatially Variant Knowledge for Video Deblurring68
Mesh2Animation: Unsupervised Animating for Quadruped 3D Objects67
WeaFU: Weather-Informed Image Blind Restoration via Multi-Weather Distribution Diffusion67
Monocular Depth Estimation on Adverse Weathers With Curriculum Domain Distribution Alignment66
Semantic Disentanglement Adversarial Hashing for Cross-Modal Retrieval66
Recent Advances in Rate Control: From Optimization to Implementation and Beyond66
Question-Aware Global-Local Video Understanding Network for Audio-Visual Question Answering65
FDAC: Federated Domain Adaptation via Dual Contrastive Learning65
FaceGCN: Structured Priors Inspired Graph Convolutional Networks for Face Restoration With Unknown Degradations65
G2LP-Net: Global to Local Progressive Video Inpainting Network64
Multi-Level Fusion and Attention-Guided CNN for Image Dehazing64
Unsupervised Deep Hashing With Fine-Grained Similarity-Preserving Contrastive Learning for Image Retrieval64
Fixing Defect of Photometric Loss for Self-Supervised Monocular Depth Estimation64
VSOIQE: A Novel Viewport-Based Stitched 360° Omnidirectional Image Quality Evaluator64
Compensating for the Incomplete with the Complete: An Efficient Scene Text Detector64
A Universal Framework for Improving the Robustness of Coverless Image Steganography Based on Image Restoration63
DiffVein: A Unified Diffusion Network for Finger Vein Segmentation and Authentication63
Inter-Scale Similarity Guided Cost Aggregation for Stereo Matching63
Flow Visualization for Complex Fluid Flows via A Structure-enhanced Motion Estimator62
MixSSC: Forward-Backward Mixture for Vision-Based 3D Semantic Scene Completion61
Depth Estimation From a Single Image of Blast Furnace Burden Surface Based on Edge Defocus Tracking61
OraL: An Observational Learning Paradigm for Unsupervised Hyperspectral Change Detection61
Table of Contents61
Dynamic Particle Filter Framework for Robust Object Tracking60
Optical Flow Reusing for High-Efficiency Space-Time Video Super Resolution60
Appearance Matters, So Does Audio: Revealing the Hidden Face via Cross-Modality Transfer60
Erratum to “Local-Global Temporal Difference Learning for Satellite Video Super-Resolution”60
An Efficient Algorithm for Generating Harmonized Stereoscopic 360° VR Images60
SMR: Spatial-Guided Model-Based Regression for 3D Hand Pose and Mesh Reconstruction59
Task-Specific Loss for Robust Instance Segmentation With Noisy Class Labels59
Conditional Dual Diffusion for Multimodal Clustering of Optical and SAR Images59
Forgery-Aware Adaptive Learning With Vision Transformer for Generalized Face Forgery Detection59
Multi-Scale Explicit Matching and Mutual Subject Teacher Learning for Generalizable Person Re-Identification59
Learning With Noisy Labels by Semantic and Feature Space Collaboration59
FDNet: Frequency Decomposition Network for Learned Image Compression59
Searching a Compact Architecture for Robust Multi-Exposure Image Fusion58
ImagingNet: A New Learnable SAR Imaging Method via Hierarchical U-shaped Network58
Balanced Teacher for Source-Free Object Detection58
Enhancing Robustness of Multi-Object Trackers With Temporal Feature Mix58
Target-Aware Tracking With Spatial-Temporal Context Attention58
DEP-Former: Multimodal Depression Recognition Based on Facial Expressions and Audio Features via Emotional Changes57
Self-Supervised Adversarial Video Summarizer With Context Latent Sequence Learning57
Boosting Semi-Supervised Face Recognition With Noise Robustness57
DAHP: Deep Attention-Guided Hashing With Pairwise Labels57
STAF: 3D Human Mesh Recovery From Video With Spatio-Temporal Alignment Fusion57
Table of Contents57
Meta-Learning Based Domain Prior With Application to Optical-ISAR Image Translation57
Low-Resolution Object Recognition With Cross-Resolution Relational Contrastive Distillation56
A Novel Deep Learning Framework for Automatic Recognition of Thyroid Gland and Tissues of Neck in Ultrasound Image56
Multi-Prior Driven Network for RGB-D Salient Object Detection56
Transformer-Based Multimodal Emotional Perception for Dynamic Facial Expression Recognition in the Wild55
Dynamic Hypergraph Convolutional Network for No-Reference Point Cloud Quality Assessment55
Enhanced Spatial-Temporal Salience for Cross-View Gait Recognition55
Low-Rank Tensor Graph Learning for Multi-View Subspace Clustering55
One for All: A Unified Generative Framework for Image Emotion Classification54
Blind Image Quality Index for Authentic Distortions With Local and Global Deep Feature Aggregation54
VmambaIR: Visual State Space Model for Image Restoration54
VVC In-Loop Filters54
Cloth-Imbalanced Gait Recognition via Hallucination54
All-Inclusive Image Enhancement for Degraded Images Exhibiting Low-Frequency Corruption54
Laplacian Pyramid Fusion Network With Hierarchical Guidance for Infrared and Visible Image Fusion54
Holistic Prototype Attention Network for Few-Shot Video Object Segmentation54
Robust Matrix Completion Based on Factorization and Truncated-Quadratic Loss Function54
Generative Image Steganography Based on Text-to-Image Multimodal Generative Model53
CNN-Transformer Based Generative Adversarial Network for Copy-Move Source/ Target Distinguishment53
Progressive Multi-Prompt learning for Vision-Language Models53
Table of Contents53
POS-Trends Dynamic-Aware Model for Video Caption53
TAKD: Target-Aware Knowledge Distillation for Remote Sensing Scene Classification53
Content-Adaptive Rate Control Method for User-Generated Content Videos52
MtArtGPT: A Multi-Task Art Generation System With Pre-Trained Transformer52
Learning Multi-View Stereo with Geometry-Aware Prior52
Feature Alignment in Anchor-Free Object Detection52
FastFace: Fast-converging Scheduler for Large-scale Face Recognition Training with One GPU52
Exploiting Global Camera Network Constraints for Unsupervised Video Person Re-Identification52
Generalized Intra-Camera Supervised Person Re-Identification52
Class Activation Map Calibration for Weakly Supervised Semantic Segmentation51
Efficient and Effective Nonconvex Low-Rank Subspace Clustering via SVT-Free Operators51
VideoPure: Diffusion-based Adversarial Purification for Video Recognition51
Spike Camera Image Reconstruction Using Deep Spiking Neural Networks51
Sampling Propagation Attention With Trimap Generation Network for Natural Image Matting51
Semantic-Context Graph Network for Point-Based 3D Object Detection51
Dual-Path Feature Aware Network for Remote Sensing Image Semantic Segmentation50
Glimpse and Zoom: Spatio-Temporal Focused Dynamic Network for Skeleton-Based Action Recognition50
Bi-Directional Progressive Guidance Network for RGB-D Salient Object Detection50
Curiosity-Driven Class-Incremental Learning via Adaptive Sample Selection50
A Snippets Relation and Hard-Snippets Mask Network for Weakly-Supervised Temporal Action Localization50
U²-Former: Nested U-Shaped Transformer for Image Restoration via Multi-View Contrastive Learning50
Real Image Denoising via Guided Residual Estimation and Noise Correction49
Learning Informative and Discriminative Features for Facial Expression Recognition in the Wild49
Toward Extreme Image Compression With Latent Feature Guidance and Diffusion Prior49
Multi-Scale Structural Graph Convolutional Network for Skeleton-Based Action Recognition49
Collaborative Multi-Dynamic Pattern Modeling for Human Motion Prediction49
Revisiting Modality-Specific Feature Compensation for Visible-Infrared Person Re-Identification49
Globally Deformable Information Selection Transformer for Underwater Image Enhancement48
Diffusion-Based Hypotheses Generation and Joint-Level Hypotheses Aggregation for 3D Human Pose Estimation48
IEEE Transactions on Circuits and Systems for Video Technology publication information48
A Pixel-Level Segmentation-Synthesis Framework for Dynamic Texture Video Compression48
LGTrack: Exploiting Local and Global Properties for Robust Visual Tracking48
Weakly-Supervised Temporal Action Localization by Progressive Complementary Learning48
Special Issue on Segment Anything for Videos and Beyond48
Dense Crosstalk Feature Aggregation for Classification and Localization in Object Detection48
Contrastive Learning With Enhancing Detailed Information for Pre-Training Vision Transformer48
A Novel Cross-Perturbation for Single Domain Generalization48
Concept-Enhanced Relation Network for Video Visual Relation Inference48
Locality-Adaptive Structured Dictionary Learning for Cross-Domain Recognition48
CodingHomo: Bootstrapping Deep Homography With Video Coding47
Corruption-Invariant Person Re-Identification via Coarse-to-Fine Feature Alignment47
DBVC: An End-to-End 3-D Deep Biomedical Video Coding Framework47
Deep Video Super-Resolution Using Hybrid Imaging System47
Diffusion-Based Depth Inpainting for Transparent and Reflective Objects47
M3CS: Multi-Target Masked Point Modeling with Learnable Codebook and Siamese Decoders47
UNeLF: Unconstrained Neural Light Field for Self-Supervised Angular Super-Resolution47
Surface-Continuous Scene Representation for Light Field Depth Estimation via Planarity Prior46
Generative Latent Coding for Ultra-Low Bitrate Image and Video Compression46
CFB-Then-ECB Mode-Based Image Encryption for an Efficient Correction of Noisy Encrypted Images46
Small Sample Image Segmentation by Coupling Convolutions and Transformers46
Learning Physical-Spatio-Temporal Features for Video Shadow Removal46
Enhancing Skeleton-Based Action Recognition With Language Descriptions From Pre-Trained Large Multimodal Models46
Flexible Temperature Parallel Distillation for Dense Object Detection: Make Response-Based Knowledge Distillation Great Again46
PCTrack: Accurate Object Tracking for Live Video Analytics on Resource-Constrained Edge Devices46
Exploiting Multiperspective Driven Hierarchical Content-Aware Network for Finger Vein Verification46
Exploring Relational Knowledge for Source-Free Domain Adaptation46
Neuron-Based Spiking Transmission and Reasoning Network for Robust Image-Text Retrieval45
CSTA: Spatial-Temporal Causal Adaptive Learning for Exemplar-Free Video Class-Incremental Learning45
Enhancing Transparent Object Matting Using Predicted Definite Foreground and Background45
Deep Sparse Representation Based Image Restoration With Denoising Prior45
Matching Multi-Scale Feature Sets in Vision Transformer for Few-Shot Classification45
CLSR: Cross-Layer Interaction Pyramid Super-Resolution Network45
Optical Flow-Based Spatiotemporal Sketch for Video Representation: A Novel Framework45
Complementary Blind-Spot Network for Self-Supervised Real Image Denoising44
Reference-Guided Large-Scale Face Inpainting With Identity and Texture Control44
Neuromorphic Imaging With Super-Resolution44
Dual Prototypes-Based Personalized Federated Adversarial Cross-Modal Hashing44
Knowledge-Based Visual Question Generation44
0.33574485778809