OOIR: Observatory of International Research

Papers

(The H4-Index of IEEE Transactions on Circuits and Systems for Video Technology is 85. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-06-01 to 2026-06-01.)

Article	Citations
2022 Index IEEE Transactions on Circuits and Systems for Video Technology Vol. 32	541
IEEE Transactions on Circuits and Systems for Video Technology Publication Information	474
Table of Contents	351
IEEE Transactions on Circuits and Systems for Video Technology publication information	340
IEEE Transactions on Circuits and Systems for Video Technology publication information	316
Multi-Modal Multi-Grained Embedding Learning for Generalized Zero-Shot Video Classification	263
SARGAN: Spatial Attention-Based Residuals for Facial Expression Manipulation	247
DMRFlow: 4D Radar Scene Flow Estimation With Decoupled Matching and Refinement	238
Joint Learning of Image Deblurring and Depth Estimation Through Adversarial Multi-Task Network	226
Table of Contents	219
IEEE Circuits and Systems Society Information	213
Guest Editorial Introduction to the Special Issue on Label-Efficient Learning on Video Data	204
Harmony: An Eco-Friendly Adaptive Rate Control Scheme for Video-on-Demand in Low Earth Orbit Satellite Internet	203
Synergistic Fusion Network of Microscopic Hyperspectral and RGB Images for Multi-Perspective Segmentation	182
RT3DHVC: A Real-Time Human Holographic Video Conferencing System With a Consumer RGB-D Camera Array	177
Convolutional Neural Networks for Omnidirectional Image Quality Assessment: A Benchmark	175
CRP2-VCS: Contrast-Oriented Region-Based Progressive Probabilistic Visual Cryptography Schemes	170
Stochastic Gradient Perturbation: An Implicit Regularizer for Person Re-Identification	165
Toward Meta-Shape-Based Multi-View 3D Point Cloud Registration: An Evaluation	156
Relative Comparison-Based Consensus Learning for Multi-View Subspace Clustering	155
Representation Robustness and Feature Expansion for Exemplar-Free Class-Incremental Learning	155
SpiReco: Fast and Efficient Recognition of High-Speed Moving Objects With Spike Camera	154
DSC3D: Deformable Sampling Constraints in Stereo 3D Object Detection for Autonomous Driving	152
Viewport Prediction for Volumetric Video Streaming by Exploring Video Saliency and User Trajectory Information	149
TPCM-SegNet: A Text-Prompted Dual-Path Convolution-Mamba Network for Anomaly Segmentation	148

Filtering and Alternating Calibration: Spatiotemporal Context Alternating Fusion for Event-Based Monocular Depth Estimation	144
FoV Prediction-Based Adaptive Bitrate Streaming With On-Demand Transcoding for 360° Videos	143
LiveMatte: Dynamic Scene Background Restoration and Selective Portrait Patch Enhancement	141
DiffPixelFormer: Differential Pixel-Aware Transformer for RGB-D Indoor Scene Segmentation	140
Draw Like an Artist: Complex Scene Generation With Diffusion Model via Composition, Painting, and Retouching	140
Exploring and Exploiting High-Order Spatial–Temporal Dynamics for Long-Term Frame Prediction	137
USVTrack: A Benchmark for Multi-Object Tracking in Complex Water Surface Scenes	134
Projected Generative Adversarial Network for Point Cloud Completion	133
Few-Shot Temporal Sentence Grounding via Memory-Guided Semantic Learning	128
Scene Prior Constrained Self-Paced Learning for Unsupervised Satellite Video Vehicle Detection	128
Semi-Supervised Crowd Counting via Multi-Task Pseudo-Label Self-Correction Strategy	126
Instance-Incremental Scene Graph Generation From Real-World Point Clouds via Normalizing Flows	126
Future Feature-Based Supervised Contrastive Learning for Streaming Perception	125
Semantic-Aware Late-Stage Supervised Contrastive Learning for Fine-Grained Action Recognition	125
UAMD-Net: A Unified Adaptive Multimodal Neural Network for Dense Depth Completion	120
Unsupervised Action Segmentation via Multi-Scale Temporal-Interaction Enhancement	120
Phase-Guided Cross-Frequency Integration Network for ISAR and Optical Image Fusion	119
ProMoT: Progressive Prompting of Modality and Temporal Dynamics for RGB-T Tracking	117
Scalable and Robust Tensor Ring Decomposition for Large-Scale Data With Missing Data and Outliers	117
Dependability Feature Learning Based on Sample Generation for Unsupervised Text-to-Image Person Re-Identification	117
Graph-Guided Unsupervised Multiview Representation Learning	117
Ct-LVI: A Framework Toward Continuous-Time Laser-Visual-Inertial Odometry and Mapping	116
A Format Compliant Framework for HEVC Selective Encryption After Encoding	116
Reconstructing Sparse-view Indoor Scenes in View Space with Global Monocular Prior Alignment	114
CLIP-Based Class Incremental Semantic Segmentation Framework with Generalization-Preserving Knowledge Distillation	113
NDM: Boosting Dataset Distillation via Nested Difficulty Matching	113
Boosting Video Object Segmentation with Discriminative Core Features and Adaptive Position Refinement	111
Efficient Single-Object Tracker Based on Local-Global Feature Fusion	109
Hierarchical Dynamic Programming Module for Human Pose Refinement	109
Multi-Stage Cross-Modality Feature Interaction for RGB-Thermal Multi-Object Tracking	109
VPA: Multi-Modal Virtual Point Augmentation for 3D Object Detection	107
Enhancing Representation Learning With Spatial Transformation and Early Convolution for Reinforcement Learning-Based Small Object Detection	107
MPCF: Multi-Phase Consolidated Fusion for Multi-Modal 3D Object Detection with Pseudo Point Cloud	104
Universal Immunized Cover Construction for Secure Adaptive Steganography across Multiple Domains	103
DS ² VP: Dynamically-Selected Spatially Visual Prompting	102
Local Attention Transformer-Based Full-View Finger-Vein Identification	101
Lossless Dynamic Point Cloud Geometry Compression via Rate-Distortion Optimized Motion Estimation	99
Plausible Proxy Mining With Credibility for Unsupervised Person Re-Identification	98
EIFNet: An Explicit and Implicit Feature Fusion Network for Finger Vein Verification	98
FastAL: Fast Evaluation Module for Efficient Dynamic Deep Active Learning Using Broad Learning System	97
Morphology-Guided Muscle Cell Detection & Counting based on Transfer Learning, FFD Augmentation and Density-Aware Loss Optimization	97
Crowd-Powered Photo Enhancement Featuring an Active Learning Based Local Filter	96
Dual Difficulty-Aware Adaptive Pseudo Labeling for Semi-Supervised CNV Segmentation	96
PPIFuse: Physical Priors Injected Infrared and Visible Image Fusion	95
Spectral–Spatial Feature Extraction With Dual Graph Autoencoder for Hyperspectral Image Clustering	95
Block Diagonal Graph Embedded Discriminative Regression for Image Representation	95
DP-Retinex: Dual-Prior Guided Low-Light Image Enhancement With YUV-Domain Reflectance-Illumination Decomposition	94
Uni3DA: Universal 3D Domain Adaptation for Object Recognition	92
Reversible Data Hiding Over Encrypted Images via Preprocessing-Free Matrix Secret Sharing	92
Reliable Entropy-Induced Anchor Learning for Incomplete Multi-View Subspace Clustering	89

Fully Unsupervised Domain-Agnostic Image Retrieval	88
Semantic Boosting via Knowledge Sharing and Feedback for Video Anomaly Detection	88
Learning Spatio-Temporal Sharpness Map for Video Deblurring	88
Robust Image Watermarking With Synchronization Using Template Enhanced-Extracted Network	88
Video-to-Task Learning via Motion-Guided Attention for Few-Shot Action Recognition	88
Push-and-Pull: A General Training Framework With Differential Augmentor for Domain Generalized Point Cloud Classification	87
MEF-GD: Multimodal Enhancement and Fusion Network for Garment Designer	86
Iterative Self-Guided Image Filtering	86
Edge and Skeleton Guidance Network for Salient Object Detection in Optical Remote Sensing Images	86
Key Role Guided Transformer for Group Activity Recognition	85
Spatial Attention-Guided Light Field Salient Object Detection Network With Implicit Neural Representation	85
Representing Boundary-Ambiguous Scene Online With Scale-Encoded Cascaded Grids and Radiance Field Deblurring	85