IEEE Transactions on Parallel and Distributed Systems

Papers
(The H4-Index of IEEE Transactions on Parallel and Distributed Systems is 50. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-08-01 to 2025-08-01.)
ArticleCitations
Critique of “MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization” by SCC Team From Tsinghua University289
Jdebug: A Fast, Non-Intrusive and Scalable Fault Locating Tool for Ten-Million-Scale Parallel Applications227
Design and Implementation of 2D Convolution on x86/x64 Processors222
Replicated Versioned Data Structures for Wide-Area Distributed Systems173
A Point Cloud Video Recognition Acceleration Framework Based on Tempo-Spatial Information161
HRCM: A Hierarchical Regularizing Mechanism for Sparse and Imbalanced Communication in Whole Human Brain Simulations146
Distributed Task Processing Platform for Infrastructure-Less IoT Networks: A Multi-Dimensional Optimization Approach144
An Efficient Bottleneck Planes Exclusion Method for Reconfiguring 3D VLSI Arrays106
H5Intent: Autotuning HDF5 With User Intent105
GeoScale: Microservice Autoscaling With Cost Budget in Geo-Distributed Edge Clouds98
On the Message Complexity of Fault-Tolerant Computation: Leader Election and Agreement92
IRHunter: Universal Detection of Instruction Reordering Vulnerabilities for Enhanced Concurrency in Distributed and Parallel Systems92
AWB+-Tree: A Novel Width-Based Index Structure Supporting Hybrid Matching for Large-Scale Content-Based Pub/Sub Systems86
EdgeTB: A Hybrid Testbed for Distributed Machine Learning at the Edge With High Fidelity84
Federated Learning With Nesterov Accelerated Gradient84
QoS-Aware Scheduling of Remote Rendering for Interactive Multimedia Applications in Edge Computing83
Improving I/O Performance for Exascale Applications Through Online Data Layout Reorganization83
Joint Task Scheduling and Containerizing for Efficient Edge Computing82
Building High-throughput Neural Architecture Search Workflows via a Decoupled Fitness Prediction Engine79
A Memory-Constraint-Aware List Scheduling Algorithm for Memory-Constraint Heterogeneous Muti-Processor System79
Enabling Large Scale Simulations for Particle Accelerators77
STR: Hybrid Tensor Re-Generation to Break Memory Wall for DNN Training75
Coflow Scheduling in Data Centers: Routing and Bandwidth Allocation71
Critique of “Planetary Normal Mode Computation: Parallel Algorithms, Performance, and Reproducibility” by SCC Team From University of Washington70
Simple, Fast and Widely Applicable Concurrent Memory Reclamation via Neutralization70
Graph-Centric Performance Analysis for Large-Scale Parallel Applications70
Burst Load Evacuation Based on Dispatching and Scheduling In Distributed Edge Networks69
Accelerating Data Delivery of Latency-Sensitive Applications in Container Overlay Network67
A Pessimistic Fault Diagnosability of Large-Scale Connected Networks via Extra Connectivity67
Securing Fine-Grained Data Sharing and Erasure in Outsourced Storage Systems66
Error-Compensated Sparsification for Communication-Efficient Decentralized Training in Edge Environment64
GreenFlow: A Carbon-Efficient Scheduler for Deep Learning Workloads63
DyLaClass: Dynamic Labeling Based Classification for Optimal Sparse Matrix Format Selection in Accelerating SpMV63
Building Accurate and Interpretable Online Classifiers on Edge Devices63
High-Level Data Abstraction and Elastic Data Caching for Data-Intensive AI Applications on Cloud-Native Platforms62
Asynchronous Algorithms for Decentralized Resource Allocation Over Directed Networks59
Tag-Sharer-Fusion Directory: A Scalable Coherence Directory With Flexible Entry Formats59
CiMBA: Accelerating Genome Sequencing Through On-Device Basecalling via Compute-in-Memory58
Improved MPC Algorithms for Edit Distance and Ulam Distance58
BARM: A Batch-Aware Resource Manager for Boosting Multiple Neural Networks Inference on GPUs With Memory Oversubscription56
A Novel Parallel Algorithm for Sparse Tensor Matrix Chain Multiplication via TCU-Acceleration55
RHINO: An Efficient Serverless Container System for Small-Scale HPC Applications55
AESM2 Attribute-Based Encrypted Search for Multi-Owner and Multi-User Distributed Systems55
Coordinating Fast Concurrency Adapting With Autoscaling for SLO-Oriented Web Applications54
Agile Cache Replacement in Edge Computing via Offline-Online Deep Reinforcement Learning53
LB-Chain: Load-Balanced and Low-Latency Blockchain Sharding via Account Migration53
Joint Model Pruning and Topology Construction for Accelerating Decentralized Machine Learning53
Multi-Swarm Co-Evolution Based Hybrid Intelligent Optimization for Bi-Objective Multi-Workflow Scheduling in the Cloud52
Accelerating Sparse Tensor Decomposition Using Adaptive Linearized Representation50
Improving the Scalability of GPU Synchronization Primitives50
Efficient and Automated Deployment Architecture for OpenStack in TianHe SuperComputing Environment50
0.067439794540405