IEEE Transactions on Parallel and Distributed Systems

Papers
(The H4-Index of IEEE Transactions on Parallel and Distributed Systems is 49. The table below lists the papers whose CrossRef citation counts are at or above that threshold [max. 250 papers]. It covers papers published in the past four years, i.e., from 2021-06-01 to 2025-06-01.)
Article | Citations
Enabling Large Scale Simulations for Particle Accelerators | 269
Critique of “MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization” by SCC Team From Tsinghua University | 228
Building High-throughput Neural Architecture Search Workflows via a Decoupled Fitness Prediction Engine | 206
Jdebug: A Fast, Non-Intrusive and Scalable Fault Locating Tool for Ten-Million-Scale Parallel Applications | 204
EdgeTB: A Hybrid Testbed for Distributed Machine Learning at the Edge With High Fidelity | 162
Design and Implementation of 2D Convolution on x86/x64 Processors | 160
Replicated Versioned Data Structures for Wide-Area Distributed Systems | 155
A Point Cloud Video Recognition Acceleration Framework Based on Tempo-Spatial Information | 134
An Efficient Bottleneck Planes Exclusion Method for Reconfiguring 3D VLSI Arrays | 130
HRCM: A Hierarchical Regularizing Mechanism for Sparse and Imbalanced Communication in Whole Human Brain Simulations | 126
H5Intent: Autotuning HDF5 With User Intent | 102
Distributed Task Processing Platform for Infrastructure-Less IoT Networks: A Multi-Dimensional Optimization Approach | 101
A Memory-Constraint-Aware List Scheduling Algorithm for Memory-Constraint Heterogeneous Muti-Processor System | 98
GeoScale: Microservice Autoscaling With Cost Budget in Geo-Distributed Edge Clouds | 93
IRHunter: Universal Detection of Instruction Reordering Vulnerabilities for Enhanced Concurrency in Distributed and Parallel Systems | 86
On the Message Complexity of Fault-Tolerant Computation: Leader Election and Agreement | 84
AWB+-Tree: A Novel Width-Based Index Structure Supporting Hybrid Matching for Large-Scale Content-Based Pub/Sub Systems | 83
STR: Hybrid Tensor Re-Generation to Break Memory Wall for DNN Training | 81
Improving I/O Performance for Exascale Applications Through Online Data Layout Reorganization | 79
QoS-Aware Scheduling of Remote Rendering for Interactive Multimedia Applications in Edge Computing | 78
Coflow Scheduling in Data Centers: Routing and Bandwidth Allocation | 76
Joint Task Scheduling and Containerizing for Efficient Edge Computing | 76
Federated Learning With Nesterov Accelerated Gradient | 76
Coordinating Fast Concurrency Adapting With Autoscaling for SLO-Oriented Web Applications | 74
Critique of “Planetary Normal Mode Computation: Parallel Algorithms, Performance, and Reproducibility” by SCC Team From University of Washington | 72
Accelerating Data Delivery of Latency-Sensitive Applications in Container Overlay Network | 71
Graph-Centric Performance Analysis for Large-Scale Parallel Applications | 69
Simple, Fast and Widely Applicable Concurrent Memory Reclamation via Neutralization | 67
LB-Chain: Load-Balanced and Low-Latency Blockchain Sharding via Account Migration | 67
Joint Model Pruning and Topology Construction for Accelerating Decentralized Machine Learning | 65
A Pessimistic Fault Diagnosability of Large-Scale Connected Networks via Extra Connectivity | 65
DyLaClass: Dynamic Labeling Based Classification for Optimal Sparse Matrix Format Selection in Accelerating SpMV | 64
High-Level Data Abstraction and Elastic Data Caching for Data-Intensive AI Applications on Cloud-Native Platforms | 63
Efficient and Automated Deployment Architecture for OpenStack in TianHe SuperComputing Environment | 63
Asynchronous Algorithms for Decentralized Resource Allocation Over Directed Networks | 61
BARM: A Batch-Aware Resource Manager for Boosting Multiple Neural Networks Inference on GPUs With Memory Oversubscription | 61
Tag-Sharer-Fusion Directory: A Scalable Coherence Directory With Flexible Entry Formats | 60
Improved MPC Algorithms for Edit Distance and Ulam Distance | 60
Error-Compensated Sparsification for Communication-Efficient Decentralized Training in Edge Environment | 58
AESM2 Attribute-Based Encrypted Search for Multi-Owner and Multi-User Distributed Systems | 57
Burst Load Evacuation Based on Dispatching and Scheduling In Distributed Edge Networks | 55
Multi-Swarm Co-Evolution Based Hybrid Intelligent Optimization for Bi-Objective Multi-Workflow Scheduling in the Cloud | 55
CiMBA: Accelerating Genome Sequencing Through On-Device Basecalling via Compute-in-Memory | 55
A Novel Parallel Algorithm for Sparse Tensor Matrix Chain Multiplication via TCU-Acceleration | 54
GreenFlow: A Carbon-Efficient Scheduler for Deep Learning Workloads | 54
Securing Fine-Grained Data Sharing and Erasure in Outsourced Storage Systems | 52
Accelerating Sparse Tensor Decomposition Using Adaptive Linearized Representation | 49
Improving the Scalability of GPU Synchronization Primitives | 49
Libfork: Portable Continuation-Stealing With Stackless Coroutines | 49
Agile Cache Replacement in Edge Computing via Offline-Online Deep Reinforcement Learning | 49
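
The stated H4-Index can be cross-checked against the citation counts listed above. The sketch below assumes that "H4-Index" means the standard h-index restricted to papers published in the four-year window (the page itself does not spell out the definition), and that every paper not listed has fewer than 49 citations. The function name `h_index` and the list `counts` are illustrative, not part of the source.

```python
def h_index(citations):
    """Return the largest h such that at least h papers have >= h citations.

    Assumption: this is the usual h-index definition; the source page does
    not define "H4-Index" explicitly.
    """
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank  # the top `rank` papers all have at least `rank` citations
        else:
            break
    return h


# Citation counts copied from the list above, in listed order.
counts = [269, 228, 206, 204, 162, 160, 155, 134, 130, 126,
          102, 101, 98, 93, 86, 84, 83, 81, 79, 78,
          76, 76, 76, 74, 72, 71, 69, 67, 67, 65,
          65, 64, 63, 63, 61, 61, 60, 60, 58, 57,
          55, 55, 55, 54, 54, 52, 49, 49, 49, 49]

print(len(counts))      # number of listed papers
print(h_index(counts))  # h-index of the listed counts
```

Under these assumptions the 50 listed counts give an h-index of 49: 50 papers have at least 49 citations, but only 46 have 50 or more.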