OOIR: Observatory of International Research

Papers

(The H4-Index of IEEE Transactions on Parallel and Distributed Systems is 52. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-05-01 to 2026-05-01.)

Article	Citations
Critique of “MemXCT: Memory-Centric X-Ray CT Reconstruction With Massive Parallelization” by SCC Team From Tsinghua University	204
Optimizing Data Locality by Integrating Intermediate Data Partitioning and Reduce Task Scheduling in Spark Framework	192
Online Container Caching for IoT Data Processing in Serverless Edge Computing	140
ComStar: Compression-Aware Stream Query for Heterogeneous Hybrid Architecture	138
H5Intent: Autotuning HDF5 With User Intent	134
Enabling Large Scale Simulations for Particle Accelerators	129
Design and Implementation of 2D Convolution on x86/x64 Processors	123
Federated Learning With Nesterov Accelerated Gradient	112
A Memory-Constraint-Aware List Scheduling Algorithm for Memory-Constraint Heterogeneous Muti-Processor System	112
Bal-DGCN: A Hardware Acceleration Framework for Balanced Computational Efficiency in Dynamic Graph Convolutional Networks (DGCNs)	111
Decentralized Federated Learning with Period Gradient Tracking over Time-Varying Networks	110
On the Message Complexity of Fault-Tolerant Computation: Leader Election and Agreement	106
A Point Cloud Video Recognition Acceleration Framework Based on Tempo-Spatial Information	101
mtGEMM: An Efficient GEMM Library for Modern Multi-Core DSPs	99
Jdebug: A Fast, Non-Intrusive and Scalable Fault Locating Tool for Ten-Million-Scale Parallel Applications	94
Large-Scale Neural Network Quantum States Calculation for Quantum Chemistry on a New Sunway Supercomputer	94
IRHunter: Universal Detection of Instruction Reordering Vulnerabilities for Enhanced Concurrency in Distributed and Parallel Systems	88
Mapping Large-Scale Spiking Neural Network on Arbitrary Meshed Neuromorphic Hardware	87
QoS-Aware Scheduling of Remote Rendering for Interactive Multimedia Applications in Edge Computing	85
Fully Decentralized Data Distribution for Large-Scale HPC Systems	85
EdgeTB: A Hybrid Testbed for Distributed Machine Learning at the Edge With High Fidelity	84
Distributed Task Processing Platform for Infrastructure-Less IoT Networks: A Multi-Dimensional Optimization Approach	84
An Efficient Bottleneck Planes Exclusion Method for Reconfiguring 3D VLSI Arrays	82
fPIM: A Holistic Design to Optimize PIM Data Flow for High Execution Efficiency	80
GeoScale: Microservice Autoscaling With Cost Budget in Geo-Distributed Edge Clouds	78

STR: Hybrid Tensor Re-Generation to Break Memory Wall for DNN Training	75
AWB+-Tree: A Novel Width-Based Index Structure Supporting Hybrid Matching for Large-Scale Content-Based Pub/Sub Systems	72
HRCM: A Hierarchical Regularizing Mechanism for Sparse and Imbalanced Communication in Whole Human Brain Simulations	71
Replicated Versioned Data Structures for Wide-Area Distributed Systems	69
RHINO: An Efficient Serverless Container System for Small-Scale HPC Applications	68
AESM² Attribute-Based Encrypted Search for Multi-Owner and Multi-User Distributed Systems	67
Accelerating Data Delivery of Latency-Sensitive Applications in Container Overlay Network	66
On the Performance of SMASH: A Non-Preemptive Window-Based Scheduler for Multiserver Jobs	66
Simple, Fast and Widely Applicable Concurrent Memory Reclamation via Neutralization	64
Asynchronous Algorithms for Decentralized Resource Allocation Over Directed Networks	63
Joint Model Pruning and Topology Construction for Accelerating Decentralized Machine Learning	63
Agile Cache Replacement in Edge Computing via Offline-Online Deep Reinforcement Learning	62
Tag-Sharer-Fusion Directory: A Scalable Coherence Directory With Flexible Entry Formats	61
BARM: A Batch-Aware Resource Manager for Boosting Multiple Neural Networks Inference on GPUs With Memory Oversubscription	61
Efficient and Automated Deployment Architecture for OpenStack in TianHe SuperComputing Environment	60
Scalable Hybrid Learning Techniques for Scientific Data Compression	60
PHIDE: A Parallel Hybrid Direct–Iterative Eigensolver for Hermitian Eigenvalue Problems	59
Building Accurate and Interpretable Online Classifiers on Edge Devices	58
Graph-Centric Performance Analysis for Large-Scale Parallel Applications	58
Securing Fine-Grained Data Sharing and Erasure in Outsourced Storage Systems	58
HarmonyCache: Scalable In-Network Cache With Read-Write Separation	58
A Novel Parallel Algorithm for Sparse Tensor Matrix Chain Multiplication via TCU-Acceleration	55
DyLaClass: Dynamic Labeling Based Classification for Optimal Sparse Matrix Format Selection in Accelerating SpMV	55
Improving the Scalability of GPU Synchronization Primitives	54
Multi-Swarm Co-Evolution Based Hybrid Intelligent Optimization for Bi-Objective Multi-Workflow Scheduling in the Cloud	54
PreTrans: Enabling Efficient CGRA Multi-Task Context Switch Through Config Pre-Mapping and Data Transceiving	54
High-Level Data Abstraction and Elastic Data Caching for Data-Intensive AI Applications on Cloud-Native Platforms	53
GreenFlow: A Carbon-Efficient Scheduler for Deep Learning Workloads	52