IEEE Transactions on Parallel and Distributed Systems

Papers
(The TQCC of IEEE Transactions on Parallel and Distributed Systems is 11. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-05-01 to 2024-05-01.)
ArticleCitations
Fast Adaptive Task Offloading in Edge Computing Based on Meta Reinforcement Learning202
Accelerating Federated Learning via Momentum Gradient Descent199
A Scalable Multi-Layer PBFT Consensus for Blockchain198
Self-Balancing Federated Learning With Global Imbalanced Data in Mobile Systems195
Online Collaborative Data Caching in Edge Computing152
Kokkos 3: Programming Model Extensions for the Exascale Era133
Decentralized Edge Intelligence: A Dynamic Resource Allocation Framework for Hierarchical Federated Learning130
Biscotti: A Blockchain System for Private and Secure Federated Learning130
Towards Fair and Privacy-Preserving Federated Deep Models126
Cost-Effective App Data Distribution in Edge Computing122
Energy-Efficient Offloading for DNN-Based Smart IoT Systems in Cloud-Edge Environments110
Recent Advances of Resource Allocation in Network Function Virtualization109
GRP-HEFT: A Budget-Constrained Resource Provisioning Scheme for Workflow Scheduling in IaaS Clouds106
Multi-Task Federated Learning for Personalised Deep Neural Networks in Edge Computing104
Distributed and Dynamic Service Placement in Pervasive Edge Computing Networks92
Multi-Agent Imitation Learning for Pervasive Edge Computing: A Decentralized Computation Offloading Algorithm89
The Deep Learning Compiler: A Comprehensive Survey88
Auditing Cache Data Integrity in the Edge Computing Environment88
Communication-Efficient Federated Learning With Compensated Overlap-FedAvg80
Blockchain Assisted Decentralized Federated Learning (BLADE-FL): Performance Analysis and Resource Allocation78
Online Deadline-Aware Task Dispatching and Scheduling in Edge Computing77
Modeling and Optimization of Performance and Cost of Serverless Applications75
Distributed Task Migration Optimization in MEC by Extending Multi-Agent Deep Reinforcement Learning Approach74
CASpMV: A Customized and Accelerative SpMV Framework for the Sunway TaihuLight71
Distributed and Collective Deep Reinforcement Learning for Computation Offloading: A Practical Perspective69
AUCTION: Automated and Quality-Aware Client Selection Framework for Efficient Federated Learning68
Multi-Hop Multi-Task Partial Computation Offloading in Collaborative Edge Computing66
Offloading Tasks With Dependency and Service Caching in Mobile Edge Computing64
POCLib: A High-Performance Framework for Enabling Near Orthogonal Processing on Compression62
Elastic Scheduling for Microservice Applications in Clouds62
FRATO: Fog Resource Based Adaptive Task Offloading for Delay-Minimizing IoT Service Provisioning61
Blockchain at the Edge: Performance of Resource-Constrained IoT Networks61
Min-Max Cost Optimization for Efficient Hierarchical Federated Learning in Wireless Edge Networks59
Proof of Federated Learning: A Novel Energy-Recycling Consensus Algorithm58
CSEdge: Enabling Collaborative Edge Storage for Multi-Access Edge Computing Based on Blockchain58
Reliability-Aware Network Service Provisioning in Mobile Edge-Cloud Networks58
A Potential Game Theoretic Approach to Computation Offloading Strategy Optimization in End-Edge-Cloud Computing57
Energy-Aware Inference Offloading for DNN-Driven Applications in Mobile Edge Clouds55
Privacy-Preserving Multi-Keyword Searchable Encryption for Distributed Systems54
Efficient Algorithms for Delay-Aware NFV-Enabled Multicasting in Mobile Edge Clouds With Resource Sharing53
On-Edge Multi-Task Transfer Learning: Model and Practice With Data-Driven Task Allocation52
COSCO: Container Orchestration Using Co-Simulation and Gradient Based Optimization for Fog Computing Environments52
VQL: Efficient and Verifiable Cloud Query Services for Blockchain Systems51
On Consortium Blockchain Consistency: A Queueing Network Model Approach49
Cryptomining Detection in Container Clouds Using System Calls and Explainable Machine Learning49
Thermal Prediction for Efficient Energy Management of Clouds Using Machine Learning49
Towards Efficient Scheduling of Federated Mobile Devices Under Computational and Statistical Heterogeneity48
Evaluation of Stream Processing Frameworks47
Faster Parallel Core Maintenance Algorithms in Dynamic Graphs46
Adaptive Resource Efficient Microservice Deployment in Cloud-Edge Continuum46
Taskflow: A Lightweight Parallel and Heterogeneous Task Graph Computing System46
Distributed Training of Deep Learning Models: A Taxonomic Perspective45
TODG: Distributed Task Offloading With Delay Guarantees for Edge Computing45
Transformations of High-Level Synthesis Codes for High-Performance Computing44
Performance and Cost-Efficient Spark Job Scheduling Based on Deep Reinforcement Learning in Cloud Computing Environments44
Achieving High Performance on Supercomputers with a Sequential Task-based Programming Model44
e-PoS: Making Proof-of-Stake Decentralized and Fair44
Joint Task Scheduling and Containerizing for Efficient Edge Computing43
Task Scheduling for Energy Consumption Constrained Parallel Applications on Heterogeneous Computing Systems43
Elastic and Reliable Bandwidth Reservation Based on Distributed Traffic Monitoring and Control43
IPPTS: An Efficient Algorithm for Scientific Workflow Scheduling in Heterogeneous Computing Systems42
The Design of Fast Content-Defined Chunking for Data Deduplication Based Storage Systems41
A Game-Based Approach for Cost-Aware Task Assignment With QoS Constraint in Collaborative Edge and Cloud Environments41
DeepSlicing: Collaborative and Adaptive CNN Inference With Low Latency40
Data, User and Power Allocations for Caching in Multi-Access Edge Computing39
Mechanisms for Resource Allocation and Pricing in Mobile Edge Computing Systems39
Towards Distributed SDN: Mobility Management and Flow Scheduling in Software Defined Urban IoT38
Heterogeneous Edge Offloading With Incomplete Information: A Minority Game Approach38
ADRL: A Hybrid Anomaly-Aware Deep Reinforcement Learning-Based Resource Scaling in Clouds37
A Quantum Approach Towards the Adaptive Prediction of Cloud Workloads37
DL2: A Deep Learning-Driven Scheduler for Deep Learning Clusters36
Automated Fine-Grained CPU Cap Control in Serverless Computing Platform35
Burst Load Evacuation Based on Dispatching and Scheduling In Distributed Edge Networks35
Online Learning for Distributed Computation Offloading in Wireless Powered Mobile Edge Computing Networks35
Congestion-Balanced and Welfare-Maximized Charging Strategies for Electric Vehicles35
Location-Aware and Budget-Constrained Service Deployment for Composite Applications in Multi-Cloud Environment35
FedGraph: Federated Graph Learning With Intelligent Sampling35
Horus: Interference-Aware and Prediction-Based Scheduling in Deep Learning Systems35
Hierarchical Multi-Agent Optimization for Resource Allocation in Cloud Computing34
On the Effective Parallelization and Near-Optimal Deployment of Service Function Chains34
Algorithm-Based Fault Tolerance for Convolutional Neural Networks34
PQC Acceleration Using GPUs: FrodoKEM, NewHope, and Kyber34
Deep Reinforcement Learning for Load-Balancing Aware Network Control in IoT Edge Systems34
VPIC 2.0: Next Generation Particle-in-Cell Simulations33
Maximizing User Service Satisfaction for Delay-Sensitive IoT Applications in Edge Computing32
Customer Perceived Value- and Risk-Aware Multiserver Configuration for Profit Maximization31
Network-Aware Locality Scheduling for Distributed Data Operators in Data Centers30
Efficient Compute-Intensive Job Allocation in Data Centers via Deep Reinforcement Learning30
Cost-Efficient Workflow Scheduling Algorithm for Applications With Deadline Constraint on Heterogeneous Clouds30
Adaptive and Efficient Resource Allocation in Cloud Datacenters Using Actor-Critic Deep Reinforcement Learning29
Multi-Swarm Co-Evolution Based Hybrid Intelligent Optimization for Bi-Objective Multi-Workflow Scheduling in the Cloud29
A Ubiquitous Machine Learning Accelerator With Automatic Parallelization on FPGA29
Large-Scale Analysis of Docker Images and Performance Implications for Container Storage Systems29
Dependent Function Embedding for Distributed Serverless Edge Computing29
Distributed Adaptive Consensus Tracking Control for Multi-Agent System With Communication Constraints29
Liquid: Intelligent Resource Estimation and Network-Efficient Scheduling for Deep Learning Jobs on Distributed GPU Clusters29
GPU-Accelerated Real-Time Stereo Estimation With Binary Neural Network29
VeriML: Enabling Integrity Assurances and Fair Payments for Machine Learning as a Service28
Learning Spatiotemporal Failure Dependencies for Resilient Edge Computing Services27
Optimizing Depthwise Separable Convolution Operations on GPUs27
LightChain: Scalable DHT-Based Blockchain27
Constructing Completely Independent Spanning Trees in Data Center Network Based on Augmented Cube27
An In-Depth Study of Microservice Call Graph and Runtime Performance27
O3BNN-R: An Out-of-Order Architecture for High-Performance and Regularized BNN Inference26
Topology-Aware Neural Model for Highly Accurate QoS Prediction26
Minority Disk Failure Prediction Based on Transfer Learning in Large Data Centers of Heterogeneous Disk Systems26
Energy-Efficient Parallel Real-Time Scheduling on Clustered Multi-Core26
T-Caching: Enhancing Feasibility of In-Network Caching in ICN26
Towards Higher Performance and Robust Compilation for CGRA Modulo Scheduling26
Multi-GPU Design and Performance Evaluation of Homomorphic Encryption on GPU Clusters26
GPGPU Performance Estimation With Core and Memory Frequency Scaling26
A Decentralized Federated Learning Framework via Committee Mechanism With Convergence Guarantee26
ERA-LSTM: An Efficient ReRAM-Based Architecture for Long Short-Term Memory25
Rusty: Runtime Interference-Aware Predictive Monitoring for Modern Multi-Tenant Systems25
Elastic Resource Allocation Against Imbalanced Transaction Assignments in Sharding-Based Permissioned Blockchains25
Coordinated Batching and DVFS for DNN Inference on GPU Accelerators24
Endurance-Aware Mapping of Spiking Neural Networks to Neuromorphic Hardware24
Completely Independent Spanning Trees on BCCC Data Center Networks With an Application to Fault-Tolerant Routing24
Towards Revenue-Driven Multi-User Online Task Offloading in Edge Computing24
Joint SFC Deployment and Resource Management in Heterogeneous Edge for Latency Minimization24
EdgeDR: An Online Mechanism Design for Demand Response in Edge Clouds23
Flexible Clustered Federated Learning for Client-Level Data Distribution Shift23
K-Athena: A Performance Portable Structured Grid Finite Volume Magnetohydrodynamics Code23
Efficient Distributed Approaches to Core Maintenance on Large Dynamic Graphs23
Cooperative Edge Caching Based on Temporal Convolutional Networks22
Incentive Mechanism Design for Joint Resource Allocation in Blockchain-Based Federated Learning22
Monodirectional Evolutional Symport Tissue P Systems With Promoters and Cell Division22
An Efficient Parallel Secure Machine Learning Framework on GPUs22
Scalable and Adaptive Data Replica Placement for Geo-Distributed Cloud Storages22
High-Performance Routing With Multipathing and Path Diversity in Ethernet and HPC Networks22
Elastic Deep Learning in Multi-Tenant GPU Clusters21
GPU Tensor Cores for Fast Arithmetic Reductions21
LOCUS: User-Perceived Delay-Aware Service Placement and User Allocation in MEC Environment21
An Optimal Locality-Aware Task Scheduling Algorithm Based on Bipartite Graph Modelling for Spark Applications21
Differentially Private Byzantine-Robust Federated Learning21
aeSpTV: An Adaptive and Efficient Framework for Sparse Tensor-Vector Product Kernel on a High-Performance Computing Platform20
An Approximate Communication Framework for Network-on-Chips20
Efficient Virtual Network Embedding of Cloud-Based Data Center Networks into Optical Networks20
Context-Aware Online Client Selection for Hierarchical Federated Learning20
High-Quality Shared-Memory Graph Partitioning20
Towards Efficient and Stable K-Asynchronous Federated Learning With Unbounded Stale Gradients on Non-IID Data20
Exploring Data Analytics Without Decompression on Embedded GPU Systems20
COOPER-SCHED: A Cooperative Scheduling Framework for Mobile Edge Computing with Expected Deadline Guarantee20
Bi-Objective Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms for Performance and Energy Through Workload Distribution20
Decentralized Application Placement in Fog Computing19
A Survey of System Architectures and Techniques for FPGA Virtualization19
Efficient Parallelism of Post-Quantum Signature Scheme SPHINCS19
Parallel Blockwise Knowledge Distillation for Deep Neural Network Compression19
Parallelization and Optimization of NSGA-II on Sunway TaihuLight System19
Scalable, Multi-Constraint, Complex-Objective Graph Partitioning19
BOSSA: A Decentralized System for Proofs of Data Retrievability and Replication19
MG-WFBP: Merging Gradients Wisely for Efficient Communication in Distributed Deep Learning19
Petrel: Heterogeneity-Aware Distributed Deep Learning Via Hybrid Synchronization19
Anomaly Detection and Anticipation in High Performance Computing Systems19
P-PFC: Reducing Tail Latency with Predictive PFC in Lossless Data Center Networks19
Decentralized Utility- and Locality-Aware Replication for Heterogeneous DHT-Based P2P Cloud Storage Systems19
Design and Performance Characterization of RADICAL-Pilot on Leadership-Class Platforms19
CURE: A High-Performance, Low-Power, and Reliable Network-on-Chip Design Using Reinforcement Learning19
Performance-Aware Speculative Resource Oversubscription for Large-Scale Clusters18
SLEEF: A Portable Vectorized Library of C Standard Mathematical Functions18
Accelerating Gossip-Based Deep Learning in Heterogeneous Edge Computing Platforms18
Cost-Efficient Server Configuration and Placement for Mobile Edge Computing17
An Event-Driven Approach to Serverless Seismic Imaging in the Cloud17
Scheduling Periodical Multi-Stage Jobs With Fuzziness to Elastic Cloud Resources17
Efficient Data Loader for Fast Sampling-Based GNN Training on Large Graphs17
A Bifactor Approximation Algorithm for Cloudlet Placement in Edge Computing17
Microservice Deployment in Edge Computing Based on Deep Q Learning16
RENDA: Resource and Network Aware Data Placement Algorithm for Periodic Workloads in Cloud16
QShield: Protecting Outsourced Cloud Data Queries With Multi-User Access Control Based on SGX16
GossipFL: A Decentralized Federated Learning Framework With Sparsified and Adaptive Communication16
Towards Usable Cloud Storage Auditing16
Hamiltonian Paths of -cubes Avoiding Faulty Links and Passing Through Prescribed Linear Forests16
A Practical and Efficient Bidirectional Access Control Scheme for Cloud-Edge Data Sharing16
Accelerating Deep Learning Inference via Model Parallelism and Partial Computation Offloading16
Parallel and Asynchronous Smart Contract Execution16
Benzene: Scaling Blockchain With Cooperation-Based Sharding16
Mobility-Aware Offloading and Resource Allocation for Distributed Services Collaboration16
Endpoint-Flexible Coflow Scheduling Across Geo-Distributed Datacenters16
Modeling Analysis and Cost-Performance Ratio Optimization of Virtual Machine Scheduling in Cloud Computing15
Accelerating Geostatistical Modeling and Prediction With Mixed-Precision Computations: A High-Productivity Approach With PaRSEC15
GML: Efficiently Auto-Tuning Flink's Configurations Via Guided Machine Learning15
Co-Active: A Workload-Aware Collaborative Cache Management Scheme for NVMe SSDs15
Dynamic Load Balancing in Parallel Execution of Cellular Automata15
Cuttlefish: Neural Configuration Adaptation for Video Analysis in Live Augmented Reality15
HiTDL: High-Throughput Deep Learning Inference at the Hybrid Mobile Edge15
Achieving Fine-Grained Flow Management Through Hybrid Rule Placement in SDNs15
Error-Compensated Sparsification for Communication-Efficient Decentralized Training in Edge Environment15
SF-Sketch: A Two-Stage Sketch for Data Streams15
Efficient Function Queryable and Privacy Preserving Data Aggregation Scheme in Smart Grid15
Combinatorial BLAS 2.0: Scaling Combinatorial Algorithms on Distributed-Memory Systems15
Improving Federated Learning With Quality-Aware User Incentive and Auto-Weighted Model Aggregation15
Fine-Grained Multi-Query Stream Processing on Integrated Architectures15
Adaptive Federated Deep Reinforcement Learning for Proactive Content Caching in Edge Computing14
Deterministic Data Distribution for Efficient Recovery in Erasure-Coded Storage Systems14
Octans: Optimal Placement of Service Function Chains in Many-Core Systems14
WindFlow: High-Speed Continuous Stream Processing With Parallel Building Blocks14
Privacy-Preserving Efficient Federated-Learning Model Debugging14
vPipe: A Virtualized Acceleration System for Achieving Efficient and Scalable Pipeline Parallel DNN Training14
Silent-PIM: Realizing the Processing-in-Memory Computing With Standard Memory Requests14
TherMa-MiCs: Thermal-Aware Scheduling for Fault-Tolerant Mixed-Criticality Systems14
Trust: Triangle Counting Reloaded on GPUs14
E2bird: Enhanced Elastic Batch for Improving Responsiveness and Throughput of Deep Learning Services14
The Workflow Trace Archive: Open-Access Data From Public and Private Computing Infrastructures14
The Case for Strong Scaling in Deep Learning: Training Large 3D CNNs with Hybrid Parallelism14
GOSH: Task Scheduling Using Deep Surrogate Models in Fog Computing Environments14
Merak: An Efficient Distributed DNN Training Framework With Automated 3D Parallelism for Giant Foundation Models14
A Runtime and Non-Intrusive Approach to Optimize EDP by Tuning Threads and CPU Frequency for OpenMP Applications14
Collaborative Heterogeneity-Aware OS Scheduler for Asymmetric Multicore Processors13
Eiffel: Efficient and Fair Scheduling in Adaptive Federated Learning13
Resilient Real-Valued Consensus in Spite of Mobile Malicious Agents on Directed Graphs13
Decentralized Dual Proximal Gradient Algorithms for Non-Smooth Constrained Composite Optimization Problems13
Overlapping Communication With Computation in Parameter Server for Scalable DL Training13
libEnsemble: A Library to Coordinate the Concurrent Evaluation of Dynamic Ensembles of Calculations13
Energy-Efficient Hardware-Accelerated Synchronization for Shared-L1-Memory Multiprocessor Clusters13
Performant, Multi-Objective Scheduling of Highly Interleaved Task Graphs on Heterogeneous System on Chip Devices13
A Generic Stochastic Model for Resource Availability in Fog Computing Environments13
MCDS: AI Augmented Workflow Scheduling in Mobile Edge Cloud Computing Systems13
Improving Restore Performance for In-Line Backup System Combining Deduplication and Delta Compression13
HiFlash: Communication-Efficient Hierarchical Federated Learning With Adaptive Staleness Control and Heterogeneity-Aware Client-Edge Association13
Phase-Aware Cache Partitioning to Target Both Turnaround Time and System Performance13
Millimeter-Scale and Billion-Atom Reactive Force Field Simulation on Sunway Taihulight13
High Performance Simulation of Spiking Neural Network on GPGPUs13
cuNH: Efficient GPU Implementations of Post-Quantum KEM NewHope13
Accelerating Restarted GMRES With Mixed Precision Arithmetic13
Optimizing Streaming Parallelism on Heterogeneous Many-Core Architectures13
Efficient Forwarding Anomaly Detection in Software-Defined Networks12
Joint Application Placement and Request Routing Optimization for Dynamic Edge Computing Service Management12
Cost-Effective Web Application Replication and Deployment in Multi-Cloud Environment12
PISTIS: An Event-Triggered Real-Time Byzantine-Resilient Protocol Suite12
Replica Exchange MCMC Hardware With Automatic Temperature Selection and Parallel Trial12
Preemptive and Low Latency Datacenter Scheduling via Lightweight Containers12
A Pattern-Based SpGEMM Library for Multi-Core and Many-Core Architectures12
Improving the Performance of Deduplication-Based Storage Cache via Content-Driven Cache Management Methods12
Look-up-Table Based Processing-in-Memory Architecture With Programmable Precision-Scaling for Deep Learning Applications12
Addictive Incentive Mechanism in Crowdsensing From the Perspective of Behavioral Economics12
Joint Coverage-Reliability for Budgeted Edge Application Deployment in Mobile Edge Computing Environment12
Coflow Scheduling in Data Centers: Routing and Bandwidth Allocation12
ESetStore: An Erasure-Coded Storage System With Fast Data Recovery12
Why Dataset Properties Bound the Scalability of Parallel Machine Learning Training Algorithms12
Investigating the Adoption of Hybrid Encrypted Cloud Data Deduplication With Game Theory12
Reliability and Confidentiality Co-Verification for Parallel Applications in Distributed Systems12
Learning-Driven Interference-Aware Workload Parallelization for Streaming Applications in Heterogeneous Cluster12
DTransE: Distributed Translating Embedding for Knowledge Graph12
A Pessimistic Fault Diagnosability of Large-Scale Connected Networks via Extra Connectivity12
NITI: Training Integer Neural Networks Using Integer-Only Arithmetic12
Data-Centric Client Selection for Federated Learning Over Distributed Edge Networks11
Auction-Based Cluster Federated Learning in Mobile Edge Computing Systems11
CNNPC: End-Edge-Cloud Collaborative CNN Inference With Joint Model Partition and Compression11
Parallel Training of Pre-Trained Models via Chunk-Based Dynamic Memory Management11
Reputation-aware Hedonic Coalition Formation for Efficient Serverless Hierarchical Federated Learning11
0.035639047622681