OOIR: Observatory of International Research

Papers

(The median citation count of International Journal of Parallel Programming is 1. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-06-01 to 2026-06-01.)

Article	Citations
Accelerating OCaml Programs on FPGA	18
Efficient High-Performance Computing Strategies for the Legendre Pairs Search	16
SkePU-Streaming: Distributed Pipelining of Portable Data-Parallel Skeleton Computations for the Heterogeneous Edge-Cloud Continuum	11
Special Issue on SAMOS 2022	10
Calculation of Distributed-Order Fractional Derivative on Tensor Cores-Enabled GPU	10
Meerkat: A Framework for Dynamic Graph Algorithms on GPUs	7
Erasure-Coded Hybrid Writes Based on Data Delta	7
High-level Programming of Vulkan-based GPUs Through OpenMP	7
SGgraph: A Scalable GPU-Based Edge-Centric Graph Processing Framework	6
Investigating Methods for ASPmT-Based Design Space Exploration in Evolutionary Product Design	6
Scaling the Maximum Flow Computation on GPUs	6
Portable C++ Code that can Look and Feel Like Fortran Code with Yet Another Kernel Launcher (YAKL)	5
Declarative Data Flow in a Graph-Based Distributed Memory Runtime System	5
A Practical Approach for Employing Tensor Train Decomposition in Edge Devices	5
K*-Means: An Efficient Clustering Algorithm with Adaptive Decision Boundaries	5
ControlPULP: A RISC-V On-Chip Parallel Power Controller for Many-Core HPC Processors with FPGA-Based Hardware-In-The-Loop Power and Thermal Emulation	5
Using Machine Learning Hardware to Solve Linear Partial Differential Equations with Finite Difference Methods	4
Design and Performance Evaluation of a Novel High-Speed Hardware Architecture for Keccak Crypto Coprocessor	3
Generic Exact Combinatorial Search at HPC Scale	3
Automatic Heterogeneous Runtime Using Signal Processing Domain-Specific and Parallel Patterns	3
Advancing Interactive Parallelization: iCetus	3
Efficient Implementation of AI Algorithms on an FPGA-Based System for Enhancing Blood Vessel Segmentation	3
Self-Adaptive Micro-Batching for Low-Latency GPU-Accelerated Stream Processing	2
RMOWOA: A Revamped Multi-Objective Whale Optimization Algorithm for Maximizing the Lifetime of a Network in Wireless Sensor Networks	2
Programming Parallelism on FPGAs with Eclat	2

Larger-Than-Memory Stateful Stream Processing with WindFlow	2
CAPIO-CL: The CAPIO Coordination Language	2
Enabling Pinning Strategies for Stream Processing Applications on Multicores	2
SymTensor: Symbolic and Adaptive Tensor Partitioning by Unified Parallelism for Deep Learning	2
Optimizing Three-Dimensional Stencil-Operations on Heterogeneous Computing Environments	2
Generating Sparse Matrices for Large-Scale Spectral Clustering on a Single GPU	2
MICPAT: Micro-Architecture Independent Characteristics Profiling Analysis Tool for GPU Programs	1
SMSG: Profiling-Free Parallelism Modeling for Distributed Training of DNN	1
Giraph-Based Distributed Algorithms for Coloring Large-Scale Graphs	1
RISC-V Instruction Fetch Architecture Optimized for Harsh Environments	1
Split’n’Cover: ISO 26262 Hardware Safety Analysis with SystemC	1
A High-Level API for End-to-End Data Compression in Multi-GPU Cluster Applications	1
Accelerating the Conjugate Gradient Method on Distributed-Memory Computers	1
A Fault-Model-Relevant Classification of Consensus Mechanisms for MPI and HPC	1
Retraction Note: QoS and QoE Enhanced Resource Allocation for Wireless Video Sensor Networks Using Hybrid Optimization Algorithm	1
Dynamic Communication Optimization with Collision Avoidance for Parallel Programs in Distributed Systems	1
High-Level Programming of FPGA-Accelerated Systems with Parallel Patterns	1
Acknowledgement 2025	1
Thread and Data Mapping in Software Transactional Memory: an Overview	1
Intelligent Page Migration on Heterogeneous Memory by Using Transformer	1
Quantitative Evaluation of Fault Detection Strategies in FPGAs for Space Applications	1
Yet Another Lock-Free Atom Table Design for Scalable Symbol Management in Prolog	1