International Journal of High Performance Computing Applications

Papers
(The median citation count of International Journal of High Performance Computing Applications is 1. The table below lists the papers at or above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2022-01-01 to 2026-01-01.)
Article | Citations
Visualization at exascale: Making it all work with VTK-m | 63
Dynamic spawning of MPI processes applied to malleability | 63
Enhancing scalability of a matrix-free eigensolver for studying many-body localization | 59
Accelerating atmospheric physics parameterizations using graphics processing units | 38
Automatizing the creation of specialized high-performance computing containers | 35
HPL-MxP benchmark: Mixed-precision algorithms, iterative refinement, and scalable data generation | 32
Running ahead of evolution—AI-based simulation for predicting future high-risk SARS-CoV-2 variants | 22
Compressed basis GMRES on high-performance graphics processing units | 16
HPC I/O innovations in the exascale era | 15
Refining HPCToolkit for application performance analysis at exascale | 14
Large-scale ab initio simulation of light–matter interaction at the atomic scale in Fugaku | 14
HDF5 in the exascale era: Delivering efficient and scalable parallel I/O for exascale applications | 13
Direct numerical simulations for hybrid rocket boundary layers: Performance modeling and scaling | 13
Modeling, evaluating, and orchestrating heterogeneous environmental leverages for large-scale data center management | 13
Julia versus C++ Kokkos for performance portable Cartesian CFD solvers on heterogeneous architectures | 11
Scalable multilevel Monte Carlo methods exploiting parallel redistribution on coarse levels | 10
Orchestration of materials science workflows for heterogeneous resources at large scale | 10
A tale of two codes: CUDA vs OpenACC for mass-zero constrained dynamics | 9
General framework for re-assuring numerical reliability in parallel Krylov solvers: A case of bi-conjugate gradient stabilized methods | 9
Preparing MPICH for exascale | 9
Ginkgo - A math library designed to accelerate Exascale Computing Project science applications | 9
Performance of explicit and IMEX MRI multirate methods on complex reactive flow problems within modern parallel adaptive structured grid frameworks | 8
Hypergraph-based locality-enhancing methods for graph operations in Big Data applications | 8
An elastic framework for ensemble-based large-scale data assimilation | 8
Special issue introduction | 8
GPU-based molecular dynamics of fluid flows: Reaching for turbulence | 8
Retraction Notice | 8
Integrating ytopt and libEnsemble to autotune OpenMC | 7
Massively parallel nodal discontinous Galerkin finite element method simulator for room acoustics | 7
A study on the performance of distributed training of data-driven CFD simulations | 7
Accelerated dynamic data reduction using spatial and temporal properties | 7
PeleC: An adaptive mesh refinement solver for compressible reacting flows | 6
Special issue: Introduction | 6
Fast truncated SVD of sparse and dense matrices on graphics processors | 6
TransGRU-X – A fusion Seq2Seq network enhanced with multiresolution analysis and gating for forecasting of AI/ML workloads in cloud environments | 5
HOPPS: A performance portable spectral difference solver for high-fidelity computational fluid dynamics | 5
Fair-sharing simulator: Toward fair scheduling in batch computing systems | 5
Heterogeneous programming using OpenMP and CUDA/HIP for hybrid CPU-GPU scientific applications | 5
Experiences with nested parallelism in task-parallel applications using malleable BLAS on multicore processors | 5
Clacc: OpenACC for C/C++ in Clang | 5
Sequence length scaling in vision transformers for scientific images on frontier | 5
Preparing the TAU performance system for exascale and beyond | 5
Data-driven scalable pipeline using national agent-based models for real-time pandemic response and decision support | 5
NUMA-aware parallel sparse LU factorization for SPICE-based circuit simulators on ARM multi-core processors | 5
Democratizing responsible artificial intelligence for innovation and impact | 5
Cache blocking of distributed-memory parallel matrix power kernels | 5
Technology trends in computing hardware and their impacts on high-performance scientific computing Part I: General-purpose processors and hardware accelerators | 5
Bricks: A high-performance portability layer for computations on block-structured grids | 4
Accelerating cluster dynamics simulation of fission gas behavior in nuclear fuel on deep computing unit–based heterogeneous architecture supercomputer | 4
Asynchronous-many-task systems: Challenges and opportunities - Scaling an AMR astrophysics code on exascale machines using Kokkos and HPX | 4
UMap: An application-oriented user level memory mapping library | 4
P4IRS: An intermediate representation and compiler for parallel and performance-portable particle simulations | 4
Guest editors note: Special issue on clusters, clouds, and data for scientific computing | 4
Understanding power and energy utilization in large scale production physics simulation codes | 4
Semi-Lagrangian 4d, 5d, and 6d kinetic plasma simulation on large-scale GPU-equipped supercomputers | 4
An integrated three-dimensional aeromechanical analysis for the prediction of stresses on modern coaxial rotors | 4
PoCL-R: An open standard based heterogeneous offloading layer with server side scalability | 4
Advances in ArborX to support exascale applications | 4
Cache-optimized and low-overhead implementations of additive Schwarz methods for high-order FEM multigrid computations | 4
HPC-AI coupling methodology for scientific applications | 4
Feynman and computation: From Los Alamos to quantum computers | 4
Abisko: Deep codesign of an architecture for spiking neural networks using novel neuromorphic materials | 4
Performance analysis of relaxation Runge–Kutta methods | 4
MAGMA: Enabling exascale performance with accelerated BLAS and LAPACK for diverse GPU architectures | 4
Scalable cosmic AI inference using cloud serverless computing | 3
PaRSEC: Scalability, flexibility, and hybrid architecture support for task-based applications in ECP | 3
Performance evaluation of mixed-precision Runge–Kutta methods for the solution of partial differential equations | 3
IO-aware Job-Scheduling: Exploiting the Impacts of Workload Characterizations to select the Mapping Strategy | 3
ECP libraries and tools: An overview | 3
Exploiting mesh structure to improve multigrid performance for saddle-point problems | 3
An HPC benchmark survey and taxonomy for characterization | 3
Enhancing data locality of the conjugate gradient method for high-order matrix-free finite-element implementations | 3
The ECP ALPINE project: In situ and post hoc visualization infrastructure and analysis capabilities for exascale | 3
Simulation-based machine learning for real-time assessment of side-branch hemodynamics in coronary bifurcation lesions | 3
Black-box statistical prediction of lossy compression ratios for scientific data | 3
Detecting interference between applications and improving the scheduling using malleable application clones | 2
Efficient solution of batched band linear systems on GPUs | 2
An implicit barotropic mode solver for MPAS-ocean using a modern Fortran solver interface | 2
Fixed-work versus fixed-time checkpointing on large-scale failure-prone platforms | 2
#COVIDisAirborne: AI-enabled multiscale computational microscopy of delta SARS-CoV-2 in a respiratory aerosol | 2
Fault-tolerant numerical iterative algorithms at scale | 2
Deep learning foundation and pattern models: Challenges in hydrological time series | 2
High-performance conjugate gradient benchmark: A comprehensive survey | 2
End-to-end GPU acceleration of low-order-refined preconditioning for high-order finite element discretizations | 2
High performance computing seismic redatuming by inversion with algebraic compression and multiple precisions | 2
Corrigendum to large-scale direct numerical simulations of turbulence using GPUs and modern Fortran | 2
SWARM: Reimagining scientific workflow management systems in a distributed world | 2
Role-shifting threads: Increasing OpenMP malleability to address load imbalance at MPI and OpenMP | 2
Evolution of the SLATE linear algebra library | 2
Mixed precision LU factorization on GPU tensor cores: reducing data movement and memory footprint | 2
Experience and analysis of scalable high-fidelity computational fluid dynamics on modular supercomputing architectures | 1
HipBone: A performance-portable graphics processing unit-accelerated C++ version of the NekBone benchmark | 1
Numerical eigen-spectrum slicing, accurate orthogonal eigen-basis, and mixed-precision eigenvalue refinement using OpenMP data-dependent tasks and accelerator offload | 1
Performance enhancement of the Ozaki Scheme on integer matrix multiplication unit | 1
Result-scalability: Following the evolution of selected social impact of HPC | 1
ZFP: A compressed array representation for numerical computations | 1
Task-parallel in situ temporal compression of large-scale computational fluid dynamics data | 1
Parallel performance of shared memory parallel spectral deferred corrections | 1
A two-level GPU-accelerated incomplete LU preconditioner for general sparse linear systems | 1
Efficiency and scalability of fully-resolved fluid-particle simulations on heterogeneous CPU-GPU architectures | 1
Performance comparison of the A-grid and C-grid shallow-water models on icosahedral grids | 1
PETSc/TAO developments for GPU-based early exascale systems | 1
Performance portability in a real world application: PHAST applied to Caffe | 1
Parallel performance comparison of different CFD solvers and overset libraries for store separation analyses | 1
A compilation-based approach to performant reduction and redistribution collective communication algorithms | 1
Myths and legends in high-performance computing | 1
Batched sparse direct solver design and evaluation in SuperLU_DIST | 1
Towards exascale for wind energy simulations | 1
Portable, heterogeneous ensemble workflows at scale using libEnsemble | 1
Parthenon—a performance portable block-structured adaptive mesh refinement framework | 1
A fine-grained parallelization of the immersed boundary method | 1
Analytic roofline modeling and energy analysis of the LULESH proxy application on multi-core clusters | 1