Ara
Toplam kayıt 2, listelenen: 1-2
A runtime heuristic to selectively replicate tasks for application-specific reliability targets
(IEEE, 345 E 47TH ST, NEW YORK, NY 10017 USA, 2016)
n this paper we propose a runtime-based selective task replication technique for task-parallel high performance computing applications. Our selective task replication technique is automatic and does not require ...
Designing and Modelling Selective Replication for Fault-tolerant HPC Applications
(IEEE, 345 E 47TH ST, NEW YORK, NY 10017 USA, 2017)
Fail-stop errors and Silent Data Corruptions (SDCs) are the most common failure modes for High Performance Computing (HPC) applications. There are studies that address fail-stop errors and studies that address SDCs. However ...