Harnessing GPU Tensor Cores for Fast FP16 Arithmetic to Speed Up Mixed-Precision Iterative Refinement Solvers

DOI: 10.1109/sc.2018.00050
Abstract

Available in full text

Publisher: IEEE