(PDF) Harnessing GPU Tensor Cores for Fast FP16 Arithmetic

Harnessing GPU Tensor Cores for Fast FP16 Arithmetic to Speed Up Mixed-Precision Iterative Refinement Solvers

doi 10.1109/sc.2018.00050

Full Text

Abstract

Available in full text

Date

November 1, 2018

Authors

Stanimire Tomov

Nicholas J. Higham

Publisher

IEEE

Related search

The Design of Fast and Energy-Efficient Linear Solvers: On the Potential of Half-Precision Arithmetic and Iterative Refinement Techniques

Lecture Notes in Computer Science

Computer Science

Theoretical Computer Science

English

Adaptive Precision Solvers for Sparse Linear Systems

English

High-Speed Real-Time Spectrum Analysis System Based on FPGA and GPU Parallel Arithmetic

English

Fast Poisson Solvers for Graphics Processing Units

Lecture Notes in Computer Science

Computer Science

Theoretical Computer Science

English

Evolving to Generalize: Trading Precision for Speed

British Journal for the Philosophy of Science

Philosophy of Science

English

Tensor-Rank and Lower Bounds for Arithmetic Formulas

Journal of the ACM

Systems Engineering

Information Systems

Artificial Intelligence

English

Final Report:B595949 - Fast Solvers for Discrete Hodge Laplacians

English

Iterative and Incremental Model Generation by Logic Solvers

Lecture Notes in Computer Science

Computer Science

Theoretical Computer Science

English

Fast Relaxation Solvers for Hyperbolic-Elliptic Phase Transition Problems

SIAM Journal of Scientific Computing

Computational Mathematics

Applied Mathematics

English