Optimizing Computation-Communication Overlap in Asynchronous Task-Based Programs

doi 10.1145/3293883.3295720