Skip to main content

Automatic speculative DOALL for clusters

Author(s): Kim, Hanjun; Johnson, Nick P; Lee, Jae W; Mahlke, Scott A; August, David I

Download
To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1t24d
Full metadata record
DC FieldValueLanguage
dc.contributor.authorKim, Hanjun-
dc.contributor.authorJohnson, Nick P-
dc.contributor.authorLee, Jae W-
dc.contributor.authorMahlke, Scott A-
dc.contributor.authorAugust, David I-
dc.date.accessioned2021-10-08T19:45:19Z-
dc.date.available2021-10-08T19:45:19Z-
dc.date.issued2012-03en_US
dc.identifier.citationKim, Hanjun, Nick P. Johnson, Jae W. Lee, Scott A. Mahlke, and David I. August. "Automatic speculative DOALL for clusters." Proceedings of the Tenth International Symposium on Code Generation and Optimization (2012): pp. 94-103. doi:10.1145/2259016.2259029en_US
dc.identifier.issn2164-2397-
dc.identifier.urihttp://corelab.yonsei.ac.kr/Pubs/cgo12_cluster_specdoall.pdf-
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/pr1t24d-
dc.description.abstractAutomatic parallelization for clusters is a promising alternative to time-consuming, error-prone manual parallelization. However, automatic parallelization is frequently limited by the imprecision of static analysis. Moreover, due to the inherent fragility of static analysis, small changes to the source code can significantly undermine performance. By replacing static analysis with speculation and profiling, automatic parallelization becomes more robust and applicable. A naïve automatic speculative parallelization does not scale for distributed memory clusters, due to the high bandwidth required to validate speculation. This work is the first automatic speculative DOALL (Spec-DOALL) parallelization system for clusters. We have implemented a prototype automatic parallelization system, called Cluster Spec-DOALL, which consists of a Spec-DOALL parallelizing compiler and a speculative runtime for clusters. Since the compiler optimizes communication patterns, and the runtime is optimized for the cases in which speculation succeeds, Cluster Spec-DOALL minimizes the communication and validation overheads of the speculative runtime. Across 8 benchmarks, Cluster Spec-DOALL achieves a geomean speedup of 43.8x on a 120-core cluster, whereas DOALL without speculation achieves only 4.5x speedup. This demonstrates that speculation makes scalable fully-automatic parallelization for clusters possible.en_US
dc.format.extent94 - 103en_US
dc.language.isoen_USen_US
dc.relation.ispartofProceedings of the Tenth International Symposium on Code Generation and Optimizationen_US
dc.rightsAuthor's manuscripten_US
dc.titleAutomatic speculative DOALL for clustersen_US
dc.typeConference Articleen_US
dc.identifier.doi10.1145/2259016.2259029-
pu.type.symplectichttp://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceedingen_US

Files in This Item:
File Description SizeFormat 
AutoSpeculativeDOALLClusters.pdf412.02 kBAdobe PDFView/Download


Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.