Automatic speculative DOALL for clusters

Kim, Hanjun; Johnson, Nick P; Lee, Jae W; Mahlke, Scott A; August, David I

Automatic speculative DOALL for clusters

Author(s): Kim, Hanjun; Johnson, Nick P; Lee, Jae W; Mahlke, Scott A; August, David I

Download

To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1t24d

Full metadata record

DC Field	Value	Language
dc.contributor.author	Kim, Hanjun	-
dc.contributor.author	Johnson, Nick P	-
dc.contributor.author	Lee, Jae W	-
dc.contributor.author	Mahlke, Scott A	-
dc.contributor.author	August, David I	-
dc.date.accessioned	2021-10-08T19:45:19Z	-
dc.date.available	2021-10-08T19:45:19Z	-
dc.date.issued	2012-03	en_US
dc.identifier.citation	Kim, Hanjun, Nick P. Johnson, Jae W. Lee, Scott A. Mahlke, and David I. August. "Automatic speculative DOALL for clusters." Proceedings of the Tenth International Symposium on Code Generation and Optimization (2012): pp. 94-103. doi:10.1145/2259016.2259029	en_US
dc.identifier.issn	2164-2397	-
dc.identifier.uri	http://corelab.yonsei.ac.kr/Pubs/cgo12_cluster_specdoall.pdf	-
dc.identifier.uri	http://arks.princeton.edu/ark:/88435/pr1t24d	-
dc.description.abstract	Automatic parallelization for clusters is a promising alternative to time-consuming, error-prone manual parallelization. However, automatic parallelization is frequently limited by the imprecision of static analysis. Moreover, due to the inherent fragility of static analysis, small changes to the source code can significantly undermine performance. By replacing static analysis with speculation and profiling, automatic parallelization becomes more robust and applicable. A naïve automatic speculative parallelization does not scale for distributed memory clusters, due to the high bandwidth required to validate speculation. This work is the first automatic speculative DOALL (Spec-DOALL) parallelization system for clusters. We have implemented a prototype automatic parallelization system, called Cluster Spec-DOALL, which consists of a Spec-DOALL parallelizing compiler and a speculative runtime for clusters. Since the compiler optimizes communication patterns, and the runtime is optimized for the cases in which speculation succeeds, Cluster Spec-DOALL minimizes the communication and validation overheads of the speculative runtime. Across 8 benchmarks, Cluster Spec-DOALL achieves a geomean speedup of 43.8x on a 120-core cluster, whereas DOALL without speculation achieves only 4.5x speedup. This demonstrates that speculation makes scalable fully-automatic parallelization for clusters possible.	en_US
dc.format.extent	94 - 103	en_US
dc.language.iso	en_US	en_US
dc.relation.ispartof	Proceedings of the Tenth International Symposium on Code Generation and Optimization	en_US
dc.rights	Author's manuscript	en_US
dc.title	Automatic speculative DOALL for clusters	en_US
dc.type	Conference Article	en_US
dc.identifier.doi	10.1145/2259016.2259029	-
pu.type.symplectic	http://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceeding	en_US

Files in This Item:

File	Description	Size	Format
AutoSpeculativeDOALLClusters.pdf		412.02 kB	Adobe PDF	View/Download

Show Simple Item Record