Enabling Factor Analysis on Thousand-Subject Neuroimaging Datasets

Anderson, Michael J.; Capotă, Mihai; Turek, Javier S.; Zhu, Xia; Willke, Theodore L.; Wang, Yida; Chen, Po-Hsuan; Manning, Jeremy R.; Ramadge, Peter J.; Norman, Kenneth A.

To refer to this page use: http://arks.princeton.edu/ark:/88435/pr12q93

Full metadata record

DC Field	Value	Language
dc.contributor.author	Anderson, Michael J.	-
dc.contributor.author	Capotă, Mihai	-
dc.contributor.author	Turek, Javier S.	-
dc.contributor.author	Zhu, Xia	-
dc.contributor.author	Willke, Theodore L.	-
dc.contributor.author	Wang, Yida	-
dc.contributor.author	Chen, Po-Hsuan	-
dc.contributor.author	Manning, Jeremy R.	-
dc.contributor.author	Ramadge, Peter J.	-
dc.contributor.author	Norman, Kenneth A.	-
dc.date.accessioned	2019-10-28T15:53:48Z	-
dc.date.available	2019-10-28T15:53:48Z	-
dc.date.issued	2016	en_US
dc.identifier.citation	Anderson, Michael J, Capotă, Mihai, Turek, Javier S, Zhu, Xia, Willke, Theodore L, Wang, Yida, Chen, Po-Hsuan, Manning, Jeremy R, Ramadge, Peter J, Norman, Kenneth A. Enabling Factor Analysis on Thousand-Subject Neuroimaging Datasets. 10.1109/BigData.2016.7840719	en_US
dc.identifier.uri	http://arks.princeton.edu/ark:/88435/pr12q93	-
dc.description.abstract	The scale of functional magnetic resonance image data is rapidly increasing as large multi-subject datasets become widely available and high-resolution scanners are adopted. The inherent low dimensionality of the information in this data has led neuroscientists to consider factor analysis methods to extract and analyze the underlying brain activity. In this work, we consider two recent multi-subject factor analysis methods: the Shared Response Model and Hierarchical Topographic Factor Analysis. We perform analytical, algorithmic, and code optimization to enable multi-node parallel implementations to scale. Single-node improvements result in 99x and 1812x speedups on these two methods, respectively, and enable the processing of larger datasets. Our distributed implementations show strong scaling of 3.3x and 5.5x, respectively, with 20 nodes on real datasets. We also demonstrate weak scaling on a synthetic dataset with 1024 subjects, on up to 1024 nodes and 32,768 cores.	en_US
dc.language.iso	en_US	en_US
dc.relation.ispartof	2016 IEEE International Conference on Big Data (Big Data)	en_US
dc.rights	Author's manuscript	en_US
dc.title	Enabling Factor Analysis on Thousand-Subject Neuroimaging Datasets	en_US
dc.type	Journal Article	en_US
dc.identifier.doi	doi:10.1109/BigData.2016.7840719	-
dc.date.eissued	2016	en_US
pu.type.symplectic	http://www.symplectic.co.uk/publications/atom-terms/1.0/journal-article	en_US

Files in This Item:

File	Description	Size	Format
1608.04647v2.pdf		906.8 kB	Adobe PDF
