
Enabling Factor Analysis on Thousand-Subject Neuroimaging Datasets

Author(s): Anderson, Michael J.; Capotă, Mihai; Turek, Javier S.; Zhu, Xia; Willke, Theodore L.; et al.

To refer to this page use: http://arks.princeton.edu/ark:/88435/pr12q93
Full metadata record
DC Field | Value | Language
dc.contributor.author | Anderson, Michael J. | -
dc.contributor.author | Capotă, Mihai | -
dc.contributor.author | Turek, Javier S. | -
dc.contributor.author | Zhu, Xia | -
dc.contributor.author | Willke, Theodore L. | -
dc.contributor.author | Wang, Yida | -
dc.contributor.author | Chen, Po-Hsuan | -
dc.contributor.author | Manning, Jeremy R. | -
dc.contributor.author | Ramadge, Peter J. | -
dc.contributor.author | Norman, Kenneth A. | -
dc.date.accessioned | 2019-10-28T15:53:48Z | -
dc.date.available | 2019-10-28T15:53:48Z | -
dc.date.issued | 2016 | en_US
dc.identifier.citation | Anderson, Michael J, Capotă, Mihai, Turek, Javier S, Zhu, Xia, Willke, Theodore L, Wang, Yida, Chen, Po-Hsuan, Manning, Jeremy R, Ramadge, Peter J, Norman, Kenneth A. Enabling Factor Analysis on Thousand-Subject Neuroimaging Datasets. 10.1109/BigData.2016.7840719 | en_US
dc.identifier.uri | http://arks.princeton.edu/ark:/88435/pr12q93 | -
dc.description.abstract | The scale of functional magnetic resonance image data is rapidly increasing as large multi-subject datasets are becoming widely available and high-resolution scanners are adopted. The inherent low-dimensionality of the information in this data has led neuroscientists to consider factor analysis methods to extract and analyze the underlying brain activity. In this work, we consider two recent multi-subject factor analysis methods: the Shared Response Model and Hierarchical Topographic Factor Analysis. We perform analytical, algorithmic, and code optimization to enable multi-node parallel implementations to scale. Single-node improvements result in 99x and 1812x speedups on these two methods, and enable the processing of larger datasets. Our distributed implementations show strong scaling of 3.3x and 5.5x respectively with 20 nodes on real datasets. We also demonstrate weak scaling on a synthetic dataset with 1024 subjects, on up to 1024 nodes and 32,768 cores. | en_US
dc.language.iso | en_US | en_US
dc.relation.ispartof | 2016 IEEE International Conference on Big Data (Big Data) | en_US
dc.rights | Author's manuscript | en_US
dc.title | Enabling Factor Analysis on Thousand-Subject Neuroimaging Datasets | en_US
dc.type | Journal Article | en_US
dc.identifier.doi | doi:10.1109/BigData.2016.7840719 | -
dc.date.eissued | 2016 | en_US
pu.type.symplectic | http://www.symplectic.co.uk/publications/atom-terms/1.0/journal-article | en_US

Files in This Item:
File | Description | Size | Format
1608.04647v2.pdf |  | 906.8 kB | Adobe PDF


Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.