Skip to main content

FST and kinship for arbitrary population structures I: Generalized definitions

Author(s): Ochoa, Alejandro; Storey, John

Download
To refer to this page use: http://arks.princeton.edu/ark:/88435/pr13r0pt2p
Full metadata record
DC FieldValueLanguage
dc.contributor.authorOchoa, Alejandro-
dc.contributor.authorStorey, John-
dc.date.accessioned2023-12-14T18:57:11Z-
dc.date.available2023-12-14T18:57:11Z-
dc.date.issued2016-10-27en_US
dc.identifier.citationOchoa, Alejandro, Storey, John. (2016). FST and kinship for arbitrary population structures I: Generalized definitions. 10.1101/083915en_US
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/pr13r0pt2p-
dc.description.abstractAbstract F ST is a fundamental measure of genetic differentiation and population structure, currently defined for subdivided populations. F ST in practice typically assumes independent, non-overlapping subpopulations , which all split simultaneously from their last common ancestral population so that genetic drift in each subpopulation is probabilistically independent of the other subpopulations. We introduce a generalized F ST definition for arbitrary population structures, where individuals may be related in arbitrary ways, allowing for arbitrary probabilistic dependence among individuals. Our definitions are built on identity-by-descent (IBD) probabilities that relate individuals through inbreeding and kinship coefficients. We generalize F ST as the mean inbreeding coefficient of the individuals’ local populations relative to their last common ancestral population. We show that the generalized definition agrees with Wright’s original and the independent subpopulation definitions as special cases. We define a novel coancestry model based on “individual-specific allele frequencies” and prove that its parameters correspond to probabilistic kinship coefficients. Lastly, we extend the Pritchard-Stephens-Donnelly admixture model in the context of our coancestry model and calculate its F ST . To motivate this work, we include a summary of analyses we have carried out in follow-up papers, where our new approach has been applied to simulations and global human data, showcasing the complexity of human population structure, demonstrating our success in estimating kinship and F ST , and the shortcomings of existing approaches. The probabilistic framework we introduce here provides a theoretical foundation that extends F ST in terms of inbreeding and kinship coefficients to arbitrary population structures, paving the way for new estimators and novel analyses.en_US
dc.language.isoen_USen_US
dc.relation.ispartofArXiven_US
dc.rightsAuthor's manuscripten_US
dc.titleFST and kinship for arbitrary population structures I: Generalized definitionsen_US
dc.typeJournal Articleen_US
dc.identifier.doidoi:10.1101/083915-
pu.type.symplectichttp://www.symplectic.co.uk/publications/atom-terms/1.0/journal-articleen_US

Files in This Item:
File Description SizeFormat 
Fst_kinship_arbitrary_generalized_definitions.pdf6.09 MBAdobe PDFView/Download


Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.