Document Type



Single nucleotide polymorphisms (SNPs) with large allele frequency differences between human populations are relatively rare. The longest run of SNPs with an allele frequency difference of one between the Yoruba of Nigeria and the Han Chinese is found on the long arm of the X chromosome in the intergenic region separating the EDA2R and AR genes. It has been proposed that the unusual allele frequency distributions of these SNPs are the result of a selective sweep affecting African populations that occurred after the out-of-Africa migration. To investigate the evolutionary history of the EDA2R/AR intergenic region, we characterized the haplotype structure of 52 of its highly differentiated SNPs. Using a publicly available data set of 3,000 X chromosomes from 65 human populations, we found that nearly all human X chromosomes carry one of two modal haplotypes for these 52 SNPs. The predominance of two highly divergent haplotypes at this locus was confirmed by use of a subset of individuals sequenced to high coverage. The first of these haplotypes, the α-haplotype, is at high frequencies in most of the African populations surveyed and likely arose before the separation of African populations into distinct genetic entities. The second, the β-haplotype, is frequent or fixed in all non-African populations and likely arose in East Africa before the out-of-Africa migration. We also observed a small group of rare haplotypes with no clear relationship to the α- and β-haplotypes. These haplotypes occur at relatively high frequencies in African hunter-gatherer populations, such as the San and Mbuti Pygmies. Our analysis indicates that these haplotypes are part of a pool of diverse, ancestral haplotypes that have now been almost entirely replaced by the α- and β-haplotypes. We suggest that the rise of the α- and β-haplotypes was the result of the demographic forces that human populations experienced during the formation of modern African populations and the out-of-Africa migration. However, we also present evidence that this region is the target of selection in the form of positive selection on the α- and β-haplotypes and of purifying selection against α/β recombinants.