sábado, 22 de septiembre de 2012

Saami and Berbers—An Unexpected Mitochondrial DNA Link

Abstract

The sequencing of entire human mitochondrial DNAs belonging to haplogroup U reveals that this clade arose shortly after the “out of Africa” exit and rapidly radiated into numerous regionally distinct subclades. Intriguingly, the Saami of Scandinavia and the Berbers of North Africa were found to share an extremely young branch, aged merely ∼9,000 years. This unexpected finding not only confirms that the Franco-Cantabrian refuge area of southwestern Europe was the source of late-glacial expansions of hunter-gatherers that repopulated northern Europe after the Last Glacial Maximum but also reveals a direct maternal link between those European hunter-gatherer populations and the Berbers.
Because of maternal transmission and lack of recombination, the sequence differentiation of human mtDNA has been generated by only the sequential accumulation of new mutations along radiating maternal lineages. Over the course of time, this process of molecular divergence has given rise to monophyletic units that are called “haplogroups.” Because this process of molecular differentiation occurred mainly during and after the process of human colonization of and diffusion into the different continents and regions, haplogroups and subhaplogroups tend to be restricted to specific geographic areas and population groups (Wallace 1995; Achilli et al. 2004).
Only the founders of the sister superhaplogroups M and N (which includes haplogroup R) (Quintana-Murci et al. 1999) participated in the “out of Africa” exit (Cann et al. 1987; Stringer and Andrews 1988; Cavalli-Sforza et al. 1994; Underhill et al. 2000; Forster et al. 2001) and were successful in colonizing the rest of the Old World. Superhaplogroup N is globally distributed outside Africa, encompassing virtually all of the western Eurasian mtDNA variation, and embraces haplogroup U, nested in haplogroup R. Haplogroup U has an extremely broad geographic distribution that ranges from Europe and North Africa to India and Central Asia and has a very high overall frequency (15%–30%) (Richards et al. 2000; Kivisild et al. 2003; Quintana-Murci et al. 2004)
To assess the nature and extent of haplogroup U variation, we initially sequenced 28 entire U mtDNAs (see authors' Web site and GenBank) from a wide range of populations. These were selected through a preliminary sequence analysis of the mtDNA control region to include the widest possible range of haplogroup U internal variation. A tree of the mtDNA sequences (fig. 1) reveals that haplogroup U first splits into two major subsets, distinguished by the mutation at nt 1811, and that there is a very large number of independent basal branches. Among these, representatives of all known subhaplogroups (U1–U9) were included. However, subhaplogroup U5 provided a rather intriguing result. A Yakut from northeastern Siberia (27 in fig. 1) and a Fulbe from Senegal (29 in fig. 1) harbored mtDNAs that differed at only two coding-region nucleotide positions.
Figure  1

Figure 1

Tree of 39 mtDNA sequences belonging to haplogroup U. The tree, rooted using the reference sequence (rCRS) (Andrews et al. 1999) as an outgroup, illustrates subhaplogroup affiliations. The sequencing procedure and phylogeny construction were performed (more ...)
To investigate this striking similarity, the portion of the tree encompassing these two mtDNAs was enriched by sequencing 11 additional mtDNA sequences (22–26, 28, 30, 32, 34, 36, and 37 in fig. 1) bearing the control-region motif 16270-150, a motif found generally at low frequencies (<2%) in Berber populations and in other African groups (such as the Fulbe) known to have intermingled with Berbers (Rosa et al. 2004).

The motif also shows similarly low frequencies in virtually all European populations, except the Saami of northern Scandinavia, in which it reaches ∼48% (Tambets et al 2004). Because virtually all Saami mtDNAs with 16270-150 harbor the transitions at nts 16189 and 16144 seen in the Yakut mtDNA, three Saami mtDNAs were included among the additional samples.
Seven of the new sequences (one Berber from Algeria, two Italian, one Spanish, and three Saami) clustered into U5b1b, the subclade encompassing the Yakut and Fulbe mtDNAs. The Saami and the Yakut mtDNAs formed a minor branch distinguished only by the transition at nt 16144, the Berber and the Fulbe mtDNAs clustered in a second minor branch also characterized only by control-region mutations, and the Italian and Spanish mtDNAs formed other minor branches.
The average sequence divergence (± SE, computed as per Saillard et al. [2000]) of the 39 coding-region sequences from the root of haplogroup U was 11.4 ± 1.3 substitutions (disregarding indels and pathological mutations), a value which corresponds—according to the mutation-rate estimate of Mishmar et al. (2003)—to a coalescence time estimate of 58.8 ± 6.8 thousand years (ky) for the entire haplogroup U. This value agrees well with the corresponding estimate of 61.6 ± 12.5 ky, based on the hypervariable segment I (HVS-I) mutation rate (Forster et al. 1996), for these 39 mtDNAs. An age of ∼60 ky indicates that haplogroup U arose very soon after the “out of Africa” exit. As for U5, its sequence divergence was 8.1 ± 1.8 substitutions, corresponding to 41.4 ± 9.2 ky, a time estimate in full agreement with its proposed proto-European origin (Richards et al. 2000). It is striking that the sequence divergence of U5b1b, the subclade encompassing mtDNAs from the Saami, Yakut, Berbers, and Fulbe, was 1.7 ± 0.5 substitutions, thus corresponding to only 8.6 ± 2.4 ky.
Such a recent common ancestry of maternal lineages found in populations living as far as 9,000 miles apart and whose anthropological affinities are not at all obvious is, to say the least, unexpected. Can we provide a reasonable explanation? The recent molecular dissection of other mtDNA haplogroups reveals some clues. H1 and H3, two frequent subhaplogroups of H, display frequency peaks centered in Iberia and surrounding populations, including the Berbers of Morocco, and coalescence ages of ∼11 ky (Achilli et al. 2004). Furthermore, their frequency patterns and ages resemble those reported for haplogroup V (Torroni et al. 2001a)—which, similar to U5b1b, is extremely common only in the Saami (together, U5b1b and V encompass almost 90% of the Saami mtDNAs) (Torroni et al. 1996; Tambets et al. 2004).

 Thus, although these previous studies have highlighted the role of the Franco-Cantabrian refuge area as a major source of the hunter-gatherer populations that gradually repopulated much of central and northern Europe when climatic conditions began to improve ∼15 ky ago, the identification of U5b1b now unequivocally links the maternal gene pool of the ancestral Berbers to the same refuge area and indicates that European hunter-gatherers also moved toward the south and, by crossing the Strait of Gibraltar, contributed their U5b1b, H1, H3, and V mtDNAs to modern North Africans.
In conclusion, this study is a paradigmatic example of the power of genetic inference in human-origin and evolutionary studies. It shows that mtDNA data—in this case, at the highest possible level of molecular resolution—can be used not only to evaluate models proposed by other disciplines and based on the direct survey of ancient material but also to identify previously unknown links between populations and geographic areas. Thus, the study of human genetics directly fosters the development of new research avenues in paleontology, archaeology, linguistics, and history.

Acknowledgments

This research was supported by CNR-MIUR Genomica Funzionale-Legge 449/97, Fondo Investimenti Ricerca di Base 2001, the Istituto Pasteur Fondazione Cenci Bolognetti, and Progetti Ricerca Interesse Nazionale 2003.

Electronic-Database Information

Accession numbers and URLs for data presented herein are as follows:
Author’s Web site, http://ipvgen.unipv.it/docs/projects/torroni_data/torroni_sequences.html (for the complete mtDNA sequences)
GenBank, http://www.ncbi.nlm.nih.gov/Genbank/ (for the complete mtDNA sequences [accession numbers AY882379–AY882417])