Saami and Berbers—An Unexpected Mitochondrial DNA Link
This article has been cited by other articles in PMC.
Abstract
The
sequencing of entire human mitochondrial DNAs belonging to haplogroup U
reveals that this clade arose shortly after the “out of Africa” exit
and rapidly radiated into numerous regionally distinct subclades.
Intriguingly, the Saami of Scandinavia and the Berbers of North Africa
were found to share an extremely young branch, aged merely ∼9,000 years.
This unexpected finding not only confirms that the Franco-Cantabrian
refuge area of southwestern Europe was the source of late-glacial
expansions of hunter-gatherers that repopulated northern Europe after
the Last Glacial Maximum but also reveals a direct maternal link between
those European hunter-gatherer populations and the Berbers.
Because
of maternal transmission and lack of recombination, the sequence
differentiation of human mtDNA has been generated by only the sequential
accumulation of new mutations along radiating maternal lineages. Over
the course of time, this process of molecular divergence has given rise
to monophyletic units that are called “haplogroups.” Because this
process of molecular differentiation occurred mainly during and after
the process of human colonization of and diffusion into the different
continents and regions, haplogroups and subhaplogroups tend to be
restricted to specific geographic areas and population groups (Wallace 1995; Achilli et al. 2004).
Only the founders of the sister superhaplogroups M and N (which includes haplogroup R) (Quintana-Murci et al. 1999) participated in the “out of Africa” exit (Cann et al. 1987; Stringer and Andrews 1988; Cavalli-Sforza et al. 1994; Underhill et al. 2000; Forster et al. 2001)
and were successful in colonizing the rest of the Old World.
Superhaplogroup N is globally distributed outside Africa, encompassing
virtually all of the western Eurasian mtDNA variation, and embraces
haplogroup U, nested in haplogroup R. Haplogroup U has an extremely
broad geographic distribution that ranges from Europe and North Africa
to India and Central Asia and has a very high overall frequency
(15%–30%) (Richards et al. 2000; Kivisild et al. 2003; Quintana-Murci et al. 2004)
To assess the nature and extent of haplogroup U variation, we initially sequenced 28 entire U mtDNAs (see authors' Web site and GenBank)
from a wide range of populations. These were selected through a
preliminary sequence analysis of the mtDNA control region to include the
widest possible range of haplogroup U internal variation. A tree of the
mtDNA sequences (fig. 1)
reveals that haplogroup U first splits into two major subsets,
distinguished by the mutation at nt 1811, and that there is a very large
number of independent basal branches. Among these, representatives of
all known subhaplogroups (U1–U9) were included. However, subhaplogroup
U5 provided a rather intriguing result. A Yakut from northeastern
Siberia (27 in fig. 1) and a Fulbe from Senegal (29 in fig. 1) harbored mtDNAs that differed at only two coding-region nucleotide positions.
Figure 1
Tree of 39 mtDNA sequences belonging to haplogroup U. The tree, rooted using the reference sequence (rCRS) (Andrews et al. 1999) as an outgroup, illustrates subhaplogroup affiliations. The sequencing procedure and phylogeny construction were performed (more ...)
To
investigate this striking similarity, the portion of the tree
encompassing these two mtDNAs was enriched by sequencing 11 additional
mtDNA sequences (22–26, 28, 30, 32, 34, 36, and 37 in fig. 1)
bearing the control-region motif 16270-150, a motif found generally at
low frequencies (<2%) in Berber populations and in other African
groups (such as the Fulbe) known to have intermingled with Berbers (Rosa
et al. 2004).
The motif also shows similarly low frequencies in virtually all European populations, except the Saami of northern Scandinavia, in which it reaches ∼48% (Tambets et al 2004). Because virtually all Saami mtDNAs with 16270-150 harbor the transitions at nts 16189 and 16144 seen in the Yakut mtDNA, three Saami mtDNAs were included among the additional samples.
The motif also shows similarly low frequencies in virtually all European populations, except the Saami of northern Scandinavia, in which it reaches ∼48% (Tambets et al 2004). Because virtually all Saami mtDNAs with 16270-150 harbor the transitions at nts 16189 and 16144 seen in the Yakut mtDNA, three Saami mtDNAs were included among the additional samples.
Seven of the new sequences (one
Berber from Algeria, two Italian, one Spanish, and three Saami)
clustered into U5b1b, the subclade encompassing the Yakut and Fulbe
mtDNAs. The Saami and the Yakut mtDNAs formed a minor branch
distinguished only by the transition at nt 16144, the Berber and the
Fulbe mtDNAs clustered in a second minor branch also characterized only
by control-region mutations, and the Italian and Spanish mtDNAs formed
other minor branches.
The average sequence divergence (± SE, computed as per Saillard et al. [2000])
of the 39 coding-region sequences from the root of haplogroup U was
11.4 ± 1.3 substitutions (disregarding indels and pathological
mutations), a value which corresponds—according to the mutation-rate
estimate of Mishmar et al. (2003)—to
a coalescence time estimate of 58.8 ± 6.8 thousand years (ky) for the
entire haplogroup U. This value agrees well with the corresponding
estimate of 61.6 ± 12.5 ky, based on the hypervariable segment I (HVS-I)
mutation rate (Forster et al. 1996),
for these 39 mtDNAs. An age of ∼60 ky indicates that haplogroup U arose
very soon after the “out of Africa” exit. As for U5, its sequence
divergence was 8.1 ± 1.8 substitutions, corresponding to 41.4 ± 9.2 ky, a
time estimate in full agreement with its proposed proto-European origin
(Richards et al. 2000).
It is striking that the sequence divergence of U5b1b, the subclade
encompassing mtDNAs from the Saami, Yakut, Berbers, and Fulbe, was 1.7 ±
0.5 substitutions, thus corresponding to only 8.6 ± 2.4 ky.
Such
a recent common ancestry of maternal lineages found in populations
living as far as 9,000 miles apart and whose anthropological affinities
are not at all obvious is, to say the least, unexpected. Can we provide a
reasonable explanation? The recent molecular dissection of other mtDNA
haplogroups reveals some clues. H1 and H3, two frequent subhaplogroups
of H, display frequency peaks centered in Iberia and surrounding
populations, including the Berbers of Morocco, and coalescence ages of
∼11 ky (Achilli et al. 2004). Furthermore, their frequency patterns and ages resemble those reported for haplogroup V (Torroni et al. 2001a)—which,
similar to U5b1b, is extremely common only in the Saami (together,
U5b1b and V encompass almost 90% of the Saami mtDNAs) (Torroni et al. 1996; Tambets et al. 2004).
Thus, although these previous studies have highlighted the role of the Franco-Cantabrian refuge area as a major source of the hunter-gatherer populations that gradually repopulated much of central and northern Europe when climatic conditions began to improve ∼15 ky ago, the identification of U5b1b now unequivocally links the maternal gene pool of the ancestral Berbers to the same refuge area and indicates that European hunter-gatherers also moved toward the south and, by crossing the Strait of Gibraltar, contributed their U5b1b, H1, H3, and V mtDNAs to modern North Africans.
Thus, although these previous studies have highlighted the role of the Franco-Cantabrian refuge area as a major source of the hunter-gatherer populations that gradually repopulated much of central and northern Europe when climatic conditions began to improve ∼15 ky ago, the identification of U5b1b now unequivocally links the maternal gene pool of the ancestral Berbers to the same refuge area and indicates that European hunter-gatherers also moved toward the south and, by crossing the Strait of Gibraltar, contributed their U5b1b, H1, H3, and V mtDNAs to modern North Africans.
In conclusion, this study is a
paradigmatic example of the power of genetic inference in human-origin
and evolutionary studies. It shows that mtDNA data—in this case, at the
highest possible level of molecular resolution—can be used not only to
evaluate models proposed by other disciplines and based on the direct
survey of ancient material but also to identify previously unknown links
between populations and geographic areas. Thus, the study of human
genetics directly fosters the development of new research avenues in
paleontology, archaeology, linguistics, and history.
Acknowledgments
This
research was supported by CNR-MIUR Genomica Funzionale-Legge 449/97,
Fondo Investimenti Ricerca di Base 2001, the Istituto Pasteur Fondazione
Cenci Bolognetti, and Progetti Ricerca Interesse Nazionale 2003.
Electronic-Database Information
Accession numbers and URLs for data presented herein are as follows:
Author’s Web site, http://ipvgen.unipv.it/docs/projects/torroni_data/torroni_sequences.html (for the complete mtDNA sequences)
GenBank, http://www.ncbi.nlm.nih.gov/Genbank/ (for the complete mtDNA sequences [accession numbers AY882379–AY882417])