R1b1c7

 

A short note by

 

Iain Kennedy

 

Last update March 24th, 2008

 

 

Copyright © 2007, 2008 Iain Kennedy

Every living man has a Y-chromosome inherited from his father. It is usually handed down as an exact copy but from time to time a benign mutation occurs – the wrong nucleotide base is selected. From these events come our genetic diversity. The mutations divide men up into groups, each one of which is defined by the mutation that spawned it. Several layers of grouping exist since within one mutated branch further mutations can occur.

 

Scientists have a nomenclature for these groups, which they call ‘haplogroups’ based on an alternating sequence of letters and numbers. The original haplogroup ‘A’ came into being in Africa; the commonest western European group is a sub-division of group ‘R’ called ‘R1b’. R1b and R1a split off from the parent R1 tens of thousands of years ago. There is much controversy about the date and time of these splits – whatever I were to write here as the current latest theory would quickly get outdated. Maybe it happened in the Zagros mountains in what is now Iran/Iraq.

 

How are we supposed to work out these locations? Scientists tell us we have to look at two parameters: preponderance and diversity. Simple logic may dictate that the group should occur most heavily where it originated. But since it was there before it was anywhere else, it is easy to see that it should also have evolved or mutated more there than anywhere else too since it has had more time to do so.

 

Normally we measure the diversity with a slightly different yardstick. Some of the genetic material is in the form of repeated ‘arrays’ of base letters eg GATGATGAT and so on. These repeats also suffer mutational error not in the base letters but in the number of times it is repeated. For example 3xGAT might get copied as 4xGAT and so forth. These mutations happen more often than the first kind I described and so work as a finer yardstick gradation.

 

Note that what we don’t want to do is use the same yardstick to define the group and measure its internal diversity! Whilst it is possible to use these ‘tandem repeats’ (or ‘STRs’) to estimate the groups which are defined by the base letter mutations, it would not be wise to then use the same yardstick to work out how varied that group is.

 

It can be useful to draw in haplogroup density maps over geographic ones. Sometimes an interesting pattern may emerge, like a gradient or ‘cline’. A group may be heaviest on one side of a landmass or continent and gradually decline as you cross it. You might think that there is a single and even simple explanation for these clines, but there isn’t.

 

For example the heaviest concentration of R1b is in western Ireland. Good grief, Ireland is the cradle of civilization! But didn’t we just suggest that R1b originated in the Middle East? Why isn’t it most concentrated there? Unfortunately the answer has already been indicated; there isn’t a single explanation for genetic cline.

 

Note that it would patently be absurd to study the migration of R1b by overlaying it on a map of Ireland, even if I hadn’t already mentioned the Zagros mountains. We want to see it in as wide a context as possible, subject to the budget of our study.

 

Further caution must be applied if we use calculations to work out the most recent common ancestor (TMRCA) of our genetic samples:

 

‘this is a thorny and highly contentious issue, on the cusp of population genetics and phylogenetics. The date of an MRCA is a purely genetic estimate and need not have any obvious correlate in the other prehistorical records. This date does not represent the timing of a migration that spread the chromosomes carrying those haplotypes … the TMRCA is typically much older than the population in which they are found’ (1).

 

In fact the origin in the Zagros mountains was determined in the first case by diversity measure – variation of STRs within the R1b haplogroup. It is currently thought that R1b spread out from there, eventually reaching western Ireland where it is most dominant; the dropping cline as we move back across Europe is due to replacement by other later haplogroups. If you compare the STR diversity of R1b in Ireland and Iraq the former will be much lower than the latter.

 

Let us now turn our attention to R1b1c7, defined by the mutation called ‘M222’ (the letters which name these mutations represent the scientist who first describes them and are nothing to do with the haplogroup nomenclature), of which I am a member. How do we work out where and when it originated? By following the exact same technique with the same caveats.

 

So we will NOT

(i)                  draw a map of a single country and start overlaying R1b1c7 distribution on it

(ii)                estimate people to be R1b1c7 using STRs then measure the diversity and thus age of ‘R1b1c7-estimated’ using STRS too

(iii)               only quote one possible interpretation for any geographic cline we find

 

As of late 2007, the nearest we have come to respectable analysis of R1b1c7 is by Sykes in his recent book ‘Blood of the Isles’. He at least took samples throughout Scotland and compared them with customer data of his from Ireland. The analysis by Moore et al. (sometimes referred to as ‘McEvoy’) [6] is poor for a number of reasons; his hasty conclusion at the end that there is a link with the semi-historical figure of Niall of the Nine Hostages, coupled with falling back on Capelli’s ultra-low resolution Scottish data. Capelli had earlier sampled Scotland at various points on a grid he imposed on a map of the country but decided to omit the Scottish Highlands altogether. Analysis by amateurs is flawed by self-selection and a determination to interpret everything through an Irish prism, and the constant claims that the group only exists in lowland Scotland. The lone voice of reason amongst amateurs is John McLaughlin of the United States, who has pointed out the lack of a logical explanation of how R1b1c7 could get from Donegal to SW Scotland without also appearing heavily in Ulster, where it is less prominent; or why Irish and/or Scottish Dal Riata should share a genetic motif with the Ui Neill when there is no pedigree link between them. None of the ‘experts’ have ever responded to this, but are content to paint their results tables a lurid green colour as if everything is settled. It cannot be emphasized enough that a test that is now marketed as an ‘Irish’ DNA test will attract people who suspect or wish to verify they are from Ireland. All an attempt to plot their results in a table achieves is to demonstrate that a lot of Irish people have taken the test! The huge ranges quoted for the age of R1b1c7 show that it could far predate the Ui Neill and maybe go back as far as the first settlers in Ireland and Scotland.

 

In the meantime most of the research, scholarly or amateur, is concentrated on Ireland or on Scottish clans with an Irish origin tradition, who the same Irish researchers can look up in the traditional but often suspect Irish annals and other genealogies. This means that Scottish clans or families who lack such a tradition, like the Kennedys, must probably await lower level SNPs splitting M222 into sub-trees so that work can continue on such issues as how our ancestors moved around and between the lowlands and highlands. In the case of the Highland Kennedys which of course interests me most, the jury is still out on their origins, either surname-based or genetic.

 

Breaking news 24th March 2008

 

As we await low level SNPs which will sub-divide R1b1c7, the next best thing may have arrived – a SNP directly above it, which will probe the deep history of the group. This SNP was revealed by comparing results of early testers at DecodeMe. Over the last few days interest in the SNP built up to such an extent that pioneering testing company Ethnoancestry, now a wholly UK-based organization, offered the individual test for just £20. I immediately signed up for it and will report the results back here as soon as I get them. It is expected that I will be positive (derived) for the mutation. The results may tell us more about when M222, the marker for R1b1c7, occurred, or at least set a limit on the date.

 

Probably of more use to the Kennedy study will be the results of our first tester with proven ancestry from Co. Donegal. Of course R1b1c7 is a minority even in its ‘heartland’ so it is by no means certain if he will be in this group. If he is, interpretation will need to proceed with great caution; it will be more important than ever to distinguish between connections before and after the adoption of the Kennedy surname. Hopefully my ongoing work in the Glasgow Poor Law records, which give the exact place of birth of each applicant and hold over 50 Kennedys from Co. Donegal, will give me a better idea of the impact Donegal has had on the Kennedys as a group.

 

References

 

(1)   Human Evolutionary Genetics (Jobling, Hurles, Tyler-Smith, 2004) p182

 

(2)   John McLaughlin’s incisive analysis of the problems with the current theories of the origins of R1b1c7 can be found here.

 

(3)   Bryan Sykes’ book Blood of the Isles.

 

(4)   The opening message from David Wilson’s R1b1c7 rootsweb mailing list immediately excludes anyone from the Scottish Highlands (and thus Dal Riata [modern day Argyllshire]!).

 

(5)   Even Highland Scots get their results painted Shamrock Green on John McEwan’s results page

 

(6)   A Y-chromosome signature of hegemony in Gaelic Ireland, American  Journal of Human Genetics 78 (2006), L. Moore, B. McEvoy, E. Cape, K. Simms, D. Bradley

 

(7)   Cenel Conaill and the Donegal Kingdoms, Brian Lacey, Dublin, 2006. A recent re-examination of the origins of Niall Noigiallach and the so-called ‘northern Ui Neill’.