Hamilton, W.D. (1975), Innate social aptitudes of man: an approach from evolutionary genetics, in R. Fox (ed.), Biosocial Anthropology, Malaby Press, London, 133-53.

Innate Social Aptitudes of Man: an Approach from Evolutionary Genetics

W. D. Hamilton

It has become clear that, although learning has great importance in the normal development of nearly all phases of primate behavior, it is not a generalized ability; animals are able to learn some things with great ease and other things only with the greatest difficulty. Learning is part of the adaptive pattern of a species and can be understood only when it is seen as a process of acquiring skills and attitudes that are of evolutionary significance to a species when living in the environment to which it is adapted.

-- Washburn, Jay, and Lancaster (1965)

SURVIVAL OF THE FITTEST

The phrase directs attention to differential survival. Darwin accepted it from Herbert Spencer as adequately expressing the idea of natural selection. While accusations of tautology seem hardly fair on this small phrase itself, it must be admitted that some descendant ideas on the theory of natural selection are open to attack. For example, the idea of measuring ability to survive and reproduce -- biological 'fitness' -- has undoubtedly been useful, but a slight haziness still lingers, a lack of precise and general definition: we do not know exactly what qualities natural selection is after. I think it is doubt like this rather than doubt about the reality and effectiveness of natural selection that inspires a present spirit of caution in evolutionary biology, including caution and distrust towards Spencer's ideogram whenever it renews aspirations to become a slogan.

A part of the difficulty, and the part I am mainly concerned with now, is that of saying exactly what are the things that natural selection is supposed to select. The fittest what? Is it a trait, an individual, a set of individuals bearing a trait, or bearing its determinants expressed or latent? Can it be a population, a whole species, perhaps even an ecosystem? In such a confusion of possibilities (and of fervent opinions either way) the individual organism stands out as one clear and obvious choice, with the number of its offspring as the measure of its fitness. But, beyond the problem of when to count and how to weight offspring for their ages, there is the problem that in sexual species the individual is really a physical composite of contributions from two parents and it may be composite in slightly different ways for different parts. Moreover, Mendel's principles concerning the fair distribution of genes to gametes and fair competition of these in fertilization do not always hold, so that the set of offspring of a given individual may carry a biased sample from a composite. Does this matter? For safe conclusions, do we have to descend to the level of the individual gene, perhaps ultimately to that of changed or added parts of the replicating molecule? Or can we, on the contrary, confidently follow the consensus of biologists to a higher level, in believing that the generally significant selection is at the level of competing groups and species? I shall argue that lower levels of selection are inherently more powerful than higher levels, and that careful thought and factual checks are always needed before lower levels are neglected. In this I follow a recent critical trend in evolutionary thought (Williams 1966; Lewontin 1970; Hamilton 1972, and references to these works). Incidentally, to a biologist, a rather similar critique seems to be invited by the supposition that cultural evolution is independent of evolution in its biological substratum: to come to our notice culture too have to survive and will hardly do so when by their nature they undermine the viability of their bearers.(Note 1) Thus we would expect the genetic system to have various inbuilt safeguards and to provide not a blank sheet for individual cultural development but a sheet at least lightly scrawled with certain tentative outlines. The problem facing a humane civilization may be how to complete a sketch suggesting some massive and brutal edifice -- say the outlines of an Aztec pyramid -- so that it reappears as a Parthenon or a Taj Mahal. These ideas concerning cultural evolution will not be expanded in what follows, but I hope to produce evidence that some things which are often treated as purely cultural in man -- say racial discrimination -- have deep roots in our animal past and thus are quite likely to rest on direct genetic foundations. To be more specific, it is suggested that the ease and accuracy with which an idea like xenophobia strikes the next replicate of itself on the template of human memory may depend on the preparation made for it there by selection -- selection acting, ultimately, at the level of replicating molecules.

Returning to the problem of units of selection, Darwin himself, vague about the process of heredity, based most of his arguments on considerations of the fitness of individuals. He made occasional exceptions, as for the social insects where he treated the "family group" as the unit of selection. I believe even these limited concessions were incautious (Hamilton 1972), and value his judgement more where, discussing the evolution of courage and self-sacrifice in man, he left a difficulty apparent and unsolved. He saw that such traits would naturally be counter-selected _within_ a social group whereas in competition _between_ groups the groups with the most of such qualities would be the ones best fitted to survive and increase. This open problem which Darwin left is really the starting-point of my own argument, but it is historically interesting to note that after some initial wavering between the calls of Spencer, Kropotkin, and others, almost the whole field of biology stampeded in the direction where Darwin had gone circumspectly or not at all.

Until the advent of Mendelism uncritical acceptance of group selection could be understood partly on grounds of vagueness about the hereditary process. For example, courage and self-sacrifice could spread by cultural contagion and, in so spreading, modify heredity as well. But in the event neither the rediscovery of Mendel's work nor the fairly brisk incorporation of Mendelism into evolutionary theory had much effect. From about 1920 to about 1960 a curious situation developed where the models of 'Neo-Darwinism' were all concerned with selection at levels no higher than that of competing individuals, whereas the biological literature as a whole increasingly proclaimed faith in Neo-Darwinism, and at the same time stated almost all its interpretations of adaptation in terms of 'benefit to the species'. The leading theorists did occasionally point out the weakness of this position but on the whole concerned themselves with it surprisingly little (references in Hamilton 1964, 1971, 1972).

With facts mostly neutral and theory silent it seems that we must look at the events and the 'isms' of recent human history to understand how such a situation arose. Marxism, trade unionism, fears of 'social darwinism', and vicissitudes of thought during two world wars seem likely influences. Confronted with common social exhortations, natural selection is easily accused of divisive and reactionary implications unless 'fittest' means fittest species(man) and 'struggle' means struggle against nature (anything but man). 'Benefit-of-the-species' arguments, so freely used during the period in question, are seen in this light as euphemisms for natural selection. They provide the reader (and evidently often the writer as well) an escape from inner conflict, exacting nothing emotionally beyond what most of us learn to accept in childhood, that most forms of life exploit and prey on one another.

LEVELS OF SELECTION

Often the problem is not acute. There are many traits like resistance to disease, good eyesight, dexterity which are clearly beneficial to individual, group, and species. But with most traits that can be called social in a general sense there is some question. For example, as language becomes more sophisitcated there is also more opportunity to pervert its use for selfish ends: fluency is an aid to persuasive lying as well as to conveying complex truths that are socially useful. Consider also the selective value of having a conscience. The more consciences are lacking in a group as a whole, the more energy the group will need to divert to enforcing otherwise tacit rules or else face dissolution. Thus considering one step (individual vs. group) in a hierarchical population structure, having a conscience is an 'altruistic' character. But for the next step -- group vs. supergroup -- it might be selfish, in the sense that the groups with high levels of conscience and orderly behavior may grow too fast and threaten to overexploit the resources on which the whole supergroup depends. As a more biological instance similar considerations apply to sex ratio, and here a considerable amount of data has accumulated for arthropods (Hamilton 1967).

A recent reformulation of natural selection can be adapted to show how two successive levels of the subdivision of a population contribute separately to the overall natural selection (Price 1972). The approach is not limited to Mendelian inheritance but its usefulness in other directions (e.g. cultural evolution) has not yet been explored.

Consider a population consisting of a mixture of particles, and suppose we are interested in the frequency of a certain kind of particle G. Suppose the particles are grouped: let the subscript s denote the sth subpopulation. For subpopulation and for whole we define parameters relevant to natural selection as follows:

Subpopulation Whole population

Number of particles: n[s] N=SUM(n[s])

Frequency of G: q[s] q=SUM(n[s]q[s]/N)

Mean fitness: w[s] w=SUM(n[s]w[s]/N)

Fitness measures the amount of successful replication of particles in one 'generation'. Thus the total population of the next generation will be N'=SUM(n[s]w[s]). The symbol ' denoting 'next generation' is used again in the same sense in teh following further addition to notation:

Subpopulation Whole population

Change in frequency of G in one generation: dq[s] = q'[s]-q[s] dq[s]=q'-q

With such notation it is easy to derive:

w dq = SUM(n[s]w[s](q[s]-q)/N)+SUM(n[s]w[s]dq[s]/N)

= Covariance(w[s],q[s])+Expectation(w[s],dq[s]) (Eq 1)

where _Covariance_ and _Expectation_ are understood to involve weighting by the n[s] as indicated. This is Price's form.(Note 2) The covariance term represents the contribution of _intergroup_ selection, so quantifying the intuitive notion that high q[s] must cause high w[s] for selective change to occur. The expectation term represents the contribution of _intragroup selection_. It is possible to apply the formula within itself, to expand w[s] dq[s]. For example, if the next level is that of diploid individuals and _si_ indexes the _i_th individual of the _s_th group we have n[s]=SUM(2, q[s]=SUM(2q[si]/n[s]) and w[s]=SUM(2w[si]/n[s]) where these summations are understood to cover all _i_ instead of all _s_ as previously. Then w[s]dq[s] decomposes into two terms, one of which represents ordinary diploid selection with strictly Mendelian inheritance, while the second represents the effects of genetic 'drift' (random sampling effects), and 'drive' (non-Mendelian ratios). Even this latter term can be reformulated using (Eq 1), but then our 'groups' are the fundamental particles themselves which, neglecting mutation, must give dq[particle]=0, so that here finally the second term goes out.

An often useful rearrangement of (Eq 1), which shows the dependence of selection on the variability in its units, introduces the regression coefficient of w[s] on q[s]. If beta[q] is this coefficient:

w dq = beta[1] Variance(q[s])+Expectation(w[s]dq[s]) (Eq 2)

Conceptual simplicity, recursiveness, and formal separation of levels of selection are attractive features of these equations. But, of course, being able to point to a relevant and generally non-zero part of selective change is far from showing that group selection can override individual selection when the two are in conflict. Moreover, even the possibility of devising model circumstances in which a positive group-selection term (first term) outweighs a negative individual selection one (second term, assuming no further levels), gives no guarantee that 'altruism' can evolve by group selection: we have to consider whether the population can get into the specified state, and, if it can, whether its present trend will continue. For example, if we suppose persistent groups with no extinction and no intergroup migration it is easy to arrange that the group-beneficial effect (beta[1]), of frequent altruism in a certain group is so large that the rapid expansion of the group with the highest frequency of G (q[m] say) draws the population _q_ rapidly upwards. But _q_ will never reach or pass q[m] and must eventually fall, remaining below the ever-falling value of q[m]. Admittedly, all this is reasonably obvious without the equation; but the equation does emphasize that natural selection depends on a certain variance which in this model must at least die away as the best group increasingly predominates. This is the essential objection to an algebraic model of Haldane (1932). for selection of altruism, which other writers have wrongly treated as the first successful analytical model for altruism. In verbal discussion Haldane himself admitted the necessity of a device to maintain diversity. He suggested that if groups split on reaching a certain size, random assortment of altruists and egoists would raise the frequency of altruists in some daughter moieties, and if the critical size was low enough and the group advantage of altruism high enough, a process having endless overall enrichment in altruism might be devised. Increasing the intergroup variance by random (or, better, associative) division of existing tribes leaves less variance within groups, which, as a development of the equation will shortly make plain, weakens the power of individual selection, and this further improves the case. But Price's equation does not seem to lend itself to a detailed analysis of Haldane's suggestion -- indeed the lack of analysis by Haldane himself suggests that it is not easy. The value of the covariance approach lies not so much in analytical penetration as in clarifying the approach to a problem.

Therefore, noting hopeful auguries in Haldane's tribe-splitting no-migration idea, let us now turn to a model at the opposite extreme in which groups break up completely and re-form in each generation. Suppose that on reaching maturity the young animals take off to form a migrant pool, from which groups of _n_ are randomly selected to be the group of the next generation. Assume completely asexual reproduction (or perfect matriclinal or patriclinal inheritance of a cultural trait in an ordinary population), and assume that an altruist gives up _k_ units of his own fitness in order to add K units to the joint fitness of his (n-1) companions. These companions are a random selection from the gene pool and therefore, in a supposed infinite population, have the expected gene frequency of the gene pool. Thus compared to a non-altruist, the altruist is putting into the next gene pool far fewer of his own genes plus a random handful from the pool of the last generation. Obviously his trait is not enriching the population with genes that cause the trait. The specification of grouping has been a mere gesture. Nevertheless it is instructive to see how (Eq 2) handles the matter. With asexuality individuals are basic particles, so, as already explained, the recursive use of (Eq 2) to expand the second term gives simply:

w dq = beta[1]Var(q[s])+E{beta[0]Var[s](q[si])}

All units are now of the same size, so _Var(iance)_ and _E(xpectation)_ can have their conventional meanings. Since beta[0] does not vary with group constitution,

w dq = beta[1] Var(q[s])+beta[0]E{Var[s](q[si])} (Eq 3)

and the expectation is what is commonly called the within-group variance.

With random grouping, the distribution of the different compositions of groups will be binomial with parameters (q, n). The variance of q[s] is (1/n)pq. Likewise it is easily shown that E{Var[s](q[si])}=((n-1)/n)pq, so that

w dq=(1/n)p q{beta[1]+(n-1)beta[0]} (Eq 4)

This already shows the characteristically greater power of the lower level of selection as dictated by a ratio of variances that is bound to hold when grouping is random or nearly so.

In a group with _v_ of its _n_ members altruistic fitnesses are as follows:

Group mean Selfish member Altruistic member

1+(v/n)(K-k) 1+v(K/(n-1)) 1-k+(v-1)(K/(n-1))

Thus by inspection and by subtraction respectively

beta[1]=K-k and beta[0]=-k-(K/(n-1)) (Eq 5)

Substituting in (Eq 4) we find:

w dq = -k p q (Eq 6)

This confirms the earlier argument that altruism cannot progress in such a model. It seems at first glance that the benefits dispensed by altruists have been entirely null in the working of the model, but they affect it through their involvement in mean fitness:

w=p+q(K-k)

This being the only involvement of _K_, we see that the most that altruism can achieve in the model is a slowing of the rate at which natural selection reduces its frequency -- an effect which I explained earlier as altruism diluting each new gene mixture by adding, as it were, handfuls taken randomly from the previous one. Apart from this minor effect the model, like Haldane's algebraic one, is a failure, in spite of having shifted to the opposite extreme in respect of migration. It reveals a group selection component which is not zero but which is bound in an unchanging subordination to the individual selection component. However, the relation between the two variances in this case suggests how we must change the model to make altruism succeed: Var(q[s]) must be increased relative to E{Var[s](q[si])}. As already mentioned, this can be done by making G assort positively with its own type in settling from the migrant cloud. Suppose it assorts to such a degree that the correlation of two separate randomly selected members of a group is _F_. If this correlation is achieved by having a fraction _F_ of groups made pure for each type and then the remainder again formed randomly, then it is easily shown that the between and within group variances are respectively:

(1/n)p q(1-F+nF) and (1/n)p q(n-1)(1-F)

Putting these results and those of (Eq 5) into equation (Eq 3) we find as the generalization of (Eq 6):

w dq=p q(FK-k) (Eq 7)

so that the criterion for positive selection of altruism is

K/k > 1/F

Now the model can be made to work. Moreover, the simple form and the independence of group size suggest that the criterion may hold beyond the limits of the rather artificial model discussed here. Careful thought confirms that this is indeed the case: the criterion is completely general for asexual models with non-overlapping generations, and also holds for sexual diploid models when the coefficient F is suitably redefined (Hamilton 1964, 1970). The easiest way to see the basis of generality is to notice that the benefits of altruism do not now fall on a random selection of the population and therefore do not simply enlarge the existing gene pool; instead they fall on individuals _more likely to be altruists_ than are random members of the population. Indeed, the existence of the positive correlation _F_ could be interpreted as implying in this case that there is a chance _F_ that the _K_ units of fitness are definitely given to a fellow altruist, while with chance (1-F) they are given (as they always were in the previous version) to a random member of the population.

The redefinition necessary for diploid organisms involves specifying a regression coefficient b[AB], representing the regression of the genotype of recipient B on genotype of donor A. Often this is the same as the correlation coefficient of such genotypes (it always is so in the haploid case), but where they differ it is the regression coefficient that gives the prediction of gene content that we need. To get the form like (Eq 7) which applies to diploid selection other changes are obviously necessary, notably diving _p q_ by two to get the variance of gene frequency between pairs instead of that between individuals and other more complex changes connected with dominance and details of the assortive process. However, it is striking that a criterion like (FK-k)>0 can be shown to determine positive selection of each genotype, and can be generalized to cover cases where A distributes various effects, positive or negative, to numerous individuals B, C, D, ... all having different regression on A. Including A himself in the list of recipients we arrive at the idea of A's 'inclusive fitness'; his basic non-social fitness, plus all the effets caused by his action when each has been devalued by a regression coefficient.

The usefulness of the 'inclusive fitness' approach to social behavior (i.e. an approach using criteria like (b[AB]K-k)>0) is that it is more general than the 'group selection', 'kin selection', or 'reciprocal altruism' approaches and so provides an overview even where regression coefficients and fitness effects are not easy to estimate or specify. As against 'group selection' it provides a useful conceptual tool where no grouping is apparent -- for example, it can deal with an ungrouped viscous population where, owing to restricted migration, an individual's normal neighbours and interactants tend to be his genetical kindred.

Because of the way it was first explained, the approach using inclusive fitness has often been identified with 'kin selection' and presented strictly as an alternative to 'group selection' as a way of establishing altruistic social behavior by natural selection (e.g. Maynard Smith 1964; Lewontin 1970). But the foregoing discussion shows that kinship should be considered just one way of getting positive regression of genotype in the recipient, and that it is this positive regression that is vitally necessary for altruism. Thus the inclusive-fitness concept is more general than 'kin-selection'. Haldane's suggestion about tribe-splitting can be seen in one light as a way of increasing intergroup variance and in another as a way of getting positive regression in the population as a whole by having the groups which happen to have most altruists divide most frequently. In this case, the altruists are helping true relatives. But in the assortive-settling model it obviously makes no difference if altruists settle with altruists because they are related (perhaps never having parted from them) or because they recognize fellow altruists as such, or settle together because of some pleiotropic effect of the gene on habitat preference. If we insist that group selection is different from kin selection the term should be restricted to situations of assortation definitely not involving kin. But it seems on the whole preferable to retain a more flexible use of terms; to use group selection where groups are clearly in evidence and qualify with mention of 'kin' (as in the 'kin-group' selection referred to by Brown 1973), 'relatedness' or 'low migration' (which is often the cause of relatedness in groups), or else 'assortation', as appropriate. The term 'kin selection' appeals most where pedigrees tend to be unbounded and interwoven, as is so often the case with man.

Although correlation between interactants is necessary if altruism is to receive positive selection, it may well be that trying to find regression coefficients is not the best analytical approach to a particular model. Indeed, the problem of formulating them _exactly_ for sexual models proves difficult (Hamilton 1964). One recent model that makes more frequent group extinction the penalty for selfishness (or lack of altruism) has achieved rigorous and striking conclusions without reference to regression or relatedness (Eshel 1972). But reassuringly the conclusions of both this and another similar model (more general but less thorough and much less well explained; Levins 1970) are of the general kind that consideration of regression leads us to expect. The regression is due to relatedness in these cases, but classified by approach these were the first working models of group selection.

TRIBAL FACIES OF SOCIAL BEHAVIOR

One of the conclusions of the models just mentioned is that with a grouped population the migration between groups is crucially important in determining the general level which altruism can reach within a group. This is something which should now seem fairly obvious but which has been surprisingly overlooked in most discussions of group selection previous to Eshel's. The less migration there is the more relatedness will build up within groups. This will permit selection of acts with low gain ratio (i.e. ratios like K/k) but the gain ratios must always exceed one, and this means that the act must actually aid group fitness in some way -- reduce its chance of sudden extinction (Levins 1970; Eshel 1972), or increase its rate of emission of migrants (Wright, mention in various papers from 1945 to 1965). With the last eventuality it is better for altruism if the migrants get together in small groups to found new colonies than if they all enter existing groups, since entering undermines the assumption of low migration -- in other words reduces intergroup variance. If groups of founder migrants are assortive so much the better, although if they are so by coming all from the same parent group this could be treated as fission. Likewise if migrant acceptance is the established mode, so much the better if groups selectively accept altruists. The ability of animals to exercise such discrimination may seem dubious when behavior even in man is rather indefinite in this respect, but it is noticeable that with many of the tight-knit groups of social carnivores and primates the would-be immigrant does go through a probationary period of hostile treatment and low status, which sometimes terminates his attempt to join (Altmann & Altmann 1972). Similar phenomena of possibly similar significance are certainly not lacking in man, witness the harsh requirements of achievement and service for an aspiring Amerindian brave (or neophyte British doctor for that matter) and the general suspicion, hostility, and low position accorded to wealthless immigrants. I should add here that the idea that such behavior is natural in man does not mean that it is right or even sensible under modern conditions. For example, the immigrants may bring new skills and aptitudes, a point to which i return later. And as regards 'altruism', recent tribal immigrants are likely to be net importers of this precious stuff -- themselves the losers when they expose their natural communistic generosity to civilized exploitation. On the other hand, when experience of ambient guile and cupidity has taught them better, such immigrants may learn to confine this generosity again among themselves and to turn outwards a contemptuous and unsympathetic attitude which is also typically tribal; but such expected ambivalence in tribal feeling is another matter to which I must return.

I have carefully spoken of 'migration' rather than 'migration rate' so far, and in so doing intend to emphasize that it is the number of acts of successful migration that is important for mean intragroup relatedness. The size of demes may matter surprisingly little. An indigenous villager may know some of his many connections with other villagers and be aware of a plexus of relationship through the misty past. What might surprise him (as it surprised me) is that relatedness as measured here (and as manifested in physical similarity) builds up just as much eventually in a large unit, say a remote town, as it does in a village, if the same actual numbers enter and leave each generation. In other words, connections which the remote townsman does not so easily know of make up in multiplicity what they lack in close degree. Of course, a large unit usually does have more migration, and consequently less intrarelatedness, but the important thin is that it is the number of migrants rather than the size of colonies that determines this. For Wright's simple island model where migrants go anywhere among infinitely many colonies the approximate formula for mean intragroup relatedness (after migration has occured) is very simple, b=1/(2M+1) where _M_ is the number of migrants (assumed small) per subpopulation per generation. So with one migrant exchanged every other generation we find b=1/2, the same as for siblings in a panmictic population, and we therefore expect the degree of amicability that is normally expressed between siblings. If three migrants go (and three come) every generation we get b=1/7. This is slightly more than the relatedness of outbred cousins (b=1/8), so such colonies should be slightly more intra-amicable than groups of cousins would be. If, as normally happens, migrants tend to go mainly to neighbouring populations, then emigrant and immigrant genotypes will tend to correlate and so a given level of relatedness can be maintained with more migration. However, recent achievements with the analysis of the harder stepping-stone model (Kimura and Weiss 1964; Maruyama 1972), which covers the island model as a special case, show that the difference is not very great as regards own-deme relatedness. Consider the case where the colonies are supposed spread on the plane in a square lattice. Suppose that an act of migration is either 'distant', with the migrant going to any deme among the infinitely many, as in the island model, or 'close' with the migrant going to one of the four neighboring demes, and suppose that the odds on events of the two kinds are specified. With odds 100:1 for 'distant' to 'close' M=.5 leads to b=.5 within colonies to a good approximation: in other words, the change produced by such a small amount of local migration is negligible. If the odds are reversed to 100:1, implying local migration much more probable than distant migration, the Kimura-Weiss solutions show that the relatedness only rises to .68. With 10 times as much migration (i.e. about 5 exchanged per deme per generation) the corresponding relatednesses are .09 and .17, so relatedness still only doubles when migration is local rather than distant. A much greater contrast is apparent in the relatedness of individuals of neighbouring demes: when distant migration preponderates this relatedness tends to be extremely small, but when close migration preponderates members of a neighbour deme can easily have more than half the relatedness that applies to an own-deme member. And up to a point increasing migration _reduces_ the contrast between own and neighbour deme, so that there are genetical as well as cultural reasons why, in man, intergroup migration and marriage should decrease intergroup hostility.

Two other points seem worth making about the stepping-stone model. One is that, in the one-dimensional version of this model which could apply to demes in a linear habitat such as a coastline or river, relatedness holds up much more strongly as local migration is increased, and relatedness to neighbour-deme more strongly still (relative to the two-dimensional case). This means that hardly any extra hostility is expected to members of neighbouring demes. From this point of view, a sea-shore phase of hominid evolution, if it occurred, should have been particularly harmonious. The other point concerns the distribution of gene frequencies. The apparent variability of colonies is expected to change rather sharply at certain critical levels of migration. These are M=.5 for the island model and M=1 for the two-dimensional stepping-stone model with close migration predominant. This means that at about the point where the colony members are related to each other like outbreds sibs it should become relatively easy for individuals to detect a fairly clar difference in appearance when comparing fellow colony members with outsiders. Actually, in the stepping-stone model the possibilities with regard to patchiness and cline-like effects are complex, but, considering simultaneously several traits which are independently inherited and at most weakly selected, the complex overlap of patterns should make possible fairly accurate separation of 'us' and 'them' at the level of colonies. We shall shortly see why natural selection might favour motivation and ability so to discriminate.

What is happening to the ordinary families embedded in these supposedly endogamous colonies? Sibilings, parents, and offspring will still be the individual's closest relatives. Owing to the inbreeding, their relatedness will be above the value of 1/2 that applies under outbreeding. Thus an individual should be more altruistic than usual to his immediate kin. But other neighbours who are not immediate kin are now also closely related, and it is this reduced contrast between neighbours and close kin that will give what is probably the most striking effect: we expect less nepotistic discrimination and more genuine communism of behavior. At the boundary of the local group, however, there is usually a sharp drop in relatedness. If migrants (or whole groups) are very mobile, leading to an 'island' rather than a 'stepping-stone' situation, this drop may be such as to promote active hostility between neighbouring groups.(Note 3) Even though these groups have some relatedness, as practical limitations to distant migration naturally ensure, the contrast is still such that a minor benefit from taking the life of an outsider would make the act adaptive. Recent studies on hunting dogs (Kuhme 1965; Lawick-Goodall & Lawick-Goodall 1971) and hyenas (Kruuk 1972), show strangers sometimes being killed, while within-the-group relations are usually amicable and even communistic. The most serious wounding which Lawick-Goodall (1971) recorded in her study of chimpanzees occurred when two males combined to attack a male of another group. Bygott (1972) witnessed a fierce attack by a group of male chimpanzees on a stranger female. The female escaped but the males caught and ate her infant. Trespassers may sometimes be killed in wolves (Mech 1972), and in rats (Lorenz 1966). In lions (Schaller 1972), langurs (Sugiyama & Parthasanathy 1969), and probably in rats and mice there is also killing of strange young, but this is probably in a rather different category because it is done only by males whose aim seems to be to sire new offspring on the mothers they bereave.

These phenomena are reminiscent of the intercolony hostility so often observed in social insects, where again actual killing may be frequent along the frontiers. With regard to relatedness, the situation is the same except that intrerelatedness in groups is usually due to all colony members being descendants of a single queen. But polygynic ants (e.g. the common red ants of the genus _Myrmica_) may approximate the breeding structure of group hunting carnivores rather well, and this tempts one to apply the super-organism concept often used for social insects to the cooperative social mammals. Such a view would compare the killing of occasional trespassers to the occasional minor wound with death of cells which occurs in the restrained fighting between, say, two individual dogs.

The basis for thinking that group-hunting carnivores are highly related within groups is the known low rate of migration and the reluctance with which migrants are accepted. Why such reluctance? The probable reason has already been touched on: the group has a cooperative job to do, necessary for its survival. This job is the hunting and killing of prey which are too large for one individual to tackle alone. The more work is invested in a task prior to its fruition the more worthwhile a parasitic option of behavior becomes -- at least, until parasites are too numerous (Hamilton 1971). And the more cooperation is involved in any endeavour the more scope there is for the inconspicuous idler. What is to stop a hunting dog from watching the hunt from afar and trotting up, by all possible short cuts, just after the prey has been killed? Probably this has happened and probably groups over full of the offspring of such idlers have found themselves unable to kill prey and have died out. This would give a slow selection for features of pack behavior (either cultural or genetic) that make infiltration progressively more difficult. Simply cutting down on immigration would have the desired effect through raising relatedness but, as mentioned earlier, a selective entrance requirement, with the applicant's behavior watched through a probationary period, would be even better. It should be mentioned here that a development which closed a group's frontiers completely would probably also fail in the long run for reasons of general adaptation: complete inbreeding abandons the obviously important advantages of sexual reproduction, whatever these are (see Bischof, above).

Roughly, as we currently see it, a cunning ape-like creature once pushed boldly out from near niches now held by baboon and chimpanzee. Whether or not (as one quite plausible view holds) it first left its less enterprising cousins to take a holiday on the seashore, eventually it reappears on the African savannah participating in the Pleistocene wild-life bonanza as a group-hunting carnivore. In spite of the now-not-so-prehensile foot which it kept all the while in the door of an omnivorous diet, it seems likely that this creature would have needed the same population structure as the other group-hunting carnivores and for the same but more urgent reasons. It is difficult to see what was the first factor in the escalation in cunning of this particular primate line, but the choice seems mainly between tools and language. The great benefits that these could confer to a cooperative hunter through improved technique and organization would ensure rapid selection for their development. But they would also affect the social situation in significant ways and indirectly this might escalate their selection: (a) both would provide extra cultural clues to group identification; (b) tools (and later other valued artefacts) would give further scope for parasitical behavior, first intragroup but later between groups as well. Tools and possessions could be appropriated instead of made. With language in rapid evolution, learning experience, and even intelligence would become increasingly open to parasitism. Meanwhile, increasing intelligence would make possible a very plastic approach to parasitical and altruistic behavior, which in turn would increase the complexity of the semi-serious deception and coalition games which are so characteristic of behavior within primate groups. Real rewards in food and mating are the incentive to this activity and thus escalate the selection for skill in play. The main point is that intelligence, plus (a) and (b), plus what has been explained about the real differentiation of genetical relatedness suggest the development of an explosive situation. Close frontiers to migrants a little more, or slightly increase group mobility, and it is possible to imagine the sudden success of a policy which makes any frontier incident an occasion for an attempt at violent incursion by the more populous group with losers killed, enslaved, or driven off. Successful occupation of the captured territory would soon bring the victors into contact with still less related 'stones' of the 'stepping-stone' lattice, which they could attack adaptively with even less reason for restraint. Increasing foresight would mean that a group would not necessarily wait until large enough to _need_ neighbouring territory, if attacking a weak group while it is weak helped to ensure that space could be occupied as needed. Increasing ability to abstract and generalize would enable groups to reanimate their intragroup coalition games in the more serious intergroup context. The usual and firmest coalitions would be between related groups, as is the case with the coalitions of individuals (usually males) in wild turkeys (Watts and Stokes 1971), lions (Schaller 1972) and chimpanzees (Lawick-Goodall 1971). Are group fights necessarily more serious for the species if, on the analogy of the super-organism, we are allowed to equate a few deaths to a minor wound? Perhaps not, and of course it is possible that making groups more aggressive would not 'melt' the lattice structure to the extent suggested. Moreover, groups might be units in super-groups that are themselves in a 'stepping-stone' lattice. In such class warfare -- for that is the behavior we now survey -- might carry over from intragroup behavior (or itself spontaneously develop) quite orderly restrained procedures involving little loss of life.

In developing this admittedly speculative outline of certain cultural and genetic processes in tribal evolution, I confess a bias towards discovering the patterns of coalitions, warfare, language, contempt, and so on that are documented in certain remote peoples of the present day, for example the Yanomamo (Chagnon 1968) and various New Guinea highlanders (Matthiessen 1962; Rappaport 1971). Admittedly in these cases there is agriculture; it is possible to claim that most hunter-gatherers are more peaceable. For example why not aim to derive the customs of the Kalahari Bushmen? But most hunter-gatherers are certainly less peaceable than Bushmen. The record of human violence goes back far indeed, even if the earliest attributions (Dart’s cases in Australopithecus_: Ardrey 1961) are doubtful. A trace of homology with the sporadic violence of chimpanzees, actual violence towards outsiders would contrast with restrained violence, or mere threats, used within the group, while within the group too there would be much sharing and cooperation. One neanderthal skeleton of Shanidar Cave had bone damage suggesting a stab wound (Solecki 1971). Another skeleton also had bone defects but of quite different implication: one forearm was lacking, perhaps from birth, certainly for a long time, and a healed injury to the skull showed that one eye was blind. Goodall’s chimpanzees were part hostile, part sympathetic, and part indifferent to comrades sudddenly crippled with polio: they did nothing positive to support them. In contrast, the neanderthals of Shanidar evidently supported a cripple, and on his death they buried him in a cave where, in other graves, they also sometimes buried their dead with flowers. These hints of violence and loyalty and (perhaps most purely human) of incipient love of things for themselves evoke a startlingly familiar and sympathetic portrait. Considering only the same affectional attributes in the present-day tribal and pastoral Kurds (as opposed to attributes connected to the ever-accelerating change in material culture), a recording angel perhaps notes today much the same events in Shanidar Cave as he noted an ice age ago (Hamilton 1937; Solecki 1971).

Probable instances of cannibalism in Sinanthropus and Homo neanderthalensis have been plausibly compared to similar recent cannibalism in New Guinea (Blanc 1961). In New Guinea, it is interesting to note, this practice acted as a kind of population control, since by eating the brain of his victim, it was believed, the headhunter won a name for a child of his own – in effect, won a birthright. Other usually less drastic beliefs and practices affecting fertility that are widespread in human cultures may help to explain how they manage to be as peaceful as they are. In so far as the practices amount to effective birth control, they cut warfare at its demographic root. Unfortunately, it is possible that in doing so they also cut an important link that has escalated the selection for intelligence (Keith 1945; Bigelow 1969; Alexander 1971; Andreski 1971). No hunt needs quite so much forethought or ability to communicate complex instructions as does a war, nor do such drastic demographic consequences hinge on the outcome.

The rewards of the victors in warfare obviously increase for peoples past the neolithic revolution. There are tools, livestock, stores of food, luxury goods to be seized, and even a possibility for the victors to impose themselves for a long period as a parasitical upper class. Hunter-gatherers, on the other hand, at most win only mates and land. It might seem that these things would not repay the expected cost of the fighting, but it has to be remembered that to raise mean fitness in a group either new territory or outside mates have to be obtained somehow. The occurrence of quasi-warlike group interactions in various higher primates(Kummer 1968; Sugiyama & Parthasanathy 1969; and references in Bigelow 1969) strongly suggests that something like warfare may have become adaptive far down in the hominid stock. These primate examples suggest the prototype war party as an all-male group, brothers and kin, practised as a team in successful hunting and at last redirecting its skill towards usurping the females or territory of another group. Out of such cells can be built the somewhat less stable organism of the post-neolithic army. The Homeric Illiad gives a vivid inside view of the process of coalition, while the siege it describes emphasizes the existence of economic surpluses supporting the warriors on both sides (something hunter-gatherer warriors would never have). If the male war party has been adaptive for as long as is surmised here, it is hardly surprising that a similar grouping often reappears spontaneously even in circumstances where its present adaptive value is low or negative, as in modern teenage gangs (Patrick 1973).

Whether or not the neolithic revolution brought an increase in the per capita incidence of violence it does seem that from then on warfare looms larger in the affairs of men. The situation seems reflected in the fact that only one of various series of pre-neolithic cave paintings depicts warfare (Pericot 1961), whereas for most early civilizations the earliest known written records are records of warfare, booty, captives, and the like.

It has been argued that warfare must be a pathological development in man, continually countered by natural selection, and this claim is sometimes based on a sweeping a priori view that habits of mortal intraspecific fighting must always endanger the survival of a species (Lorenz 1966). While endorsing such a view as regards wars between the few frightfully armed super-powers of today, I see no likelihood for it as regards fighting of individuals or of groups up to the level of small nations. Of course, for the species as a whole, and in the short term, war is detrimental from the biological demographic point of view, but, as shown above and elsewhere, detriment to the species does not mean that a genetical proclivity will not spread. Anyway, what is bad at one level may be good at another and the cost to the species may be paid in the long run. The gross inefficiency of warfare may be just what is necessary, or at least an alternative to birth control and infanticide, in order to spare a population’s less resilient resources from dangerous exploitation. Maybe if the mammoth-hunters had attacked each other more and the mammoths less they could be mammoth-hunters still. And the rich ice-age fauna of the Americas might have had time to adapt to the human predator as it adapted in Africa if fighting had induced man to draw his curtain of overkill across the continent less rapidly. Many examples in the living world show that a population can be very successful in spite of a surprising diversion of time and energy into aggressive displays, squabbling, and outright fights. The examples range from bumble bees to European nations. In case all this reads like a paean for fascism let me add one caution from the geological record. Arms and armour seem to weigh down in the end: it is hard in the modern world to see warfare as a stabilizing influence for man.

The relatively peaceable Bushmen may tell us something valuable about the aetiology of wars, but I am doubtful if they tell us much about the role of this factor in the main stream of human evolution. However it is noteworthy that the Bantu who replaced the khoisaniform races in much of the rest of Africa were warlike in ways that evoke comparisons from the dark ages of Europe. The Bantu were, of course, mainly pastoralists and agriculturalists, for whom, as stated above, booty would be an important additional incentive to warfare. Pastoralists tend to be particularly warlike and the histories of civilization are punctuated by their inroads. Pastoral tribes have to be mobile in following or driving their herds and this mixes tribes and reduces relatedness of neighbours. Viewed as booty, the mobility of stock is a great convenience. Both factors must contribute to the warlike propensity.

The incursions of barbaric pastoralists seem to do civilizations less harm in the long run than one might expect. Indeed, two dark ages and renaissances in Europe suggest a recurring pattern in which a renaissance follows an incursion by about 800 years. It may even be suggested that certain genes or traditions of pastoralists revitalize the conquered people with an ingredient of progress which tends to die out in a large panmictic population for the reasons already discussed. I have in mind altruism itself, or the part of the altruism which is perhaps better described as self-sacrificial daring. By the time of the renaissance it may be that the mixing of genes and cultures (or of cultures alone if these are the only vehicles, which I doubt) has continued long enough to bring the old mercantile thoughtfulness and the infused daring into conjunction in a few individuals who then find courage for all kinds of inventive innovation against the resistance of established thought and practice. Often, however, the cost in fitness of such altruism and sublimated pugnacity to the individuals concerned is by no means metaphorical, and the benefits to fitness, such as they are, go to a mass of individuals whose genetic correlation with the innovator must be slight indeed. Thus civilization probably slowly reduces its altruism of all kinds, including the kinds needed for cultural creativity (see also Eshel 1972).

RECIPROCATION AND SOCIAL ENFORCEMENT

The last suggestion is rather different from saying, as has sometimes been said, that civilization selects against all kinds of creative intelligence. It seems to me that there are some aspects of innate intelligence that civilization steadily promotes. Mercantile operations, for example, are an inseparable part of Old World civilizations and need complex models in the minds of their operators, just as military ventures do. The main difference is in more emphasis on prudence and less on daring. It is probable that civilization has given steady selection for the intelligence needed for this mercantile kind of preparatory modelling. The intelligence that gives a good appreciation of the real principles involved in a new technology, as opposed to seeing it as a kind of magic, is probably also constantly favoured, since improvers of a technology avoid the arrows of contempt and penury that face pioneers and can do very well. However, my main reason for turning to the subject of trade is to introduce the idea of another kind of positive social arrangement which thrives in a mercantile and technological atmosphere, for which intelligence is more necessary and relatedness much less so. This refers of course to reciprocation.

Starting perhaps with something like the meat-sharing of chimpanzees (‘feed me while you have plenty and I’ll feed you when I have plenty’), proceeding through barter (where differing aptitudes may begin to be important), reciprocative activity branches out into all the various business-like arrangements of modern man. The key words are ‘client’ and ‘partner’ as opposed to ‘kinsman’ and ‘friend’.

Establishing a basis for reciprocating has problems of natural selection closely similar to those of altruism as discussed so far. It is very frequently necessary for one party to execute his half of a bargain without any way of being certain that the other party will later stick to his. The best response if the other does not reciprocate is to cut off any further benefits to him (Trivers 1971). Unfortunately, this leaves the selfish non-reciprocator better off than the ‘altruistic’ initiator and unless the two are related this is against the habit of reciprocation, at least when the trait is rare (see discussion of Prisoner’s Dilemma in Hamilton 1971; also Boorman & Levitt 1973). However, this initial barrier to selection is a slight one if the rewards of the interaction are high, as they would be when an advantageous exchange can be repeated many times (Trivers 1971; Boorman & Levitt 1973). Once the barrier is passed by genetic drift or the like, non-reciprocation finds itself in the category of maladaptive spite – harming the self to harm others more. Nepotistic altruism, of course, also has an initial barrier to pass when it first occurs by mutation. But once positive selection supervenes the resemblance between the two situations fades: reciprocal altruism of the kind described is less purely altruistic. Indeed, the term altruism may be a misnomer: there is an expectation of benefit of the initiating individual, not just an expectation of benefit to the genotype. TO put the matter another way, reciprocal altruism can never be suicidal, whereas suicidal nepotistic altruism can and has evolved – it is apparent, for example, in worker sacrifices in the social insects.

Whether recirprocation involves altruism or not, we see that in so far as it involves repeated acts between the same two individuals this useful and immensely varigating type of interaction can spread genetically, given only an ability to remember individual faces of those who have helped and those who have cheated in the past. Unforunately, by the very aid it gives to the growth and diversification of social systems, reciprocation tends to undermine the basis of its success. Situations demanding reciprocation just once between individuals destined never to meet again naturally become more common and it becomes easier for cheaters to specialize in these and to hide from retribution. Cheating can also become more subtle, especially along lines which make it hard for victims to be sure just who has cheated them.

In considering this problem I think there may be reason to be glad that human life is a ‘many-person game’ and not just a disjoined collection of ‘two-person games’. Admittedly, it may not seem so at first. At first reding the theory of many-person games may seem to stand to that of two-person games in the relation of sea-sickness to a headache. But given also a little real intragroup altruism endowed from the tribal past, it may turn out that the one is at least a partial cure for the other (see Figure 1). The idea here is that for pairs in isolation the problem of cheating in a single exchange may be insoluble, and that therefore we have all evolved, more or less in proportion to our exposure to civilization (i.e. relatively panmictic) conditions, into potential cheaters. But at the same time we also have every reason to agree as to the parasitical nature of cheating as it affects the welfare of the community as a whole and to deplore its successful practice by others. So detection of cheating arouses indignation in everyone except the accused, and everyone sees a benefit to both group and self in trying to punish the cheater and in forcing restitution (some part of which, as an added incentive, may be diverted as a fee for those who administer the collective justice). The reason I believe that a little real intragroup altruism is also necessary for the evolution of efficient justice on these lines is that individuals must feel the difference between the usefulness of this behavior and the futility of using collective power arbitrarily in ways profitless to the group. A healthy society should feel sea-sick when confronted with the endless internal instabilities of the ‘solutions’, ‘coalition sets’, etc., which the theory of many-person games has had to describe. One hears that game theorists, trying to persuade people to play even two-person games like ‘Prisoner’s Dilemma’, often encounter exasperated remarks like: ‘There ought to be a law against such games!’ Some of the main points of this paper can be summarized as an answer to this comment: that often, in real life, there is a law, and we can see why, and that sadly we also see the protean nature of this Dilemma, which, when suppressed at one level, gathers its strength at another.

Notes

Fox (1967) has also emphasized this in discussing human kinship systems and why some which are easily conceivable never actually occur. His discussion of the incest taboo is also very pertinent to the idea that follows.
Price (1970) first pointed out the generality and usefulness of this relation but earlier partial recognition of it seems to be due to Robertson (1966).
As pointed out elsewhere (Hamilton 1970, 1971), even spiteful behavior, harming oneself in order to harm another more, is a theoretical possibility. The mean relatedness to the entire species population other than self is –1/(N-1) where N is the population. If by inbreeding or otherwise a colony has grouped n identical genotypes together, then relateness to the average outsider is –1/(N-n). Thus with only a few large long-isolated groups spite is more possible.
The game-theoretic situation known as Prisoner’s Dilemma has as paradigm the dilemma of a criminal in custody who is offered the inducement of a light sentence if he will give state evidence to clear up an important crime, while knowing that a confederate, also in custody, has the same offer and that the pay-off situation which both face is as follows, arranged from lightest to heaviest sentence: (c) I confess, he doesn’t > (a) We neither confess > (d) We both confess > (b) He confesses, I don’t. So that (c) can be better than (a) it is assumed that they can certainly be convicted of some minor offence whereas at least one confession is needed in order to settle the major crime. For further information on this ‘game’ see Rapoport & Chammah (1965).

References

ALEXANDER, R. D. 1971. The search for an evolutionary philosphy of man. _Proceedings of the Roy Society of Victoria 84: 99-120.

ALTMANN, S. A. & ALTMANN, J. 1972. Baboon Ecology, African Field Research. New York: S. Karger.

ANDRESKI, S. 1971. The case for war. Science Journal 7 (1):89-92

ARDREY, R. 1961. African Genesis. London: Collins.

BIGELOW, R. 1969. The Dawn Warriors. London: Hutchison.

BLANC, A. C. 1961. Some evidence on the ideologies of early man. In: S. L. Washburn (ed.), The Social Life of Early Man. London: Methuen.

BOORMAN, S. A. & LEVITT, P. R. 1973. A frequency dependent natural selection model for the evolution of social cooperation networks. Proceedings of the National Academy of Sciences (USA) 70: 187-189

BROWN, J. L. 1973. Alternative routes to sociality in jays. American Zoologist 12: 642.

BYGOTT, J. D. 1972. Cannibalism among wild chimpanzees. Nature 238: 410-411.

CHAGNON, N. A. 1968. Yanomamo: The Fierce People. New York: Holt, Rinehart & Wilson.

Eshel, I. 1972 On the neighbour effect and the evolution of altruistic traits. Theoretical Population Biology 3: 258-277.

FOX, R. 1967. Kinship and Marriage. London: Penguin

HALDANE, J. B. S. 1932. The Causes of Evolution. New York City: Cornell University Press 1966.

HAMILTON, A. M. 1937. Road through Kurdistan. London: Faber and Faber.

HAMILTON, W. D. 1964. The genetical evolution of social behavior. I and II Journal of Theoretical Biology 7: 1-52.

1967. Extraordinary sex ratios. Science 156: 477-488.
1970. Selfish and spiteful behavior in an evolutionary model. Nature, London 228: 1218-1220.
1972. Selection of selfish and altruistic behavior in some extreme models, Chapter 2 pp. 57-91. In : J. F. Eisenberg and W. S. Dillon (eds.), Man and Beast: Comparative Social Behavior. Washington, D.C.: Smithsonian Press.
1972. Altruism and related phenomena, mainly in social insects. Annual Review of Ecology and Systematics 3: 1`93-232.

KEITH, A. 1946. Essays on Human Evolution. London: Watts.

KIMURA, M., WEISS, G. H. 1964. The stepping stone model of population structure and the decrease of genetic correlation with distance. Genetics 49: 561-576.

Remaining citations to be included.