A | B | C | D | English |
ka | ka | ko | zimi | ‘we’ |
nami | name | nom | soo | ‘two’ |
paluma | paroma | polm | jai | ‘eye’ |
litiki | redege | riik | enefo | ‘mother’ |
kutusu | kodozo | kuts | vaha | ‘water’ |
panu | pano | pon | haaz | ‘eat’ |
In the comparative method, we're looking for regular phonological correspondences (which I'll abbreviate to RPCs). That is, given a set of words from each of the languages in question, we want to see if there is a regular pattern of phonemes that occurs between the items in the set. For example, look at the words for 'we' and 'water'. In all three of languages A, B, and C, these words begin with the phoneme k-. Likewise, in the words for 'eye', and 'eat', all the words in A, B, and C all begin with the phoneme p-. And in the word for 'two', we find all the words in A, B, and C beginning in n-. These are all examples of RPCs.
The phonemes don't all have to be the same to be regular. In the words for 'eye' and 'mother', language A has -l- whereas language B has -r-. Likewise, in 'we', 'two', 'eye', and 'eat, languages A and B have -a- where language C has -o-. These correspondences are just as regular as the k-, p-, and n- correspondences because they recur in multiple sets of words.
Now, what about language D? A close comparison between the different phonemes in language D's words and those of languages A, B, and C, reveals no meaningful correspondences; this suggests that D is not genetically related, or at least not closely genetically related, to languages A, B, and C, which, due to their numerous correspondences, seem to be very closely genetically related.
Now, a set of 6 words is not nearly enough to make conclusions about genetic relationships. Ideally, several hundred word sets should be selected, including not just nouns, verbs, or adjectives, but grammatical items such as plural markers on nouns, person, number, and tense markers on verbs, prepositions, different pronoun forms, etc. The more functional the items examined (those parts of the vocabulary which contribute more structure than meaning), the more solid the conclusions that can be drawn about genetic relationships.
Now, as an exercise for you the reader, here is a set of data from 5 European languages. Many of you will be familiar with at least some of these languages, but I'm going to change their spelling to make them slightly less recognizable, and also to reflect their true pronunciation a little better. Your task is to examine the word sets, look for regular phonological correspondences, and make a guess as to which languages are genetically related to which others (or which are more closely related to which, if it seems that they are all ultimately related).
A | B | C | D | E | English |
ik | io | zhuh | ih | yo | ‘I’ |
maan | luna | lyn | | luna | ‘moon’ |
zyster | sorella | sur | shvester | ermana | ‘sister’ |
vyyr | fwoko | fuh | foyer | fwego | ‘fire’ |
akht | otto | wit | akht | ocho | ‘eight’ |
vut | piede | pie | fus | pie | ‘foot’ |
drinken | bere | bwar | trinken | bever | ‘drink’ |
Bonus question: to which of the above do you think English is most closely related? What phonological correspondences can you name between English and that (or those) languages? Remember to concentrate on pronunciation, not spelling, since how a word is spelled may not reflect its pronunciation accurately.