At least, no other species seems to have songs in which musical intervals (small whole-number ratios between pitches) play a role. No other species seems to create rhythmic patterns by subdivisions of a beat.
There might even be some related evolutionary specialization of the auditory system -- at least, the just noticeable difference in pitch perception is about an order of magnitude smaller for humans than for other mammals.
Then again, maybe this is going too far. What about the songs of whales or songbirds? Don't they have real melodies? Don't apes sometimes beat out rhythms?
The point is, no one seems to get worked up about this question. Books and articles are not written about whether the songs of whales or sparrows are Truly Music. There is plenty of study, scientific and otherwise, of the vocal displays of whales and songbirds. However, the interest is in what animal songs are like -- what they sound like, how they are made, how they are learned, what they are used for -- and not whether they basically the same as, or basically different from, human music.
By comparison, there is enormous and continuing controversy about whether various sorts of animal behavior, both natural and taught by human trainers, are Truly Language.
For some reason, people find it more interesting -- either more attractive or more threatening -- to think of animals as having command of language, than to think of them as having command of music. The debate often seems to take on a quasi-religious tone, as if the issue were whether animals had souls rather than whether they have languages.
As with many such controversies, it's hard to stay out of the fight. Nevertheless, we are going to try, in part by staying away from the vexed case of apes and sign language, and focusing on natural communication among animals.
The Canid Play Bow is an example of what ethologists call a display, meaning roughly a salient pattern of behavior.
Displays are modulated in intensity or clarity of presentation, with similarly gradient implications for their "meaning." For instance, the eastern kingbird has a vocalization said to sound like "zeer" that indicates possible aggressive intent. The sharpness and harshness of the call varies, and the probability that the animal will actually attack increases as the sharpness/harshness of the call increases.
Sometimes displays are quite specific and complex in their behavioral structure and also in their apparent function. Other displays have a specific form, but their function -- or meaning -- that is more difficult to define. For instance, the blacktailed prairie dog has a display called the "Jump-Yip," which involves throwing the forepaws into the air, pointing the nose straight up, and emitting an abrupt two-part vocalization. The Jump-Yip seems to mean "I'm trying to decide whether to run away or do something else that might be more important to me." A bit less than half the time, a Jump-Yip is followed by flight, but there always seems to be an alternative: to make sexual advances to another prairie dog, to challenge a neighbor aggressively, to take a dust bath, or whatever. The display is only used when there is an alternative to flight, but the alternative could be just about anything.
More often, displays are fragments of behavior that may
be combined in various ways, and may shade gradually into what may be considered
an entirely different display. For example, here is one ethologist's version
of the system of dogs' body language, of which the "play bow" is one part:
Changes in body postures and facial expressions by which dogs indicate their feelings and intentions to other dogs.
There are other cases, for instance warning calls, where displays may have a range of fairly specific referential content. For instance, vervet monkeys give one kind of alarm call when they see a snake, another kind when they see a leopard, and a third kind when they see an eagle. This set of referential distinctions is useful, because different kinds of defensive behavior are appropriate in the three cases.
In response to the snake call, the troupe of vervets will all stand up on their hind legs in the open and look around on the ground -- to find the snake. If the predator were an eagle or a leopard, standing up in the open would be very unwise.
In response to the leopard warning call, the members of the troupe run up to the top of the nearest tree, where the heavy leopard can't follow them. This would be just the wrong thing to do to avoid an eagle, and not optimal for a snake either.
In response to the eagle warning call, the members of
the troupe run into a nearby bush or under the lower branches of a nearby
tree. Since this is where leopards hide while stalking, and is as likely
as anywhere to be on top of a snake, this behavior is again a good response
only to the type of predator that the particular call signals.
We can try to make a more complete list of functions. Here is one, adapting from a text on the topic:
Other displays develop from ritualization of intention
movements, i.e. incomplete acts, such as turning away or starting
to fly. A related example is the development of frowning from protective
eyebrow-lowering, which presumably was originally part of more general
preparations for attack or defense. Movements may be redirected or displaced
from their normal context, repeated unnecessarily, etc. as part of ritualization.
All vervet monkeys use the same set of warning cries. However, the particular creatures that evoke them vary from place to place, depending on the local types of predators. Baby vervets start out pretty undiscriminating, giving "eagle" warning calls to any reasonably large bird, including those that are completely harmless, and "leopard" warning calls to any largish terrestrial animal, including grazing animals, anteaters etc. They gradually learn -- apparently from the reactions of their elders -- how not to 'cry wolf.'
Most animals learn to respond appropriately to the warning cries of other species that happen to live near them.
Some species -- for instance some songbirds -- have "dialects", in which some properties of their vocalizations or other displays are learned, during a critical period early in each animal's life. The resulting pattern of geographical variation in the vocal displays then plays a role in social structure -- just like human shibboleths. It is said that such dialect variation tends to occur (in songbirds) just in case the species colonizes a wide range of microhabitats, where each local group becomes especially adapted to their local conditions, and will benefit from breeding within the group to maintain the adaptations intact.
A more elaborate form of social evolution occurs in the songs of humpback whales. Male humpbacks "sing" long (half-hour) and complex "songs", apparently as part of lek behavior. also called arena behavior.
Lek behavior is a kind of courtship exhibited by many species of insects, birds and mammals. A number of males gather in a defined area, each in his own piece of the space, and engage in competitive mating displays. Females cruise through the arena, observe the displays, and choose a mate based on their evaluation.
In the case of the humpbacks, the male display appears to be a song. Humpback songs have been recorded (by Naval sonar engineers) since the WW II era. As a result, the following curious pattern has been observed. At any given time, all the humpbacks in a given region are singing basically the same song. Individual performances may differ in small ways -- an extra trill here, a note left out there, a section repeated somewhere else -- but the basic pattern is the same. Over time, the pattern evolves, until after a year or so, it is completely changed. All the males are still singing the same song, but it is a completely different song than it was before.
After about 50 years of recordings, the old songs have never come back into fashion. The evolution appears to be open-ended.
Humpbacks around Bermuda sing completely different songs from humpbacks around Hawaii, even though they seem to be entirely the same species.
As far as anyone can tell, the patterns in the songs have no meaning at all. They are just abstract patterns. Pure style, no content -- in this sense, at least, like human melodies.
It remains unclear to what extent the development of the song is a matter of social construction -- everyone following a gradually shifting fashion -- and to what extent it is implicit in a given stage of the song -- that is, the "next thing" is obvious to any humpback with an ounce of style.
In any case, it is tempting to analyze this phenomenon by reference to the human phenomenon of fashion. You can -- and must -- distinguish yourself by a slightly individual interpretation of the current style, but last season's stuff is totally lame, and so is some random set of choices that indicate you're completely out of touch with the culture.
The best-studied case is mouse grooming.
Mice use their forepaws to spread secretions from oil glands near their eyes. They do this via a rapid series of motions, which may be forehand or backhand, up or down, from the midline to the side or vice versa, etc. The particular order of such gestures has no functional consequence, as long as there are enough of each type to spread the secretions around.
Any given mouse uses a stereotyped sequence of such gestures -- which can be appropriately described with a finite-state grammar -- but different mice use different sequences. Genetically identical mice use identical sequence patterns.
Thus this is a case where a grammar-like organization of behavior is found, though there seems to be no communicative significance.
The organization of humpback whale songs is different in some ways -- it changes rather than being genetically fixed, and other whale seem to be paying attention to its structure. However, whale song syntax is similar to mouse grooming syntax, in that (as far as we know!) the structure does not seem to have any significance other than itself.
The hypothesis advanced about this constancy of repertoire size is that there is a sort of dynamic equilibrium involved -- old displays tend to be lost as new ones are added. Presumably there are some pressures to prevent the accumulation of displays: they may become ecologically irrelevant; there may be too much investment required in the brain circuitry involved; it may be confusing to have too many "lexical items" in the system, lessening the effectiveness of each.
One clear evolutionary puzzle is why -- if communication systems are as useful as we generally think they are -- the rest of the animal kingdom seems to have put so little emphasis on elaborating them beyond this size of a few tens of hard-wired elements.
The basic issues are the phonological principle and the elaboration of a large (and variable) lexicon; and the existence of a hierarchical compositional semantics, where the meaning of the whole is a function of the meaning of the parts in some way more complex than simple summation of the correlates of independent displays.