Next Article in Journal
Instability of a Moving Bogie: Analysis of Vibrations and Possibility of Instability in Subcritical Velocity Range
Previous Article in Journal
Effect of Viscous Dampers with Variable Capacity on the Response of Steel Buildings
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Rhythmic Analysis in Animal Communication, Speech, and Music: The Normalized Pairwise Variability Index Is a Summary Statistic of Rhythm Ratios

by
Yannick Jadoul
1,*,†,
Francesca D’Orazio
1,*,†,
Vesta Eleuteri
2,
Jelle van der Werff
1,
Tommaso Tufarelli
3,
Marco Gamba
4,
Teresa Raimondi
1,‡ and
Andrea Ravignani
1,5,6,‡
1
Department of Human Neurosciences, Sapienza University of Rome, 00185 Rome, Italy
2
Department of Behavioral and Cognitive Biology, University of Vienna, 1030 Vienna, Austria
3
Independent Researcher, Beeston NG9, UK
4
Department of Life Sciences and Systems Biology, University of Turin, 10124 Turin, Italy
5
Center for Music in the Brain, Department of Clinical Medicine, Aarhus University & The Royal Academy of Music Aarhus/Aalborg, 8000 Aarhus, Denmark
6
Research Center of Neuroscience "CRiN-Daniel Bovet", Sapienza University of Rome, 00185 Rome, Italy
*
Authors to whom correspondence should be addressed.
Co-first authors.
Co-last authors.
Vibration 2025, 8(2), 12; https://doi.org/10.3390/vibration8020012
Submission received: 27 November 2024 / Revised: 10 March 2025 / Accepted: 15 March 2025 / Published: 24 March 2025

Abstract

:
Rhythm is fundamental in many physical and biological systems. Rhythm is relevant to a broad range of phenomena across different fields, including animal bioacoustics, speech sciences, and music cognition. As a result, the interest in developing consistent quantitative measures for cross-disciplinary rhythmic analysis is growing. Two quantitative measures that can be directly applied to any temporal structure are the normalized pairwise variability index (nPVI) and rhythm ratios (rk). The nPVI summarizes the overall isochrony of a sequence, i.e., how regularly spaced a sequence’s events are, as a single value. Meanwhile, rk quantifies ratios between a sequence’s adjacent intervals and is often used for identifying rhythmic categories. Here, we show that these two rhythmic measures are fundamentally connected: the nPVI is a summary static of the rk values of a temporal sequence. This result offers a deeper understanding of how these measures are applied. It also opens the door for creating novel, custom measures to quantify rhythmic patterns based on a sequence’s rk distribution and compare rhythmic patterns across different domains. The explicit connection between nPVI and rk is one further step towards a common quantitative toolkit for rhythm research across disciplines.

1. Introduction

1.1. Rhythm in Speech, Music, and Bioacoustics

Most physical and biological systems exhibit transient behavior that changes over time. From simple oscillators to complex emergent behaviors, the temporal evolution of these systems and their components may carry information on their underlying mechanisms. Some complex systems produce discrete events in time: a clock ticking, a dog barking, and a bassist plucking individual strings all produce distinct events with non-random temporal structure. These temporal sequences and their resulting patterns may be described as rhythms. Living organisms are particularly apt at producing and perceiving rhythms because their nervous systems seem to rely on them for various processes, including physiology, locomotion, fine movement, and communication [1,2].
Rhythmic behaviors span wildly different spatial and temporal scales and are of interest to a broad range of scientific disciplines [3,4,5,6,7,8,9,10,11,12]. Three of these disciplines may seem strange bedfellows at first yet feature deep theoretical and methodological commonalities: animal bioacoustics, speech science, and music cognition are all interested in the structural properties of sound as one possible window to study the underlying cognitive mechanisms. Animal bioacoustics focuses on biotic sounds, while the other two disciplines typically focus on human-generated sounds [13]. In all three disciplines, rhythmic structure is most commonly sought in a range spanning millisecond to seconds when zooming in on temporal properties. Because these areas share the common goal of finding rhythm in sound-based systems, their quantitative tools and frameworks are often applicable domain-generally and sometimes exchanged between fields [14].
Theoretical definitions of rhythm abound because of its multidimensionality [15,16,17,18]. One pragmatic approach to rhythm is to use measures that capture the rhythmic properties of a signal; a behavior is then quantified as more or less rhythmic depending on the outcome of the used measure(s). To define and characterize rhythmic behavior, contributions from multiple fields are needed. It is important, among other things, to establish mathematical and statistical characterizations of rhythm. After identifying those measures applicable in animal bioacoustics, speech sciences, and music cognition, a good question is whether some of these measures are orthogonal or unexpectedly related.

1.2. Describing Rhythm Across Disciplines

Speech science, music cognition, and animal bioacoustics share a common interest in the structural properties of sound [19]. A common approach across these domains can support our understanding of quintessentially human behaviors like music, dance, and speech [19,20]. Such an approach can also shed light on the evolutionary origins of music and speech by examining their fundamental building blocks, with the rhythmic structure being pivotal [21,22,23].
Measurement tools applicable to all these fields are crucial to facilitate a cross-disciplinary approach. As such, recent years have seen growing attention towards the development of rhythmic analyses that can be applied to music cognition and speech sciences as well as to animal bioacoustics [14,23,24,25,26,27,28,29,30]. From this perspective, a quantitative measure should be directly applicable to any rhythmic structure, regardless of the context in which rhythm is expressed.
A good starting point for cross-disciplinary rhythmic measures is the basic definition of rhythm as “patterns of events in time” [16]. Adopting such a basic definition is important to avoid assumptions about the structure of language, music, and animal vocalizations, which for the latter often has yet to be established [31,32]. By stripping down a sequence—whether it originates from human speech, music, or animal vocalizations—to its basic temporal structure, one can quantify its rhythmic properties, and draw meaningful parallels across fields [23,24]. One outstanding challenge lies in measuring a multidimensional temporal sequence in a way that captures the rhythmic properties of interest.
Different studies and fields collect drastically different data types, from time series of continuous sensor measurements to sequences of observed behaviors [33,34,35,36]. To apply the type of rhythmic analyses and measures presented here, the first step is identifying the events and intervals of interest to the research question under consideration [14]. This could consist, for example, of finding peaks or abrupt changes in a sensor time series or annotating the onset of different vocalizations in an audio recording. The resulting sequences of events should capture the temporal properties of interest in the original dataset and be stripped of other aspects deemed irrelevant for the current analysis. Existing general rhythmic measures and analyses can then be applied to the resulting temporal sequences. While this approach should by no means exclude other, more specialized analyses of the original data, this transformation into simple temporal sequences allows for the same rhythmic analyses to be applied across a broad range of scientific fields and enables cross-disciplinary comparisons of the results [14].
Below, we provide a short overview of the typical approach to measure the rhythmic properties of such sequences. Note that this is not meant as a complete, in-depth methodological framework for rhythmic analysis; rather, we introduce two commonly used rhythmic measures, and we focus on the relationship between them.

1.3. Measuring Rhythm

Starting from a sequence of discrete events in time, most approaches for rhythmic analysis rely on the intervals between subsequent events (see Figure 1A). If the events have a duration, typically inter-onset intervals (IOIs; i.e., the interval between their onsets) are considered. Working with the duration of inter-event or inter-onset intervals is intuitive, as it highlights temporal patterns rather than the absolute time at which they occur. Additionally, a typical assumption borrowed from music perception studies is that the events’ duration is less relevant for temporal structure than the intervals between their onsets [37,38,39]. Based on the interval durations, other rhythmic measures (such as rhythm ratios, rk; see Figure 1B and below) can be calculated to quantify the intervals’ relationship.
Investigating the distribution of interval durations is often the first step of any rhythm analysis. As illustrated in Figure 1A,C, such a visualization can already reveal initial differences between different types of rhythm and shows whether interval durations are clustered around certain values or continuously distributed. The interval distribution can reveal how different durations are used throughout a sequence, yet it cannot capture any sequential patterns as it neglects the intervals’ order. For example, summary measures of intervals—e.g., their mean, representing the average tempo—reveal little about the detailed rhythmic structure of the sequence. Random drumming or a clock ticking can share the exact same mean interval duration. Consequently, we need measures that capture the relationship between different interval durations and quantify the rhythmic structure in more detail.
Among the various existing rhythmic measures, we focus here on two in particular: the normalized pairwise variability index (nPVI; [40]) and rhythm ratios (rk; [41]; see Figure 1B). As these two measures are generically applicable to any temporal sequence or dataset with multiple sequences, they are both relevant in a cross-disciplinary context. Additionally, as we show below, these two measurements are fundamentally connected, providing deeper insight into the rhythmic properties they measure.

1.4. The Normalized Pairwise Variability Index

As a rhythmic measure, the normalized pairwise variability index (nPVI) focuses on the relationship between pairs of adjacent intervals. More specifically, given the n 1 pairs of adjacent interval durations ( i k and i k + 1 ) in a sequence, the nPVI is defined as
n P V I = 200 n 1 k = 1 n 1 i k i k 1 i k + i k + 1
Broadly speaking, the nPVI value of a sequence measures the level of isochrony, irrespective of tempo, i.e., how close the sequence is to having perfectly equally spaced—or isochronous—events (see Section 2.1 for a detailed explanation of the formula). The smaller the nPVI, the more isochronous a sequence is (e.g., [25]).
The nPVI was initially introduced in linguistics to obtain a more reliable comparison of rhythmic patterns across languages. Grabe and Low [40] proposed the index after finding that, when used to measure variability in durations of adjacent vowels, it strongly correlates to the classic classification of spoken languages into stress-, syllable-, and mora-timed [42,43]. The nPVI is however not specific to linguistic analysis. In fact, by replacing the vowel durations with the duration of intervals between general events, nPVI becomes a rhythmic measure for a broad range of acoustic signals.
In the field of music cognition, this approach has uncovered significant parallels between linguistic and musical rhythm [44]. An analysis of nPVI values in composers’ native languages and their musical works showed that rhythmic variability in music closely mirrors that in language [40], establishing a link between linguistic and musical rhythm [45,46]. Later, this measure was fully adopted to quantify rhythmic changes in musical data in relation to historical trends [47,48], national influences [45,49,50], and compositional styles [51,52].
Similarly, the nPVI has also been applied to analyze complex acoustic signals in animals. It has proven a particularly useful measure for distinguishing more isochronous patterns—marked by low variability between IOIs—from more complex or random ones [23,25]. For example, studies using the nPVI revealed different levels of isochrony in the sounds produced by different fish species [26]. Similarly, the measure was used to quantify isochronous flipper slapping patterns in harbor seals [53].
Despite its utility as a simple, cross-disciplinary, rhythmic measure, the nPVI also has limitations. The resulting value is an average over one or multiple temporal sequences and can oversimplify the complexities of rhythmic patterns and lose potentially crucial within-sequence variation. This is especially important when interval durations follow several distinct categories, e.g., in the context of music. In such cases, the information measured by nPVI can often be summarized using simpler methods, such as counting the proportion of identical successive intervals [54]. However, such methods make more assumptions and are thus less likely to be applicable across contexts and disciplines. All in all, these caveats are part of a necessary tradeoff when capturing the temporal regularity of a sequence in a single and generic metric [23].

1.5. Rhythm Ratios

A second, more recently proposed measure of rhythmic structure is the rhythm ratio (rk; [41]). Similarly to the nPVI, it is calculated based on two adjacent intervals i k and i k + 1 :
r k = i k   i k + i k + 1
Contrary to the nPVI, each r k represents the relationship between two intervals (see Section 2.1) and does not summarize a temporal sequence as a single value. Instead, a temporal sequence will result in a sequence of rk values, and subsequent analyses typically focus on the observed distribution of these values. Figure 1B,C presents an example of the rk values and their associated distributions resulting from four different types of temporal sequences. An rk value of ½ (i.e., 0.5) indicates two successive intervals of equal length (i.e., isochrony); smaller values result from a longer interval following a shorter one, and vice versa. The rk rhythm ratio is particularly useful for certain questions in bioacoustics and music cognition research, where the precise durations of intervals are often less important than the rhythmic patterns that emerge from them. It has also proven useful for characterizing rhythmic behaviors in related disciplines (e.g., language pathology; [55]).
The rk ratio was proposed by Roeske et al. [41] to explore the presence of rhythmic categories in human music and birdsong. They tested the resulting rk distributions for peaks around small-integer ratios; e.g., an rk value of 0.33 corresponds to a pair of intervals where the second one has double the duration of the first (i.e., with intervals following a 1:2 ratio). A significant peak in the distribution around this rk value of 0.33 suggests the presence of two rhythmic categories according to this 1:2 ratio.
The normalization in the rk formula ensures that the measure is tempo-independent, allowing researchers to identify rhythmic categories within temporal sequences and acoustic signals. These categories may correspond to fundamental units that structure speech and music, such as syllables or hierarchical rhythmic units. Specifically, most music genres show interval relationships coinciding with small-integer ratios [56,57,58,59]. Because of this, after its introduction by Roeske et al. [41], the rk measure has been adopted in a broadly cross-disciplinary range of studies. It was applied to test for the presence of small-integer ratios in singing primates [60,61,62,63], non-human apes [64,65,66], and other taxa [53,67,68]. Overall, this method continues to be applied to a large range of animal behaviors, from songs and calls to percussive signals [69] and locomotion patterns [70,71].

2. Results and Discussion

2.1. The Relationship Between the nPVI and Rhythm Ratios

The nPVI and the distribution of rk values originate from different research contexts and are typically applied to complementary research questions. As we outlined above, the nPVI (Equation (1)) typically serves as a simple measure of isochrony, whereas rk values (Equation (2)) are most often applied in search of rhythmic categories. However, looking at the two formulas, we can actually notice some similarities. This is not an accident: here, we mathematically show that the nPVI is a summary statistic of the rk rhythm ratio distribution.
To better understand this, let us revisit the two formulas. Formally, consider a temporal sequence of n + 1 events (or event onsets), at times t 1 ,   t 2 ,   ,   t n + 1 . Logically, there are n intervals between these onsets, i 1 ,   i 2 ,   ,   i n with i k = t k + 1 t k .
The nPVI [40] considers the relative difference between two adjacent intervals i k and i k + 1 in a temporal sequence: the absolute difference between durations i k i k + 1 is divided by their average duration ( i k + i k + 1 ) / 2 . The nPVI then equals the average over all n 1 adjacent intervals pairs, multiplied by 100:
n P V I = 100 n 1 k = 1 n 1 i k i k + 1 i k + i k + 1 / 2 = 200 n 1 k = 1 n 1 i k i k + 1 i k + i k + 1
As described earlier, the lower an nPVI measure, the higher the degree of isochrony: in the case of perfect isochrony, all intervals i k are equal, rendering the whole equation equal to zero. For any other sequence, nPVI can never equal 200 or higher. Values close to 200 are reached by alternating intervals that differ immensely in duration (e.g., if i k     i k + 1 , then i k i k + 1 i k + i k + 1 i k i k = 1 ); when this holds for all adjacent interval pairs in a sequence, nPVI will be almost 200.
The same sequence of n intervals results in n 1 rk values, as given by Equation (2) [41]:
r k = i k i k + i k + 1
Any rk value lies between 0 and 1 [41,72]. Two intervals with the same duration i k = i k + 1 will result in r k = 1 2 , and so, a perfectly isochronous sequence will have a distribution of rk values concentrated at 0.5. Moreover, just like the nPVI, rk values are normalized and independent of the overall tempo and focus on the relative relationship between adjacent intervals.
It is no coincidence that the central part of the nPVI formula, i k i k + 1 i k + i k + 1 , looks similar to the rhythm ratio formula i k i k + i k + 1 . With a few algebraic manipulations, we can rewrite i k i k + 1 i k + i k + 1 as 2 i k i k + i k + 1 1 2 and find a straightforward mathematical connection between these two measures:
n P V I = 400 n 1   k r k 1 2
In other words, we find that the nPVI is in fact a summary statistic of a sequence’s rk values. More precisely, the nPVI quantifies in a single value how much, on average, the measured rk values in a sequence diverge from 0.5 (i.e., isochrony).
Consequently, any distribution of empirical or theoretical rk values has exactly one corresponding nPVI value. On the contrary, based on a specific nPVI value, it is not possible to determine the exact rk distribution of the original sequence. In other words, the distribution of rhythm ratios cannot be directly reconstructed from the nPVI value. Figure 2 illustrates how different rk distributions can result in the same nPVI value, as well as how the nPVI captures the rk distribution’s spread around 0.5. Also note how both the nPVI and as well as the distribution of rk values disregard the internal order of the ratios within a sequence.

2.2. Interpretation of the nPVI and Rhythm Ratios

The relationship between the nPVI and rk values provides insights into both rhythmic measures. Most importantly, our results show that the distribution of rk values is a more comprehensive measure of rhythmicity than the single nPVI summarizing that distribution. While this might not come as a surprise—a distribution of values potentially contains much more information than a single value—our results highlight that the two measures represent and process rhythmic sequences the same way, since they normalize the data in the same way (i.e., dividing by total duration of two adjacent intervals).
Recent work has investigated the mathematical assumptions underlying the rk formula, showing that rk values are uniformly distributed between 0 and 1 for random sequences generated by the highly random Poisson point process [72]. Many analyses of rk values have implicitly assumed this as the baseline against which to compare rhythmic patterns, even though this process might be considered too random in some cases. This finding can now be translated into new insights about the nPVI. A uniform rk distribution between 0 and 1 can be shown to have an expected nPVI of 100. Consequently, the expected nPVI value of a sequence generated by a Poisson process is 100. This means that any nPVI value lower than 100 is closer to isochrony than one generated by a Poisson process. Similarly, higher nPVI values (i.e., between 100 and 200) correspond to sequences that are less isochronous than random, i.e., sequences that alternate more than random between short and long intervals. However, note that many different rk distributions can correspond to the same nPVI. Consequently, an nPVI close to 100 does not necessarily indicate a random temporal sequence. It is equally possible that it results from a highly rhythmic sequence with an intermediate to high amount of alternation (e.g., consisting only of 1:3 and 3:1 integer ratios).
Overall, the rk distribution has the potential to provide a more complete view of the rhythmic structure of a sequence. Conversely, the found mathematical link between the two measures shows that the rk values, while being the more novel approach of the two, do not capture a wholly different rhythmic property. The more recent rk results can be interpreted in a larger context of studies stretching back to almost 20 years before the rk approach by Roeske et al. [41]. This unification of the two measures’ interpretation has the potential of making them even more useful for cross-disciplinary research.

2.3. Alternative and Complementary Summary Statistics of rk

Interpreting the nPVI as a summary statistic also opens the door for alternative measures based on a sequence’s rk distribution. The nPVI is suited for capturing certain rhythmic properties but not others. For example, Figure 2 shows that there is not a perfect correlation between randomness and the nPVI. Complementing the nPVI through the use of different summary statistics of the rk distribution can be more useful for describing rhythmic properties in a more complete manner.
For rk values, we can use existing summary measures for characterizing distributions. For example, the standard deviation of rk values can be calculated to capture the spread of the rk values similarly to the nPVI (i.e., n P V I = 400 n 1 | r k 1 2 | vs. σ r = 1 n 1 r k μ r 2   ). Note that σ r measures a subtly different property compared to the nPVI: it calculates the spread around the mean rk value, rather than around an rk of 0.5 (i.e., isochrony), as is the case for the nPVI. Analogously, the mean or median rk can give an indication of the “average acceleration or deceleration” within a sequence. These common summary statistics have the advantage that their properties and interpretation are thoroughly understood, enhancing their potential for interdisciplinary use.
An altogether different summary of a probability distribution is the entropy. Entropy measures how concentrated or how spread out the rk values are. The main difference to the nPVI or standard deviation is that it does not measure the spread in comparison to a single point (i.e., 0.5 or the mean rk); instead, it characterizes the inherent concentration of values within a distribution. Such a measure can, for example, be used to quantify how strongly the rk distribution displays distinct categories. In practice, the entropy can be calculated either by calculating the Shannon entropy of the discretized distribution or by estimating a continuous version of entropy (such as differential entropy [73]).
There are plenty of other ways in which the rk distribution could be summarized into one or a couple of values. The insight gained in this study allows for finetuning future analyses to more precisely measure the relevant rhythmic properties in temporal data. However, it is essential to remember that alternative rhythmic measures are not meant to replace but to complement the nPVI. Reporting the nPVI—and where possible, the full rk distribution—is still an essential practice to support cross-disciplinary comparisons between studies.

2.4. nPVI and rk Are Inherently Local Measures

Importantly, the nPVI and any summary measure based on rk values are local quantifications of rhythmic structure. These measures only compare adjacent intervals and therefore do not capture any higher-order patterns that cannot be deduced from successive intervals. This is often a desirable tradeoff: rk values and the nPVI preserve some information about the order in which the intervals appear, which is not the case for direct summary measures of interval durations. Additionally, the denominator in both formulas (i.e., i k + i k + 1 ) normalizes for tempo and removes tempo drift (Figure 1). Independence of tempo is often a desirable property of this approach.
Nevertheless, the approach is a tradeoff, and rk and nPVI are not the only rhythm analyses available. For example, the coefficient of variation (CV) is a measure of dispersion calculated by normalizing the standard deviation by the mean. When applied to the distribution of interval durations (i.e., ik), the CV captures how close to isochrony a sequence is, independent of overall tempo. It thus allows for a direct comparison of interval duration variability across different datasets. The main difference between the nPVI and CV is that the latter is globally normalized by the average tempo of a sequence or dataset rather than locally, per pair of adjacent intervals. In contrast to the nPVI, tempo drift across a sequence will affect the CV of interval durations. Figure 3 demonstrates the differences between the nPVI and CV by calculating them for the four different example sequences of Figure 1.
Another related rhythmic measure is beat precision (previously known as universal goodness of fit, or ugof; [24]). Beat precision measures how close each event is to a hypothetical underlying isochronous grid and normalizes by the constant interval duration of the hypothesized grid. Only the distance between events and the grid are measured: a non-isochronous sequence consisting of multiples of the grid’s interval duration is considered beat-precise. While this is a potentially important advantage over the nPVI and CV, beat precision comes with its own tradeoffs and is, for example, impacted more by tempo drift or minor rhythmic irregularities. Similarly, the Fourier transform is a well-known mathematical technique that, in the context of rhythmic analysis, can serve to quantify the global presence of different frequencies in a continuous signal [24,25,26,74].
Different rhythmic measures quantify subtly different rhythmic properties. A thorough understanding of these subtleties is required when choosing which measures to use in which contexts. When in doubt, multiple measures should be used such that they complement each other. As an example, if the nPVI, CV, and beat precision all result in low values, one can be confident about the amount of isochrony in a sequence. If one of the measures indicates differently, this creates a potential for understanding which types of sequence would generate such results. In general, using different methods can provide a more comprehensive understanding of the data, ultimately leading to more robust and reliable conclusions.

3. Conclusions

Many approaches exist to quantify the rhythmic structure in a temporal sequence. Here, we have described a fundamental relationship between two commonly used measures to quantify rhythmic structure, concluding that the normalized pairwise variability index (nPVI) is a summary statistic of rhythm ratios (rk).
This methodological result provides a more nuanced understanding of the two measures and unifies them. For example, insights into rhythm ratios (e.g., [72]) directly translate into intuition about the nPVI: random sequences generated by a Poisson point process result in a uniform distribution of rk ratios, which has an nPVI value of 100. Additionally, regarding this methodological perspective, past studies using the nPVI can now be interpreted in the context of rhythm ratios, and vice versa (e.g., [61,75]).
Our goal in this study was not to infer general mechanisms governing rhythmic structures at the physical, biological, or cognitive levels. Rather, we established an explicit mathematical link between two commonly used quantitative tools. This, in turn, should be used to build a shared quantitative toolkit for rhythm research.
There exists no perfect universal rhythmic measure. Every measure is a tradeoff between succinctly summarizing a sequence to one or a few values on the one hand, and retaining as much temporal information as needed on the other. In general, the full distribution of rk values will result in a more comprehensive understanding of a sequence’s rhythmic structure [14]. However, the insight that nPVI and rk are conceptually related allows researchers to fine-tune future analyses to their precise research question. Summary statistics (e.g., the standard deviation) of rk values complement the nPVI when describing rhythms and quantify subtly different rhythmic properties. The nPVI remains an appropriate choice for quantifying the amount of isochrony in a sequence and enables a cross-disciplinary comparison with previous and future results in the literature.

Author Contributions

Conceptualization, Y.J. and A.R.; formal analysis, Y.J.; investigation, Y.J. and F.D.; writing—original draft preparation, Y.J., F.D., V.E., J.v.d.W., T.R. and A.R.; writing—review and editing, Y.J., F.D., V.E., J.v.d.W., T.T., M.G., T.R. and A.R.; visualization, Y.J., J.v.d.W. and T.R.; supervision, A.R.; project administration, F.D. and A.R.; funding acquisition, A.R. All authors have read and agreed to the published version of the manuscript.

Funding

Y.J., J.v.d.W., T.R., and A.R. are funded by the European Union (ERC, TOHR, 101041885). Y.J., F.D., T.R., and A.R. are also supported by the HFSP research grant RGP0019/2022. V.E. is supported by the Austrian Science Fund (FWF) “DK Cognition and Communication 2” under grant agreement W1262-B29 [10.55776/W1262].

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Larsson, M. Self-Generated Sounds of Locomotion and Ventilation and the Evolution of Human Rhythmic Abilities. Anim. Cogn. 2014, 17, 1–14. [Google Scholar] [CrossRef] [PubMed]
  2. Myers, B.R.; Lense, M.D.; Gordon, R.L. Pushing the Envelope: Developments in Neural Entrainment to Speech and the Biological Underpinnings of Prosody Perception. Brain Sci. 2019, 9, 70. [Google Scholar] [CrossRef] [PubMed]
  3. Bolton, T.L. Rhythm. Am. J. Psychol. 1894, 6, 145–238. [Google Scholar] [CrossRef]
  4. Nolan, F.; Jeon, H.-S. Speech Rhythm: A Metaphor? Phil. Trans. R. Soc. B 2014, 369, 20130396. [Google Scholar] [CrossRef]
  5. Goldbeter, A. Computational Approaches to Cellular Rhythms. Nature 2002, 420, 238–245. [Google Scholar] [CrossRef]
  6. Dotov, D.; Trainor, L.J. Cross-Frequency Coupling Explains the Preference for Simple Ratios in Rhythmic Behaviour and the Relative Stability across Non-Synchronous Patterns. Philos. Trans. R. Soc. Lond. B Biol. Sci. 2021, 376, 20200333. [Google Scholar] [CrossRef]
  7. Bigand, F.; Bianco, R.; Abalde, S.F.; Novembre, G. The Geometry of Interpersonal Synchrony in Human Dance. Curr. Biol. 2024, 34, 3011–3019.e4. [Google Scholar] [CrossRef]
  8. Kadar, E.E.; Schmidt, R.C.; Turvey, M.T. Constants Underlying Frequency Changes in Biological Rhythmic Movements. Biol. Cybern. 1993, 68, 421–430. [Google Scholar] [CrossRef]
  9. Smith, R.; Rathcke, T.; Cummins, F.; Overy, K.; Scott, S. Communicative Rhythms in Brain and Behaviour. Philos. Trans. R. Soc. Lond. B Biol. Sci. 2014, 369, 20130389. [Google Scholar] [CrossRef]
  10. Chay, T.R. Bifurcations in Heart Rhythms. Int. J. Bifurc. Chaos 1995, 5, 1439–1486. [Google Scholar] [CrossRef]
  11. Katz, P.S. Evolution of Central Pattern Generators and Rhythmic Behaviours. Philos. Trans. R. Soc. Lond. B Biol. Sci. 2016, 371, 20150057. [Google Scholar] [CrossRef] [PubMed]
  12. Greenfield, M.D.; Aihara, I.; Amichay, G.; Anichini, M.; Nityananda, V. Rhythm Interaction in Animal Groups: Selective Attention in Communication Networks. Philos. Trans. R. Soc. Lond. B Biol. Sci. 2021, 376, 20200338. [Google Scholar] [CrossRef] [PubMed]
  13. Kello, C.T.; Bella, S.D.; Médé, B.; Balasubramaniam, R. Hierarchical Temporal Structure in Music, Speech and Animal Vocalizations: Jazz Is like a Conversation, Humpbacks Sing like Hermit Thrushes. J. R. Soc. Interface 2017, 14, 20170231. [Google Scholar] [CrossRef] [PubMed]
  14. Hersh, T.A.; Ravignani, A.; Burchardt, L.S. Robust Rhythm Reporting Will Advance Ecological and Evolutionary Research. Methods Ecol. Evol. 2023, 14, 1398–1407. [Google Scholar] [CrossRef]
  15. Toussaint, G.T. The Geometry of Musical Rhythm: What Makes a “Good” Rhythm Good? 2nd ed.; Chapman and Hall/CRC: New York, NY, USA, 2019; ISBN 978-1-351-24777-1. [Google Scholar]
  16. McAuley, J.D. Tempo and Rhythm. In Music Perception; Handbook of Auditory Research; Riess Jones, M., Fay, R.R., Popper, A.N., Eds.; Springer: New York, NY, USA, 2010; Volume 36, pp. 165–199. ISBN 978-1-4419-6113-6. [Google Scholar]
  17. Honing, H.; Bouwer, F.L. Rhythm. In Foundations in Music Psychology: Theory and Research; The MIT Press: Cambridge, MA, USA, 2019; pp. 33–69. ISBN 978-0-262-03927-7. [Google Scholar]
  18. Leow, L.-A.; Grahn, J. Neural Mechanisms of Rhythm Perception: Present Findings and Future Directions. Adv. Exp. Med. Biol. 2014, 829, 325–338. [Google Scholar] [CrossRef]
  19. Rohrmeier, M.; Zuidema, W.; Wiggins, G.A.; Scharff, C. Principles of Structure Building in Music, Language and Animal Song. Philos. Trans. R. Soc. B Biol. Sci. 2015, 370, 20140097. [Google Scholar] [CrossRef]
  20. Fitch, W.T. The Biology and Evolution of Rhythm: Unravelling a Paradox. In Language and Music as Cognitive Systems; Oxford University Press: New York, NY, USA, 2012; pp. 73–95. ISBN 978-0-19-955342-6. [Google Scholar]
  21. Fitch, W.T. The Biology and Evolution of Music: A Comparative Perspective. Cognition 2006, 100, 173–215. [Google Scholar] [CrossRef]
  22. Honing, H.; Ten Cate, C.; Peretz, I.; Trehub, S.E. Without It No Music: Cognition, Biology and Evolution of Musicality. Philos. Trans. R. Soc. B Biol. Sci. 2015, 370, 20140088. [Google Scholar] [CrossRef]
  23. Ravignani, A.; Norton, P. Measuring Rhythmic Complexity: A Primer to Quantify and Compare Temporal Structure in Speech, Movement, and Animal Vocalizations. J. Lang. Evol. 2017, 2, 4–19. [Google Scholar] [CrossRef]
  24. Burchardt, L.; Briefer, E.; Knörnschild, M. Novel Ideas to Further Expand the Applicability of Rhythm Analysis. Ecol. Evol. 2021, 11, 1–9. [Google Scholar] [CrossRef]
  25. Burchardt, L.S.; Knörnschild, M. Comparison of Methods for Rhythm Analysis of Complex Animals’ Acoustic Signals. PLoS Comput. Biol. 2020, 16, e1007755. [Google Scholar] [CrossRef] [PubMed]
  26. Burchardt, L.S.; Picciulin, M.; Parmentier, E.; Bolgan, M. A Primer on Rhythm Quantification for Fish Sounds: A Mediterranean Case Study. R. Soc. Open Sci. 2021, 8, 210494. [Google Scholar] [CrossRef] [PubMed]
  27. Schneider, J.N.; Mercado III, E. Characterizing the Rhythm and Tempo of Sound Production by Singing Whales. Bioacoustics 2019, 28, 239–256. [Google Scholar] [CrossRef]
  28. Le Bot, O.; Bonnel, J.; Mars, J.I.; Gervaise, C. Odontocete Click Train Deinterleaving Using a Single Hydrophone and Rhythm Analysis. In Proceedings of the ICA 2013 Proceedings—Volume 19 of POMA (Proceedings of Meetings on Acoustics), Montreal, QC, Canada, 2–7 June 2013; Volume 19. [Google Scholar]
  29. Malige, F.; Djokic, D.; Patris, J.; Sousa-Lima, R.; Glotin, H. Use of Recurrence Plots for Identification and Extraction of Patterns in Humpback Whale Song Recordings. Bioacoustics 2020, 680–695. [Google Scholar] [CrossRef]
  30. Heinsohn, R.; Zdenek, C.N.; Cunningham, R.B.; Endler, J.A.; Langmore, N.E. Tool-Assisted Rhythmic Drumming in Palm Cockatoos Shares Key Elements of Human Instrumental Music. Sci. Adv. 2017, 3, e1602399. [Google Scholar] [CrossRef]
  31. Kershenbaum, A.; Bowles, A.E.; Freeberg, T.M.; Jin, D.Z.; Lameira, A.R.; Bohn, K. Animal Vocal Sequences: Not the Markov Chains We Thought They Were. Proc. Biol. Sci. 2014, 281, 20141370. [Google Scholar] [CrossRef]
  32. Kershenbaum, A.; Blumstein, D.T.; Roch, M.A.; Akçay, Ç.; Backus, G.; Bee, M.A.; Bohn, K.; Cao, Y.; Carter, G.; Cäsar, C.; et al. Acoustic Sequences in Non-Human Animals: A Tutorial Review and Prospectus. Biol. Rev. Camb. Philos. Soc. 2016, 91, 13–52. [Google Scholar] [CrossRef]
  33. Arnaud, V.; Pellegrino, F.; Keenan, S.; St-Gelais, X.; Mathevon, N.; Levréro, F.; Coupé, C. Improving the Workflow to Crack Small, Unbalanced, Noisy, but Genuine (SUNG) Datasets in Bioacoustics: The Case of Bonobo Calls. PLoS Comput. Biol. 2023, 19, e1010325. [Google Scholar] [CrossRef]
  34. Kvsn, R.R.; Montgomery, J.; Garg, S.; Charleston, M. Bioacoustics Data Analysis—A Taxonomy, Survey and Open Challenges. IEEE Access 2020, 8, 57684–57708. [Google Scholar] [CrossRef]
  35. Oswald, J.N.; Van Cise, A.M.; Dassow, A.; Elliott, T.; Johnson, M.T.; Ravignani, A.; Podos, J. A Collection of Best Practices for the Collection and Analysis of Bioacoustic Data. Appl. Sci. 2022, 12, 12046. [Google Scholar] [CrossRef]
  36. Sousa-Lima, R.S.; Norris, T.F.; Oswald, J.N.; Fernandes, D.P. A review and inventory of fixed autonomous recorders for passive acoustic monitoring of marine mammals. Aquat. Mamm. 2013, 39, 23–53. [Google Scholar] [CrossRef]
  37. Dalla Bella, S.; Farrugia, N.; Benoit, C.E.; Begel, V.; Verga, L.; Harding, E.; Kotz, S.A. BAASTA: Battery for the assessment of auditory sensorimotor and timing abilities. Behav. Res. Methods 2017, 49, 1128–1145. [Google Scholar] [CrossRef] [PubMed]
  38. Fiveash, A.; Bella, S.D.; Bigand, E.; Gordon, R.L.; Tillmann, B. You got rhythm, or more: The multidimensionality of rhythmic abilities. Atten. Percept. Psychophys. 2022, 84, 1370–1392. [Google Scholar] [CrossRef]
  39. Tavano, A.; Maess, B.; Poeppel, D.; Schröger, E. Neural entrainment via perceptual inferences. Eur. J. Neurosci. 2022, 55, 3277–3287. [Google Scholar] [CrossRef]
  40. Grabe, E.; Low, E.L. Durational Variability in Speech and the Rhythm Class Hypothesis. Pap. Lab. Phonol. 2002, 7, 515–546. [Google Scholar]
  41. Roeske, T.C.; Tchernichovski, O.; Poeppel, D.; Jacoby, N. Categorical Rhythms Are Shared between Songbirds and Humans. Curr. Biol. 2020, 30, 3544–3555.e6. [Google Scholar] [CrossRef]
  42. Pike, K.L. The Intonation of American English; Greenwood Press: Westport, CT, USA, 1979; ISBN 978-0-313-20910-9. [Google Scholar]
  43. Abercrombie, D. Studies in Phonetics and Linguistics; Oxford University Press: Oxford, UK, 1965; ISBN 978-0-19-437023-3. [Google Scholar]
  44. Ozaki, Y.; Tierney, A.; Pfordresher, P.Q.; McBride, J.M.; Benetos, E.; Proutskova, P.; Chiba, G.; Liu, F.; Jacoby, N.; Purdy, S.C.; et al. Globally, Songs and Instrumental Melodies Are Slower and Higher and Use More Stable Pitches than Speech: A Registered Report. Sci. Adv. 2024, 10, eadm9797. [Google Scholar] [CrossRef]
  45. Patel, A.D.; Daniele, J.R. An Empirical Comparison of Rhythm in Language and Music. Cognition 2003, 87, B35–B45. [Google Scholar] [CrossRef]
  46. London, J.; Jones, K. Rhythmic Refinements to the nPVI Measure: A Reanalysis of Patel & Daniele (2003a). Music Percept. 2011, 29, 115–121. [Google Scholar] [CrossRef]
  47. Daniele, J.R. A Tool for the Quantitative Anthropology of Music: Use of the nPVI Equation to Analyze Rhythmic Variability within Long-Term Historical Patterns in Music. Empir. Musicol. Rev. 2017, 11, 228–233. [Google Scholar] [CrossRef]
  48. Daniele, J.R.; Patel, A.D. An Empirical Study of Historical Patterns in Musical Rhythm. Music Percept. 2013, 31, 10–18. [Google Scholar] [CrossRef]
  49. Vukovics, K.; Shanahan, D. Rhythmic Variability, Language, and Style: A Replication and Extension of nPVI Findings with the RISM Dataset. J. New Music Res. 2020, 49, 285–297. [Google Scholar] [CrossRef]
  50. Temperley, D. Rhythmic Variability in European Vocal Music. Music Percept. 2017, 35, 193–199. [Google Scholar] [CrossRef]
  51. VanHandel, L. The War of the Romantics: An Alternate Hypothesis Using nPVI for the Quantitative Anthropology of Music. Empir. Musicol. Rev. 2017, 11, 234. [Google Scholar] [CrossRef]
  52. VanHandel, L.; Song, T. The Role of Meter in Compositional Style in 19th Century French and German Art Song. J. New Music Res. 2010, 39, 1–11. [Google Scholar] [CrossRef]
  53. Kocsis, K.; Duengen, D.; Jadoul, Y.; Ravignani, A. Harbour Seals Use Rhythmic Percussive Signalling in Interaction and Display. Anim. Behav. 2024, 207, 223–234. [Google Scholar] [CrossRef]
  54. Condit-Schultz, N. Deconstructing the nPVI. Music Percept. 2019, 36, 300–313. [Google Scholar] [CrossRef]
  55. Rossi, M.; Smit, E.A.; van der Werff, J.; Ravignani, A.; Rathcke, T. Testing Rhythmic Abilities in Developmental Dyslexia: Is Internal Rhythm Generation Intact? Music Percept. 2024, 42, 135–148. [Google Scholar] [CrossRef]
  56. Jacoby, N.; McDermott, J.H. Integer Ratio Priors on Musical Rhythm Revealed Cross-Culturally by Iterated Reproduction. Curr. Biol. 2017, 27, 359–370. [Google Scholar] [CrossRef]
  57. Savage, P.E.; Brown, S.; Sakai, E.; Currie, T.E. Statistical Universals Reveal the Structures and Functions of Human Music. Proc. Natl. Acad. Sci. USA 2015, 112, 8987–8992. [Google Scholar] [CrossRef]
  58. Daikoku, T. Temporal dynamics of uncertainty and prediction error in musical improvisation across different periods. Sci. Rep. 2024, 14, 22297. [Google Scholar] [CrossRef] [PubMed]
  59. Nave, K.; Carrillo, C.; Jacoby, N.; Trainor, L.; Hannon, E. The development of rhythmic categories as revealed through an iterative production task. Cognition 2024, 242, 105634. [Google Scholar] [PubMed]
  60. De Gregorio, C.; Valente, D.; Raimondi, T.; Torti, V.; Miaretsoa, L.; Friard, O.; Giacoma, C.; Ravignani, A.; Gamba, M. Categorical Rhythms in a Singing Primate. Curr. Biol. 2021, 31, R1379–R1380. [Google Scholar] [CrossRef] [PubMed]
  61. De Gregorio, C.D.; Raimondi, T.; Bevilacqua, V.; Pertosa, C.; Valente, D.; Carugati, F.; Bandoli, F.; Favaro, L.; Lefaux, B.; Ravignani, A.; et al. Isochronous Singing in 3 Crested Gibbon Species (Nomascus spp.). Curr. Zool. 2024, 70, 291–297. [Google Scholar]
  62. Ma, H.; Wang, Z.; Han, P.; Fan, P.; Chapman, C.A.; Garber, P.A.; Fan, P. Small Apes Adjust Rhythms to Facilitate Song Coordination. Curr. Biol. 2024, 34, 935–945.e3. [Google Scholar] [CrossRef]
  63. Raimondi, T.; Di Panfilo, G.; Pasquali, M.; Zarantonello, M.; Favaro, L.; Savini, T.; Gamba, M.; Ravignani, A. Isochrony and Rhythmic Interaction in Ape Duetting. Proc. R. Soc. B Biol. Sci. 2023, 290, 20222244. [Google Scholar] [CrossRef]
  64. Lameira, A.R.; Hardus, M.E.; Ravignani, A.; Raimondi, T.; Gamba, M. Recursive Self-Embedded Vocal Motifs in Wild Orangutans. eLife 2024, 12, RP88348. [Google Scholar] [CrossRef]
  65. van der Vleuten, B.J.R.; Hovenkamp, V.A.; Varkevisser, J.M.; Spierings, M.J. Context-dependent rhythmicity in chimpanzee displays. Proc. B 2024, 291, 20242200. [Google Scholar]
  66. De Gregorio, C.; Maiolini, M.; Raimondi, T.; Carugati, F.; Miaretsoa, L.; Valente, D.; Torti, V.; Giacoma, C.; Ravignani, A.; Gamba, M. Isochrony as Ancestral Condition to Call and Song in a Primate. Ann. N. Y. Acad. Sci. 2024, 1537, 41–50. [Google Scholar] [CrossRef]
  67. Demartsev, V.; Haddas-Sasson, M.; Ilany, A.; Koren, L.; Geffen, E. Male Rock Hyraxes That Maintain an Isochronous Song Rhythm Achieve Higher Reproductive Success. J. Anim. Ecol. 2023, 92, 1520–1531. [Google Scholar] [CrossRef]
  68. Xing, J.; Sainburg, T.; Taylor, H.; Gentner, T.Q. Syntactic Modulation of Rhythm in Australian Pied Butcherbird Song. R. Soc. Open Sci. 2022, 9, 220704. [Google Scholar] [CrossRef] [PubMed]
  69. Eleuteri, V.; van der Werff, J.; Wilhelm, W.; Soldati, A.; Crockford, C.; Desai, N.; Fedurek, P.; Fitzgerald, M.; Graham, K.E.; Koops, K.; et al. Chimpanzee Drumming Shows Rhythmicity and Subspecies Variation; University of St Andrews: Scotland, UK, 2024; submitted. [Google Scholar]
  70. Laffi, L.; Raimondi, T.; Ferrante, C.; Pagliara, E.; Bertuglia, A.; Briefer, E.F.; Gamba, M.; Ravignani, A. The Rhythm of Horse Gaits. Ann. N. Y. Acad. Sci. 2024, 1543, 86–93. [Google Scholar] [CrossRef] [PubMed]
  71. Laffi, L.; Bigand, F.; Peham, C.; Novembre, G.; Gamba, M.; Ravignani, A. Rhythmic Categories in Horse Gait Kinematics. J. Anat. 2025, 246, 456–465. [Google Scholar] [CrossRef] [PubMed]
  72. Jadoul, Y.; Tufarelli, T.; Coissac, C.; Gamba, M.; Ravignani, A. Hidden Assumptions of Integer Ratio Analyses in Bioacoustics and Music. arXiv 2025, arXiv:2502.04464. [Google Scholar] [CrossRef]
  73. Feutrill, A.; Roughan, M. A Review of Shannon and Differential Entropy Rate Estimation. Entropy 2021, 23, 1046. [Google Scholar] [CrossRef]
  74. Picciulin, M.; Bolgan, M.; Burchardt, L.S. Rhythmic Properties of Sciaena Umbra Calls across Space and Time in the Mediterranean Sea. PLoS ONE 2024, 19, e0295589. [Google Scholar] [CrossRef]
  75. Osiecka, A.N.; Fearey, J.; Ravignani, A.; Burchardt, L.S. Isochrony in Barks of Cape Fur Seal (Arctocephalus pusillus pusillus ) Pups and Adults. Ecol. Evol. 2024, 14, e11085. [Google Scholar] [CrossRef]
Figure 1. Four example sequences consisting of four different types of rhythm. (A) The sequences represent different signals as might be encountered when performing a rhythm analysis. The first sequence was created by randomly sampling intervals ( i k ) from a uniform distribution. The second sequence represents an isochronous sequence (i.e., consists of a series of intervals of equal duration). The third sequence represents an alternation of short and long intervals whose durational relationship can be described using simple integer ratios (here, 1:2 and 2:1). The final sequence represents a sequence that is speeding up (i.e., accelerando). (B) The same example sequences’ rhythm ratios (rk) are calculated based on their intervals and vary over time. As explained in more detail in the main text, the rk values capture the relationship between two adjacent intervals. (C) For each sequence, the distributions of i k and r k values are given. The similarities between some i k distributions show how certain rhythmic patterns cannot be distinguished by only considering the intervals.
Figure 1. Four example sequences consisting of four different types of rhythm. (A) The sequences represent different signals as might be encountered when performing a rhythm analysis. The first sequence was created by randomly sampling intervals ( i k ) from a uniform distribution. The second sequence represents an isochronous sequence (i.e., consists of a series of intervals of equal duration). The third sequence represents an alternation of short and long intervals whose durational relationship can be described using simple integer ratios (here, 1:2 and 2:1). The final sequence represents a sequence that is speeding up (i.e., accelerando). (B) The same example sequences’ rhythm ratios (rk) are calculated based on their intervals and vary over time. As explained in more detail in the main text, the rk values capture the relationship between two adjacent intervals. (C) For each sequence, the distributions of i k and r k values are given. The similarities between some i k distributions show how certain rhythmic patterns cannot be distinguished by only considering the intervals.
Vibration 08 00012 g001
Figure 2. Different rk distributions yield the same nPVI values. The nPVI is a summary statistic of the distribution of rk values, and so, each distinct distribution has a unique nPVI. However, for each nPVI value (columns), there exist multiple rk distributions. The top row shows the rk distributions for intervals with integer ratios 1:2, 1:1, and 2:1 and shows how the nPVI varies according to their relative probabilities. The second row shows the same for ratios 1:3, 1:1, and 3:1. All distributions in the third row feature only two ratios, n:m and m:n, and are the result of an alternating sequence with two distinct interval durations. The bottom row shows how the nPVI can measure the variation around 0.5 in continuous distributions. Here, the first four plots are the resulting rk distributions for uniformly distributed intervals (see [60,72]). The right-most plot in the bottom row shows a uniform distribution of rk values, with an nPVI of 100 (i.e., corresponding to rhythm ratios resulting from random sequences generated by a Poisson point process; for details, see [72]). The vertical lines above each rk distribution are an example temporal sequence generated according to this distribution. The visible qualitative difference between sequences in the same column further highlights the strength but also limitations of the nPVI in summarizing the rhythmic ratios as a single value: a high nPVI value might indicate a very irregular sequence but could also mask a surprising amount of regularity.
Figure 2. Different rk distributions yield the same nPVI values. The nPVI is a summary statistic of the distribution of rk values, and so, each distinct distribution has a unique nPVI. However, for each nPVI value (columns), there exist multiple rk distributions. The top row shows the rk distributions for intervals with integer ratios 1:2, 1:1, and 2:1 and shows how the nPVI varies according to their relative probabilities. The second row shows the same for ratios 1:3, 1:1, and 3:1. All distributions in the third row feature only two ratios, n:m and m:n, and are the result of an alternating sequence with two distinct interval durations. The bottom row shows how the nPVI can measure the variation around 0.5 in continuous distributions. Here, the first four plots are the resulting rk distributions for uniformly distributed intervals (see [60,72]). The right-most plot in the bottom row shows a uniform distribution of rk values, with an nPVI of 100 (i.e., corresponding to rhythm ratios resulting from random sequences generated by a Poisson point process; for details, see [72]). The vertical lines above each rk distribution are an example temporal sequence generated according to this distribution. The visible qualitative difference between sequences in the same column further highlights the strength but also limitations of the nPVI in summarizing the rhythmic ratios as a single value: a high nPVI value might indicate a very irregular sequence but could also mask a surprising amount of regularity.
Vibration 08 00012 g002
Figure 3. The nPVI and coefficient of variation (CV) as measures of isochrony. These example sequences correspond to the example sequences in Figure 1. For two of these sequences (isochronous and random; top left and bottom right), the nPVI and CV measures align: the isochronous sequence is perfectly isochronous, and its intervals have no variation. The random sequence clearly diverges from isochrony. For the two other sequences, however, the two measures give different results. A sequence built out of alternating 1:2 and 2:1 ratios (top-right) has a high nPVI, whereas its CV is relatively low. This is because the adjacent intervals are ordered such that they have maximal local contrast, resulting in a high nPVI. Conversely, the sequence displaying accelerando (i.e., speeding up; bottom left) has a low nPVI because adjacent intervals differ only minimally. This sequence has a high CV because of tempo drift. These examples highlight an important aspect of the nPVI and rk values: they are inherently local measures of rhythmic variation, while the CV is a global measure of variability.
Figure 3. The nPVI and coefficient of variation (CV) as measures of isochrony. These example sequences correspond to the example sequences in Figure 1. For two of these sequences (isochronous and random; top left and bottom right), the nPVI and CV measures align: the isochronous sequence is perfectly isochronous, and its intervals have no variation. The random sequence clearly diverges from isochrony. For the two other sequences, however, the two measures give different results. A sequence built out of alternating 1:2 and 2:1 ratios (top-right) has a high nPVI, whereas its CV is relatively low. This is because the adjacent intervals are ordered such that they have maximal local contrast, resulting in a high nPVI. Conversely, the sequence displaying accelerando (i.e., speeding up; bottom left) has a low nPVI because adjacent intervals differ only minimally. This sequence has a high CV because of tempo drift. These examples highlight an important aspect of the nPVI and rk values: they are inherently local measures of rhythmic variation, while the CV is a global measure of variability.
Vibration 08 00012 g003
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Jadoul, Y.; D’Orazio, F.; Eleuteri, V.; van der Werff, J.; Tufarelli, T.; Gamba, M.; Raimondi, T.; Ravignani, A. Rhythmic Analysis in Animal Communication, Speech, and Music: The Normalized Pairwise Variability Index Is a Summary Statistic of Rhythm Ratios. Vibration 2025, 8, 12. https://doi.org/10.3390/vibration8020012

AMA Style

Jadoul Y, D’Orazio F, Eleuteri V, van der Werff J, Tufarelli T, Gamba M, Raimondi T, Ravignani A. Rhythmic Analysis in Animal Communication, Speech, and Music: The Normalized Pairwise Variability Index Is a Summary Statistic of Rhythm Ratios. Vibration. 2025; 8(2):12. https://doi.org/10.3390/vibration8020012

Chicago/Turabian Style

Jadoul, Yannick, Francesca D’Orazio, Vesta Eleuteri, Jelle van der Werff, Tommaso Tufarelli, Marco Gamba, Teresa Raimondi, and Andrea Ravignani. 2025. "Rhythmic Analysis in Animal Communication, Speech, and Music: The Normalized Pairwise Variability Index Is a Summary Statistic of Rhythm Ratios" Vibration 8, no. 2: 12. https://doi.org/10.3390/vibration8020012

APA Style

Jadoul, Y., D’Orazio, F., Eleuteri, V., van der Werff, J., Tufarelli, T., Gamba, M., Raimondi, T., & Ravignani, A. (2025). Rhythmic Analysis in Animal Communication, Speech, and Music: The Normalized Pairwise Variability Index Is a Summary Statistic of Rhythm Ratios. Vibration, 8(2), 12. https://doi.org/10.3390/vibration8020012

Article Metrics

Back to TopTop