Acoustic ranging in poison frogs—it is not about signal amplitude alone

Ringler, Max; Szipl, Georgine; Hödl, Walter; Khil, Leander; Kofler, Barbara; Lonauer, Michael; Provin, Christina; Ringler, Eva

doi:10.1007/s00265-017-2340-2

Acoustic ranging in poison frogs—it is not about signal amplitude alone

Original Article
Open access
Published: 12 July 2017

Volume 71, article number 114, (2017)
Cite this article

Download PDF

You have full access to this open access article

Behavioral Ecology and Sociobiology Aims and scope Submit manuscript

Acoustic ranging in poison frogs—it is not about signal amplitude alone

Download PDF

Max Ringler ORCID: orcid.org/0000-0002-4530-4919^1,2,
Georgine Szipl^3,4,
Walter Hödl²,
Leander Khil²,
Barbara Kofler²,
Michael Lonauer²,
Christina Provin² &
…
Eva Ringler^1,2,5

4111 Accesses
20 Citations
24 Altmetric
2 Mentions
Explore all metrics

Abstract

Acoustic ranging allows identifying the distance of a sound source and mediates inter-individual spacing and aggression in territorial species. Birds and mammals are known to use more complex cues than only sound pressure level (SPL), which can be influenced by the signaller and signal transmission in non-predictable ways and thus is not reliable by itself. For frogs, only SPL is currently known to mediate inter-individual distances, but we hypothesise that the strong territoriality of Dendrobatids could make the use of complex cues for ranging highly beneficial for this family. Therefore, we tested the ranging abilities of territorial males of Allobates femoralis (Dendrobatidae, Aromobatinae) in playback trials, using amplitude-normalized signals that were naturally degraded over distance, and synthetic signals that were masked with different levels of noise. Frogs responded significantly less to signals recorded from larger distances, regardless of SPL and signal-to-noise ratio (SNR), but showed no differential response to natural minimum and maximum SNRs across the typical communication range in wild populations. This indicates that frogs used signal amplitude and SNR only as ancillary cues when assessing the distance of sound sources and relied instead mainly on more complex cues, such as spectral degradation or reverberation. We suggest that this ability mediates territorial spacing and mate choice in A. femoralis. Good ranging abilities might also play a role in the remarkable orientation performance of this species, probably by enabling the establishment of a mental acoustic map of the habitat.

Significance statement

Acoustic ranging allows the distance of vocalizing competitors and mates to be identified. While birds and mammals are known to use complex cues such as temporal degradation, frequency-dependent attenuation and reverberation for ranging, previous research indicated that frogs rely only on signal amplitude (sound pressure level) to assess the distance of other callers. The present study shows for the first time that also poison frogs can make use of more complex cues, an ability which is likely to be highly beneficial in their territorial social organization and probably can also be used for orientation.

Calling amplitude flexibility and acoustic spacing in the territorial frog Allobates femoralis

Article Open access 05 June 2020

Camilo Rodríguez, Adolfo Amézquita, … Walter Hödl

Differential effects of sound level and temporal structure of calls on phonotaxis by female gray treefrogs, Hyla versicolor

Article 29 March 2019

Kevin W. Christie, Johannes Schul & Albert S. Feng

Auditory and distance cues interact to modulate female gray treefrog preferences for male advertisement calls

Article 17 June 2021

Sunny K. Boyd & Noah M. Gordon

Introduction

The ability to localize the direction and distance of vocalizing conspecifics is generally advantageous for animals that use sound to communicate as it allows early decisions to be made before direct contact occurs and reduces the risk of unnecessary aggressive responses (Erulkar 1972; McGregor 1994; Bradbury and Vehrencamp 2011; Hardy and Briffa 2013; Bee et al. 2016). Together with individual identification and eavesdropping, acoustic distance assessment plays a crucial role in territorial social systems of vocal species (McGregor 1993) and commonly mediates inter-individual spacing by informing territory holders about the proximity and thus the threat potential of nearby callers (Brown and Orians 1970; Robertson 1984; Naguib et al. 2008, 2011). Thus, acoustic territory advertisement and ranging allow animals to avoid more costly physical contests and fights over territories (Whitney and Krebs 1975; Richards 1981; Morton 1986; Bee et al. 2016).

Physically, a sound’s most straightforward distance cue is its amplitude, since it depends directly on the distance from the sound source. Acoustic signal amplitude corresponds to sound pressure, which is the local pressure deviation from the ambient atmospheric pressure caused by a sound event. The sound pressure level (SPL) is usually given in decibel (dB) in a logarithmic relation to 20 μPa (for airborne sound), the threshold of human hearing. Under free spherical, atmospheric spreading, sound pressure follows the inverse distance law by 1/r with distance r from the sound source, resulting in an SPL drop of −6 dB per doubling of the distance r (Rossing 2007; Bradbury and Vehrencamp 2011). Thus, if a receiver knows the original SPL of a sound at its source, or when it moves back and forth in its far field, a receiver should be able to estimate signaller distance from the perceived SPL (Naguib 1997b; Nelson 2000). However, SPL alone is not a reliable cue for distance assessment, as a caller could not only be closer to or further away from the receiver, but could also actively vary call amplitude (Richards 1981; Morton 1982). And once emitted, the signal could undergo unpredictable excess attenuation during transmission, caused by vegetation or ground structures, or distortion by wind and temperature gradients (Erulkar 1972; Morton 1975, 1986; Naguib 1997b; Ellinger and Hödl 2003; Kreutz-Erdtmann and Lima Pimentel 2013).

As an adaptation to this restriction, at least birds and mammals (for humans, see Zahorik et al. (2005)) have evolved the ability to also use several more complex cues that do not follow a simple physical law to assess the distance of a sound source. The ranging hypothesis identifies overall temporal degradation, frequency-specific degradation, frequency-specific attenuation and/or reverberation of a signal as cues for distance assessment and has been mainly tested in birds (Richards and Wiley 1980; Morton 1982, 1986; Dabelsteen et al. 1993; McGregor 1993, 1994; Holland et al. 2001; Naguib and Wiley 2001). These cues are more complex than SPL in that they require the concurrent perception and assessment of temporal, spectral and intensity characteristics of a signal. Furthermore, they are highly habitat dependent and not every cue might be present in every vocalization context (e.g. tonal and atonal call components (Sun et al. 2000; Bernal et al. 2009; Bonachea and Ryan 2011) or tonal advertisement and atonal courtship calls (Weygoldt 1980; Simões et al. 2010; Kollarits et al. 2017)); thus, their correct interpretation for ranging depends on experience with the signal (Morton 1982, 1998); but see Naguib (1997a) for an example of inexperienced ranging and Naguib (1998) and Wiley (1998) for a discussion of the role of experience in ranging).

In frogs, only signal amplitude/SPL has been identified so far as a cue for acoustic distance assessment and spacing. This is the case for several species: Blanchard’s cricket frogs (Acris crepitans blanchardi, Wagner Jr. 1989), common tink frogs (Diasporus (Eleutherodactylus) diastema, Wilczynski and Brenowitz 1988), barking treefrogs (Hyla gratiosa, Murphy and Floyd 2005), Pacific treefrogs (Hyla regilla, (Whitney and Krebs 1975; Brenowitz 1989), gray treefrogs (Hyla versicolor, Fellers 1979), painted reedfrogs (Hyperolius mamoratus, Telford 1985), strawberry poison frogs (Oophaga (Dendrobates) pumilio, Bunnel 1973), spring peepers (Pseudacris (Hyla) crucifer, Gerhardt et al. 1989) and wrinkled toadlets (Uperoleia rugosa, Robertson 1984). In these species, SPL threshold values maintain inter-individual distances in aggregations and elicit aggressive responses, or mediate graded responses which are expressed proportionally to SPL (Velez et al. 2013). So far, only little evidence exists that frogs also use more complex cues for ranging, although the possibility has been discussed previously (Ryan and Sullivan 1989; Murphy 2008). However, owing to their poikilotherm physiology, frogs have a highly sedentary lifestyle (Wells 2007), and therefore, it would presumably be highly beneficial to use all available information to remotely assess competitors to optimize energy expenditure in contests (cf. Dyson et al. 2013; Bee et al. 2016).

The lack of evidence for ranging in anurans may partly be due to a lack of research, as unlike the multitude of studies on directional localization and source segregation in anurans (see Gerhardt and Huber 2002; Christensen-Dalsgaard 2005; Narins et al. 2006; Bee and Christensen-Dalsgaard 2016), so far very few studies have specifically addressed cues used in anuran distance assessment. In playback trials with barking treefrogs (Hyla gratiosa), Murphy (2008) did not find evidence for the proposed mechanisms of anuran distance assessment and suggested the use of more complex methods such as triangulation during movement. Using playbacks of normalized, naturally degraded calls of male gray treefrogs (Hyla versicolor), Schwartz et al. (2016) demonstrated that females in this species prefer undegraded calls from closer callers, indicating the perception of signal degradation. Venator et al. (2017) found evidence for the use of temporal degradation for distance assessment in addition to signal amplitude in male cricket frogs (Acris crepitans); however, they did not rule out concurrent inhibitory effects of signal degradation on call recognition in their study. For the austral forest frogs Eupsophus emiliopugini and E. calcaratus, Penna et al. (2017) found a strong dependence of vocal responses to stimulus SPL but no difference in the frogs’ calling activity in response to synthetic pulse amplitude modulation of stimuli that were mimicking naturally degraded calls.

Previous research showed that frogs can recognize and interpret temporally degraded signals (Kuczynski et al. 2010; see Göd et al. (2007) and Vélez et al. ( 2012) for Allobates femoralis) and that they are capable of long-term integration in their auditory system (Alder and Rose 1998). However, other neurological and cognitive constraints on signal processing (cf. Narins et al. 2006) might still limit ranging mechanisms to simpler cues in frogs. In contrast to most birdsong, the vocalizations of frogs are not learned but inherited and highly stereotypic and receive only little contextual or experience-based modification (Hauser 1996; Narins et al. 2006). This means that frogs come with an innate template of their own calls and probably do not need to gain experience with the calls of conspecifics.

Neotropical poison frogs (Dendrobatidae) are known for their territorial social organization, and males of many species advertise their territories through prolonged calling to repel competitors and attract mates (Pröhl 2005; Lötters et al. 2007). Therefore, good ranging skills, integrating more complex cues than SPL alone to make best use of all available information, would be especially beneficial for frogs of this family, rendering them a promising group to search for such abilities in anurans. We tested the ranging abilities of A. femoralis (Dendrobatidae, Aromobatinae) by examining the response of males in playback trials with amplitude-normalized, naturally distance-degraded, conspecific advertisement calls, while controlling for distance and perceived SPL.

In the absence of SPL-related cues, we expected males to still exhibit stronger responses to test signals recorded from closer distances, potentially within an individual’s territory. In turn, responses should be weaker to signals that were recorded across longer distances, likely from outside an individual’s territory (McGregor 1994). The phonotactic responses in A. femoralis are hierarchical, consisting of initial head body orientation (HBO), followed by movement towards the potential intruder, and eventually resulting in a full approach. Given proper signal recognition, we expected a wider initial response (HBO) across signaller ranges, as frogs should also pay attention to lesser potential threats, for which subsequent evaluation may lead to no further aggressive response (movement, approach). In turn, under ranging the discrimination between near and far signals in the later response categories ‘movement’ and ‘approach’ should be more pronounced when threats are correctly assessed, as only higher threats require a full aggressive response. To assess the impact of noise and the signal-to-noise ratio (SNR) on the phonotactic responses of A. femoralis males, we conducted a follow-up experiment with test signals where we only manipulated the SNR of the test signal to the minimum and maximum SNR found in the experiment with naturally degraded signals. Similar phonotactic responses of the tested frogs to both conditions would be indicative that SNR and noise level in the range of the test signals in the first experiment did not play a role in distance assessment with the naturally degraded signals.

Materials and methods

Study species

Allobates femoralis is a Neotropical poison frog (Dendrobatidae) from a species complex with a pan-Amazonian distribution (Amézquita et al. 2009; Fouquet et al. 2012). Males are highly territorial throughout the prolonged breeding season (Roithmair 1992; Ringler et al. 2009), when they announce their territories and try to attract females with extensive calling from elevated perches on the forest floor (Narins et al. 2005). The advertisement call, emitted at 92 dB SPL (re 20 μPa) measured at a distance of 50 cm (Hödl 1987), consists of four notes, each sweeping upwards in the frequency range of 2900–3900 Hz ((Narins et al. 2003; Gasser et al. 2009); see insert in Fig. 2). Males call from sunrise until sunset, and calling activity peaks from 1500 to 1600 h and is lowest around 1200 h (Hödl 1983; Kaefer et al. 2012). Possessing a territory is a prerequisite for male reproductive success (Ursprung et al. 2011), and as a consequence, territories are vigorously defended against calling intruders which are approached and attacked immediately (Narins et al. 2003; Ringler et al. 2011). Territory intrusion can be simulated by broadcasting conspecific calls with a loudspeaker, which is immediately approached phonotactically by males in playback trials (Hödl 1983; Ursprung et al. 2009; Ringler et al. 2011). However, to elicit a full physical attack, the additional optical stimulus of the pulsating vocal sac is required, as was demonstrated using robotic model frogs (Narins et al. 2003, 2005). This phonotactic response is only exhibited above a certain SPL threshold, which was found to be 56–68 dB for a head-body-orientation (HBO) and subsequent antiphonal calling towards the source and >68 dB for a phonotactic approach in a Peruvian population of A. femoralis (Hödl 1987).

Study period and site

The playback trials with normalized, naturally degraded signals were conducted between 30 January and 17 February 2015, and the trials testing the effect of SNR were performed between 01 and 08 March 2017. Hence, we conducted this study at the onset of the rainy season, when males were highly territorial and their calling activity was high due to the concurrent breeding season. We performed the playback experiments in an A. femoralis population located in an 8.3 ha lowland rainforest plot (Ringler et al. 2015b; Ringler et al. 2016) near the field camp ‘Saut Pararé’ (4°02′ N, 52°41′ W, WGS84) of the CNRS Nouragues Ecological Research Station (http://www.nouragues.cnrs.fr; Bongers et al. 2001).

Experiment 1—playbacks with degraded signals

Test signals

The test signals for the conflicting-properties playbacks to assess the ranging abilities of frogs were obtained during a study on understory sound transmission characteristics (MR et al. unpublished data). We used the original synthetic call by Narins et al. (2003; termed ‘standard call’ by Ursprung et al. (2009)) as the base signal, which was composed from natural recordings to feature the average spectral and temporal call parameters of an A. femoralis population near ‘Camp Arataï’ (Gasser et al. 2009), ~35 km downstream from the Pararé population. As such, this call represented a neutral intruding individual for frogs in the Pararé population, unknown to all tested individuals and likely to elicit an equal and reliable aggressive response across all tested individuals. The original recordings for this synthetic call were made with a cassette tape recorder (Professional Walkman WMD6C, Sony, Tokyo, Japan) on cassette tapes (D60 (Type I), TDK, Tokyo, Japan), using a directional condenser microphone (C 568 EB, AKG, Vienna, Austria) placed at ~100 cm in front of the focal male. The recordings were then digitized at a bit-depth of 16 bit and a sampling frequency of 44.1 kHz, using a laptop computer (PowerBook G3, Apple, Cupertino, CA, USA) and the sound processing software Canary 1.2.4 (Charif et al. 1995). Single call notes of the digitized recordings were then cut and re-aligned, using the acoustic software SoundEdit 2.0.7 (Macromedia; now Adobe, San José, CA, USA), to exhibit the average call properties of 15 males from the Arataï population as follows: number of notes per call, 4; note duration and frequency sweep range of note 1: 32.4 ms, 3011–3450 Hz; note 2: 66.1 ms, 2985–3846 Hz; note 3: 50.8 ms, 3004–3767 Hz; note 4: 64.0 ms, 3026–3932 Hz; inter-note intervals: notes 1 and 2: 50.2 ms; notes 2 and 3: 96.2 ms; notes 3 and 4: 43.9 ms; number of calls per bout: 10; inter-call interval (ICI): 458 ms; and inter-bout interval: 8.2 s.

We broadcast and re-recorded this call in all four cardinal compass directions at 14 evenly spaced locations in the study plot (Fig. 1a) from 11 February to 11 March 2009 in the rainy season. At a few locations, one direction could not be recorded due to obstacles (large trees, river) along the recording transects, resulting in 52 unique recording sessions, yielding 312 unique signals. Transmission recordings were conducted between 1000 and 1400 h, the time of the lowest calling activity of A. femoralis males (Hödl 1983; Kaefer et al. 2012), to avoid interference with the recordings. All our recordings of naturally degraded A. femoralis calls were thus free from natural conspecific masking, i.e. contained no natural calls that were either audible or visible in the spectra. Additionally, the general background noise, emitted mainly by crickets and katydids, was lowest during this time period (MR pers. obs.; cf. Ellinger and Hödl (2003) for a rainforerst in Venezuela and Lang et al. (2005) for a rainforest in Panama).

The ‘standard call’, a 16-bit, 44.1-kHz WAV-file that contained a calling bout with 10 calls, was played using a portable audio player (G-Flash 512, Maxfield, Düsseldorf, Germany; company liquidated) and a portable battery-powered car-audio amplifier (Toxic TXC-500, RTO, Hamburg, Germany; maximum RMS power: 2 × 75 W, frequency range: 10–40,000 Hz) driving a portable full-range outdoor speaker (Symbol Pro 130, Magnat, Pulheim, Germany; frequency range: 35–30,000 Hz; high-frequency tweeter disabled for single membrane emission) with the speaker placed directly on the soil (centre of the membrane 9.25 cm above the soil, ~5 cm above the leaf litter). The sound reproduction system was calibrated before every broadcast, using a continuous, pure 400 Hz reference signal to produce an SPL of 95 dB (re 20 μPa; C, fast) at a distance of 0.75 m, resulting in an SPL of 97.8 dB (re 20 μPa; C, fast) of the A. femoralis test signal at this distance. The broadcast A. femoralis call was re-recorded simultaneously at six distances from the speaker. For recording, we used a portable outdoor computer (Toughbook CF-19, Panasonic, Osaka, Japan) and a USB-powered 6-channel audio A/D-interface (am6|2, Emagic, now Apple, Cupertino, CA, USA) with the audio recording software Audition 3.0 (Adobe, San José, CA, USA) at a bit-depth of 24 bit and a sampling frequency of 44.1 kHz. We used a directional microphone (ME66, Sennheiser, Wedemark-Wennebostel, Germany) at 0.75 m to record an immediate, unaltered far-field signal from the loudspeaker. At 1.5, 3, 6, 12 and 24 m, we used omnidirectional microphones (ME62, Sennheiser, Wedemark-Wennebostel, Germany) to capture the natural degradation as well as the reverberation signature of the signal when broadcast across the typical inter-individual communication range of A. femoralis. The microphones were mounted on small table-top tripods, ~10 cm above the soil, ~5 cm above the leaf litter and aligned horizontally, parallel to the forest floor, perpendicular to and pointing directly towards the membrane of the speaker. The position and vertical alignment of the speaker and microphones thus resembled the natural calling and listening positions of A. femoralis males, which call and listen on the leaf litter and from perches that are slightly elevated (10–20 cm; Hödl 1983).

We measured the signal-to-noise ratio (SNR) of the recordings using the ‘Inband Power (dB)’ (IP(dB)) measurement of the audio analysis software Raven Pro 1.5 (Bioacoustics Research Program 2011), which measures the sum of the square magnitudes of the Fourier coefficients in a selection, divided by the product of the DFT size and the number of spectrogram frames in the selection (Pitzrick 2016a), following the suggestions of the program’s developer (Pitzrick 2016b). Inband power was measured separately for each of the four notes of the first call in each recording and across the frequency range of the first formant of each note. The inband power of the background noise was measured from the 2-s period immediately before the first note, with separate measurements across the respective frequency range of each of the four following notes. All logarithmic measurements of inband power were then converted into linear (digital sampling) units (IP(u)) by using the formula IP(u) = 10^(IP(dB)/10). We then calculated a linear SNR(u) for each of the four notes using the formula SNR(u) = (IP(u)_signal − IP(u)_noise)/IP(u)_noise. We then calculated an SNR(u) for each recording by averaging the linear SNR(u) of each of the four notes and obtained an average SNR(u) for each distance by calculating the mean of all SNR(u)s for each distance. To obtain more commonly used logarithmic SNR measurements, we then reconverted SNR(u) to SNR(dB) using the formula SNR(dB) = 10 × log₁₀ SNR(u).

To create the test signals for the playback trials with frogs, blocks of 8.8 s, containing the 10 calls and the reverberation after the last call, were cut from the recordings. The clips were high-pass filtered at 1.3 kHz and low-pass filtered at 12.0 kHz, to eliminate any noise below and above the A. femoralis call and its harmonics, and then amplitude-normalized to 0 dBFS (100%), using the (peak) normalization function of Adobe Audition (see insert in Fig. 2 for spectrograms). We looped the 8.2 s 10-call bouts and their 8.2 s lead-out (final reverberation plus 7.6 s silence) 10 times and added a silent lead-in of 3 s. Thus, a full test signal lasted for 2:47 min and presented 10 bouts of 10 calls, each consisting of 4 notes. We randomly ordered and stored all 312 unique test signals on a portable audio player for later use in the playback trials.

Playback trials

We performed playback trials with a portable audio player (Odys Smart 2 GB, Axdia International, Willich, Germany) driving a portable, amplified outdoor speaker with internal batteries (EcoxBT, Grace Digital, San Diego, CA, USA; maximum RMS power: 2 × 3 W; frequency range: 135–17,000 Hz, S/N-ratio: 88 dB ± 3 dB). We played the test signals from 24-bit, 44.1-kHz WAV-files, which were stored in randomized order on the audio player. We calibrated the playback setup twice per day on the forest floor to produce the undegraded ‘standard call’ with an SPL of 69 dB (re 20 μPa; A; fast) at a distance of 2 m, just above the threshold for phonotactic approaches in A. femoralis as reported by Hödl (1987).

For playbacks, we opportunistically approached calling A. femoralis males in the study area (‘mainland plot’ sensu Ringler et al. (2016)) and recorded the males’ location on a digital map on a portable GIS device (MobileMapper10, SpectraPrecision, Westminster, CO, USA). We carefully approached calling males and placed the speaker on the ground at ~2 m, pointing towards the focal male. The exact playback distance was measured with a rigid, foldable metre immediately after the trial. Playback directions were selected opportunistically to allow for unobstructed playback paths between focal frogs and the speaker and to give an unobstructed view to playback observers. After a resting period of at least 1 min after the speaker had been set up, we started the playback at times when the focal male, as well as any other immediate neighbouring males, were not calling. For the playback, we used the test signals consecutively in the random order as they had been stored in on the audio player. As A. femoralis males only show phonotaxis while a signal is present (Ursprung et al. 2009), it was not possible for us to follow the recommendations of Naguib and Wiley (2001) to play only short single signals to elicit a response in the focal individuals. Thus, we had to present the test signals throughout the trials.

During the playbacks, we commented behavioural observations using a small voice recorder (ICD-PX333, Sony, Tokyo, Japan). We recorded the following hierarchical phonotactic responses: ‘head-body-orientation’ (HBO), as soon as the frog oriented towards the speaker, followed by ‘movement’, as soon as the frog made the first jump towards the speaker, and eventually followed by a successful ‘approach’ within 20 cm of the speaker. We later transcribed and coded the recorded comments as binary responses for performed HBO, movement (following HBO), and approach (following HBO and movement). A trial was rated valid as soon as a frog showed at least HBO, and scored with all responses taking place until the end of the 2:47-min test signal. We stopped trials when the frog approached the speaker within 20 cm. When a frog showed no phonotactic response at all to a test signal, we immediately conducted a second trial without changing the speaker location by broadcasting the next random test signal and following the same trial protocol. If the frog showed a phonotactic response (at least HBO) to the second test signal, we scored a valid negative trial (no response) for the first test signal and a valid positive trial with the respective responses for the second test signal. If a frog also showed no response to the second test signal, we immediately conducted a third trial without changing the speaker location by broadcasting the next random test signal and following the same trial protocol. If the frog showed a phonotactic response (at least HBO) to the third test signal, we scored valid negative trials (no response) for the first and second test signal, and a valid positive trial with the respective responses for the third test signal. If a frog showed no phonotactic response to all three different random test signals, we immediately conducted a control playback, using the original, undegraded ‘standard call’, amplified by 6 dB, to verify the frogs’ general reactivity and territorial status. For frogs that showed a phonotactic response (at least HBO) to the control signal, we scored three valid negative trials for the three previous test signals, respectively. Frogs that also did not respond to the control signal were classified as non-territorial or unmotivated, and all three playback trials were discarded and not scored. When we observed another frog that was interacting physically or acoustically with the focal male, we stopped and discarded the trial and caught both individuals for registration. Frogs that had participated in a playback trial were not approached and tested again on the same day, but could be tested again in further trials on subsequent days.

After the trials, we caught the focal frog and took a digital picture (TG-620, Olympus, Tokyo, Japan) of the ventral pattern for identification, and a dorsal picture on mm-paper for the subsequent measurement of the snout-urostyle-length (SUL) in imageJ (Rasband 1997–2017). Then, we measured the playback distance between focal male and speaker, and the SPL (A; fast) of every signal used in this trial at the initial location of the focal male, using a sound pressure meter (Voltcraft SL-100, Conrad, Hirschau, Germany) to correct for the playback distance and the actually received SPL of the signal in the GLMM analysis. We also measured the ambient temperature and relative humidity at the location of the focal frog, using a thermo-hygrometer (GFTH 95, GHM Messtechnik, Regenstauf, Germany).

Sample size

In 175 playback sessions, we tested 117 different A. femoralis males. Because of equipment failures (interrupted or low playback sound due to lose contacts), we had to discard 6 sessions, leaving 169 sessions (Fig. 1b) in which 214 different test signals were used with 114 individuals. Of these males, 78 participated in one playback session, 21 twice, 11 three times and 4 four times. During these playback sessions, 68 males received 1 test signal, 17 received 2 different test signals, 17 received 3 different test signals, 6 received 4 different test signals, 2 received 5 different test signals, 2 received 6 different test signals, 1 received 7 different test signals and 1 received 8 different test signals. We aimed at performing at least 30 trials with each of the conditions (recording distance) of the test signals. Overall, we used 39 test signals that were recorded from 0.75 m, 41 test signals that were recorded from 1.5 m, 34 test signals that were recorded from 3 m, 37 test signals that were recorded from 6 m and 32 test signals that were recorded from 24 m. The exact number of test signals, their conditions and the breakdown of which male received which and how many test signals and during how many playback sessions is provided in the supplementary raw data.

Our unbalanced repeated measures design resulted from logistic constraints when conducting this study. Locating the frogs without disturbing them, setting up the playback equipment and catching the frogs and measuring environmental and trial parameters afterwards were considerably more time consuming than conducting the actual playbacks. Therefore, we aimed at allocating time available in the field most efficiently by using additional test signals with frogs that showed no response to the initial signals. We later accounted for this unbalanced design and repeated measurements on individuals with a varying number of test signals by integrating a corresponding random factor in our analysis.

Blinded methods

Playback trials were performed blindly to the extent that territorial males were approached opportunistically and their identity was only assessed after the trials; however, territoriality and site fidelity of the males provided information on male identity in the case of repeatedly tested individuals. The test signals were used in the randomized order they had been stored in on the music player previously, with the files specified only with a running number. However, to a certain degree, we could acoustically discern the test condition presented to the frog. Transcription and behavioural coding were done blindly, and acoustic characteristics of the test signals that would have allowed the test condition to be identified were barely audible in the voice-protocol recordings.

Statistics

To analyse the frogs’ responses to naturally degraded signals, we used generalized linear mixed models with the binary response variables ‘HBO’, ‘movement’ and ‘approach’. We included the signal characteristics ‘recording distance of the playback signal’ (distance_rec), ‘location recorded’ (location) and ‘direction recorded’ (direction) as fixed factors for our analysis. We included ‘day of trial’ (day), ‘time of trial’ (minute), ‘SPL’, ‘playback distance of the trial’ (distance_trial), ‘temperature during trial’ (temperature) and ‘humidity during trial’ (humidity) as characteristics of the playback trial. Finally, we used ‘snout-urostyle-length’ (SUL) as a physical characteristic of the tested frogs.

Variance inflation factors were calculated beforehand for all fixed factors in the model (distance_rec, location, direction, day, minute, SPL, distance _trial, temperature, humidity and SUL) to identify collinear parameters (Zuur et al. 2009), and ‘location’ and ‘day’ were excluded due to multicollinearity with other factors. A test of proportions was conducted to investigate whether the proportion of responses differed between recording locations and testing days. The proportions of responses did not vary with ‘location’ (HBO: χ ² = 10.66, df = 13, p = 0.6393; movement: χ ² = 10.514, df = 13, p = 0.6515; approach: χ ² = 11.555, df = 13, p = 0.5644) or ‘day’ (HBO: χ ² = 11.282, df = 11, p = 0.4199; movement: χ ² = 8.3831, df = 11, p = 0.6786; approach: χ ² = 6.5373, df = 11, p = 0.8352).

With the remaining factors, we fitted generalized linear mixed models for each response variable (HBO, movement, approach) using a binomial distribution with a logit link function within the lme4 package (Bates et al. 2016) in R (R Core Team 2017). A nested term including ‘signals received’ by each individual was used as random factor to account for repeated trials per individual. Starting with the full model that contained all fixed factors, a stepwise reduction was applied using likelihood ratio tests to determine whether the deletion of a factor from the model would significantly increase model fit. The full models (Table 1) did not vary significantly from the reduced models for either of the response variables (likelihood ratio tests: HBO: χ ² = 4.452, df = 7, p = 0.7265; movement: χ ² = 4.131, df = 4, p = 0.3886; approach: χ ² = 2.113, df = 2, p = 0.3477), indicating that the full model had the best fit and explained variation in response variables best. Therefore, we present and discuss the results of the full model. To assess whether the probability of fully approaching the speaker up to 20 cm varied between the recording distances of the signals, we conducted post hoc Chi-square tests (Table 2). p values presented in Table 2 are two-tailed and alpha was set to 0.008 to account for multiple testing.

Table 1 Coefficients of the full generalized linear mixed-effect models with estimates

Full size table

Table 2 χ²-tests of the probabilities to approach the speaker compared to chance, in response to distance-degraded signals

Full size table

To investigate the influence of SNR on the frogs’ responses in the playback trials, we normalized the measured, linear SNR(u) values within each distance to a scale from 0 to 1 (SNR_norm). As the factor SNR_norm encoded the recording distance of the playback signals, this factor could not be tested in the same model that included recording distance. Thus, we calculated a separate model with a binomial distribution and a logit link function using a nested term that included the ‘signals received’ by each individual as a random factor and the normalized SNR(u) as a fixed factor.