Introduction

Genetic data have increasingly been used in recent years for population monitoring and conservation (Schwartz et al. 2007a). Genetic information has helped to infer key ecological information in mammals, such as population density (Sollmann et al. 2013), kinship (Schmidt et al. 2016), dispersal behaviour (Mateo-Sánchez et al. 2015) and survival rates (De Barba et al. 2010; Woodruff et al. 2016). In general, genetic data are gathered using samples collected from live-trapping (e.g. tissue, blood), where individuals are physically captured and tagged for capture-mark-recapture analyses. For many years, capture-mark-recapture has provided accurate estimates of population density (Grenier et al. 2009), but genetic data can reveal cryptic patterns that are impossible to access through capture data alone, such as inbreeding rates and genetic diversity (Melosik et al. 2017; Taylor et al. 2017), or even the population structure and gene flow (Zimmerman et al. 2015; Walsh et al. 2016). The proper evaluation of these parameters is essential not only for assessing species ecology, but also for understanding how species respond to environmental change, anthropogenic habitat loss and for developing suitable conservation strategies (Swift and Hannon 2010; Lindenmayer and Fischer 2013).

While genetic analyses tied with live-trapping may improve population monitoring, the overall accuracy of the results depends on the number of samples, which can be low, particularly when focusing on traditional capture methods of rare and/or elusive mammals (Thompson 2013). Moreover, live-trapping is a potential source of stress and/or trauma for captured animals, which is a major concern when dealing with sensitive and threatened species (Powell and Proulx 2003; Pauli et al. 2010). An alternative to live-trapping for estimating population parameters is genetic non-invasive sampling (gNIS), which relies on the use of DNA retrieved from presence signs in the field (e.g. faeces, hairs, urine), without manipulating or even seeing the animals (see Waits and Paetkau 2005 and Beja-Pereira et al. 2009 for reviews).

gNIS has become a common method for population monitoring, especially in large mammals (e.g. carnivores and ungulates), and is often used to estimate population size (Woodruff et al. 2016; Gulsby et al. 2016), determine connectivity and gene flow (Walker et al. 2007; Stansbury et al. 2016) or even assess reproductive behaviour (Stanton et al. 2015; Schmidt et al. 2016). A notable advantage of the use of gNIS in mammals is the considerable increase in the number of available samples that can be processed when live-captures are low (Sollmann et al. 2013; Gillet et al. 2015; Mestre et al. 2015) or logistically difficult (Henry and Russello 2011; Silva et al. 2015). Furthermore, the use of gNIS in low density species may provide the same accuracy as that obtained from traditional live-trapping (Marucco et al. 2011; Mumma et al. 2015) and may even replace it altogether (Trinca et al. 2013).

Despite the great potential of gNIS, there is a general perception that this methodology requires complex logistics and data analysis (Lampa et al. 2013), and that it is more expensive than traditional live-trapping due to putative high laboratory costs. Nevertheless, few studies have explicitly compared the costs and practical effectiveness between these two approaches (Kilpatrick et al. 2013; Cheng et al. 2017). This comparison would provide important information for population monitoring, particularly in the case of species for which live-trapping is still the most common sampling method, such as small mammals (Watkins et al. 2010). While some studies have applied gNIS in small mammals, namely by using faecal samples and/or owl pellets to identify species (Agata et al. 2011; Galan et al. 2012; Barbosa et al. 2013), assess taxonomy (Kuch et al. 2002) and determine phylogeographic patterns (Jaarola and Searle 2004; Miller et al. 2006), it has only been applied once for population monitoring (Gillet et al. 2016).

In the present study, we evaluate the applicability and cost-effectiveness of gNIS for monitoring small mammals. We use as a model species a rodent with low capture rates, the Cabrera vole (Microtus cabrerae), which is an endangered species endemic to the Iberian Peninsula (Fig. 1a). This species inhabits fast disappearing damp grasslands (Pita et al. 2014), which have become highly fragmented during the past decades due to agriculture intensification and livestock grazing (Garrido-García and Soriguer-Escofet 2012; Laplana and Sevilla 2013). As a consequence, the species often shows a metapopulation-like spatial structure and extinction-(re)colonisation dynamics of available habitat patches (Pita et al. 2007). To understand the factors affecting their metapopulation dynamics, it is important to have information on population densities, fecundity, survival and dispersal rates, and the interaction among these, but this has largely been hampered by the difficulty of applying conventional capture-mark-recapture studies due to the low capture rates. Hence, in this work we assess the effectiveness of gNIS compared with live-trapping for monitoring a metapopulation of M. cabrerae in the south-western Iberian Peninsula, namely by gathering information on their genetic diversity, reproductive behaviour and dispersal. To achieve these goals, we use microsatellite data obtained from non-invasive (faeces) and invasive (tissue) samples, and compare inferences from both methods in terms of the number of identified individuals, genetic diversity parameters, kinship and dispersal events. In a companion paper, the same dataset was used to evaluate the use of gNIS for estimating population densities of the Cabrera vole (Sabino-Marques et al. 2018). We discuss the cost-effectiveness of live-trapping versus gNIS sampling methods, and the potential advantages and caveats of gNIS to monitor small mammal populations.

Fig. 1
figure 1

Maps of the study area, showing the potential dispersal barriers (roads and rivers) to Microtus cabrerae dispersal, and the habitat patches that are suitable for the presence of M. cabrerae. Patches where presence signs were found are marked as ‘occupied’, and were further sampled through live-trapping and genetic non-invasive sampling (gNIS). a Global distribution of M. cabrerae (grey shading). b Location of the 78 ha circular area (black circle) within Portugal (outlined), inside the distribution of M. cabrerae (grey shading). c Locations of the collected and extracted tissue samples (△, n = 31). d Locations of the collected and extracted faecal samples (■, n = 323)

Material and methods

Sampling and DNA extraction

Sampling was performed in a 78-ha circular study area near Vila Nova de Milfontes, south-western Portugal (centroid: 37.754720, −8.751256; Fig. 1b). This study area (Fig. 1c) is located within an agropastoral landscape, and included 37 suitable habitat patches (small damp areas with tall grass), initially surveyed for presence signs of M. cabrerae (latrines, grass clippings and grass tunnels; Pita et al. 2006). Samples from live-trapping were collected between November and December 2011 in 23 occupied patches, using Sherman traps in each patch for 9–10 nights (details in Sabino-Marques et al. 2018). A total of 371 traps were placed at likely capture sites and were monitored every 12 h. All individuals captured for the first time were marked with an individual passive integrated transponder (PIT) tag, sexed, weighed and a small ear biopsy was collected (tissue sample). Individuals weighing less than 28 g were considered juveniles (Fernández-Salvador et al. 2001). All individuals were released at the trapping site immediately after manipulation. Recaptured individuals identified through the PIT tag were released without further manipulation. A total of 31 tissue samples were collected in nine patches (Fig. 1c), and were stored in 96% ethanol. These samples were kept at −20 °C until extraction, using the EasySpin® Genomic DNA Minipreps Tissue Kit (Citomed, Lisbon, Portugal) following the manufacturer’s instructions.

Non-invasive samples (faeces) were collected from 21 out of 23 occupied habitat patches, the two missing due to logistic constraints (details in Sabino-Marques et al. 2018). A total of 487 faecal samples were collected in four consecutive days for each patch, to allow for capture-recapture of the genotyped individuals (total of 5 sampling days). The time spent during collection was proportional to the patch area, ranging between 20 and 120 min. The faecal samples were collected as isolated faeces or from small latrines (< 20 faeces) to avoid cross-contamination from conspecifics, given that voles often have communal latrines (Woodroffe et al. 1990). Up to 20 faeces per sample were collected using a sterilised sampling kit (collection tube with 96% ethanol, tweezers and latex gloves), and kept at −20 °C until extraction. Only faeces recognised in the field to be ‘fresh’ were collected, to maximise DNA extraction and amplification success. Of the 487 faecal samples collected, 323 (66.3%) were selected for DNA extraction based on number of faeces per sample (those with more faeces being preferred) and to represent the most comprehensive spatial coverage of each patch (Fig. 1d). At least 10 faecal samples were extracted per patch, except in the case of patches with less than 10 samples, from which all samples were extracted. DNA was extracted using the QIAamp® DNA Micro kit (Qiagen), using three to six faeces per sample. We followed the ‘forensic case work samples’ protocol, without vortexing during the digestion step, to minimise both the destruction of the faeces and the release of plant inhibitors (Wehausen et al. 2004).

Genotyping and sexing

In the 31 tissue samples from the captured individuals, we amplified a total of 20 microsatellite loci, from which 9 were cross-amplified loci from other species of the genus Microtus, and 11 were species-specific loci obtained through next-generation pyrosequencing with the 454 FLX Titanium sequencer (Roche Applied Science, Meylan, France; see Online Resources 1 and 2 for details on the microsatellite optimisation and amplification). We repeated the amplification for all loci with missing data and small allele peaks. The 20 loci were also re-amplified in at least 15% of the tissue samples to confirm the genotyping results and to estimate error rates.

From the set of 20 microsatellite loci used for the tissue samples, 11 were selected for amplification in the faecal samples, on the basis of small PCR fragment size (up to 250 base pairs), high amplification success rates for the tissue samples and also high estimates of genetic diversity (Table 1, Online Resources 1 and 2). For these 11 loci, we estimated the values of probability of identity among unrelated (PI = 3.3 × 10−12) and related individuals (PIsibs = 4.2 × 10−5), and the probability of exclusion (PE = 0.999), using the genotypes from the 31 tissue samples and the software GenAlEx 6.5 (Peakall and Smouse 2012). These values are within the recommended range for individual identification by gNIS (Mills et al. 2000; Waits et al. 2001; McKelvey and Schwartz 2004). The 11 loci were combined in amplification mixes with a smaller number of markers (two to three loci), as recommended for gNIS (Beja-Pereira et al. 2009; see Online Resources 1 and 2 for details).

Table 1 Summary characteristics of the 20 microsatellite loci used for genotyping Microtus cabrerae samples. The values are shown according to each of the three datasets: TS—tissue samples; FC—faecal samples; TS + FC—tissue and faecal samples

For genotyping the faecal samples, we followed a stepwise approach, where we used the microsatellite primer mix with three loci (multiplex ‘Mix 1’) to screen the quality of the samples, before amplifying the remaining eight loci (multiplex ‘Mix 2’ and ‘Mix 3’, Fig. 2, Online Resources 1, 2 and 3). If the samples failed to amplify one of the three loci, or two loci displayed genotyping errors, the samples were excluded from further analyses. Amplification for the 11 loci was replicated a minimum of four times, and a maximum of eight times in instances when the initial four replicates displayed genotyping errors (Fig. 2, Online Resource 3). The PCR protocol for all microsatellites consisted of a touch-down annealing step ranging from 55 to 52 °C for 60 s and a final extension step at 60 °C for 30 min.

Fig. 2
figure 2

Workflow for genotyping non-invasive samples. Allele scoring and analysis were conducted independently by two users, to minimise bias (for detailed guidelines see Online Resource 3). Successful extractions and amplifications are shown as continuous lines (-), repetitions of the PCR amplification are shown as dashed lines (- -) and failed amplifications are shown as dotted lines (•••)

To confirm species identification and the sample quality, the complete cytochrome b (cyt-b) gene was amplified and sequenced for the 31 tissue samples using the primers L14727-SP and H15915-SP (Jaarola and Searle 2002), and for the faecal samples successfully amplified with ‘Mix 1’ a smaller fragment of cyt-b developed for all Iberian rodents was used (see Barbosa et al. 2013 for details). The small fragment of cyt-b was also amplified for at least 20% of all the discarded faecal samples (which failed to amplify with Mix 1, Fig. 2), to ensure that estimates of overall genotyping success rate for gNIS were not biased by attributing to M. cabrerae faeces belonging to non-target species (Fig. 2 and Online Resource 3).

The faecal samples that passed quality control were sexed by amplifying small fragments of two sex-linked introns, DBX5-S and DBY7-S (Online Resources 1 and 2). To account for possible genotyping errors, the PCR reaction for sexing was replicated three times and scored according to the defined criteria (Fig. 2 and Online Resource 3). The 31 tissue samples were also amplified for the two sex-linked introns in one replicate, to confirm genetically the sexing of the individuals. The PCR protocol consisted of a touch-down annealing step ranging from 59 to 53 °C for 45 s and a final extension of 60 °C for 5 min.

All PCR products were run on an ABI3130 capillary sequencer (Applied Biosystems), and while the complete sequence of cyt-b was sequenced for both strands in the tissue samples, for the faecal samples the small fragment was sequenced only for the reverse strand. The software GeneMapper version 4.0 (Applied Biosystems) was used for scoring the microsatellites and small-sized introns. The cyt-b gene sequences were analysed with the software Geneious 8 (Kearse et al. 2012).

A consensus genotype for the faecal samples was built based on a set of guidelines defined for quality control of samples and for scoring alleles (Online Resource 3). We compared the genotypes using the software Gimlet (Valière, 2002). We performed additional re-amplification of genotypes differing by up to two loci (mismatch) or having missing data in up to three loci, to verify the occurrence of genotyping errors (dropout or false alleles) and to correct or complete the genotype (Online Resource 3). If the new PCR reactions failed or produced the same result, the data for the relevant loci was classified as missing.

The genotyped samples from the tissue and faecal samples were assigned to individuals, taking into account that individuals could differ by up to two loci. This value was defined based on the number of mismatches that would minimise assignment errors, by using two approaches. First, we used the values of PI and PIsibs to estimate the probability that two random genotypes match at an increasing number of loci (as described in Lounsberry et al. 2015), which indicated that the probability of siblings sharing nine and eight loci was <1% and < 2.5%, respectively. Then, we analysed the frequencies of pairwise comparisons between all consensus genotypes (that could be from the same or from different individuals) matching at 0 to 11 loci. The results of this analysis indicated that separate samples from the same individual should share nine or more loci (Mckelvey and Schwartz 2004; see also Sabino-Marques et al. 2018 for details). Matches with incomplete genotypes with up to two missing loci were also accepted.

Data analyses

Data analyses were performed separately for the 31 tissue samples (TS dataset; 20 loci) and for the consensus genotypes from the 323 faecal samples (FC dataset; 11 loci). Tissue and faecal samples were also included in a combined dataset (TS + FC) and analysed for the 11 microsatellite loci. Species identification of the TS and FC samples was made by comparing their cyt-b sequences to the Barbosa et al. (2013) reference dataset. Cyt-b haplotype diversity for the TS and FC datasets was estimated in Arlequin 3.5 (Excoffier and Lischer 2010).

Amplification success rates per locus were calculated using the TS dataset and considering all PCR replicates performed during genotyping, for both multiplex and individual amplifications. The genotyping success rate for FC was calculated as the number of successfully amplified consensus genotypes over the sum of all faecal samples analysed (n = 323). Genotyping error rates (dropout and false allele) were estimated for all loci and for both TS and FC with the software Pedant 1.0 (Johnson and Haydon 2007). Since this software is limited to two replicates, and up to eight could be obtained for a single sample in the FC dataset, we followed the author’s instructions and randomly chose four replicates through a RANDOM function in Microsoft Excel, compared these replicates sequentially (e.g. rep1 vs rep2, rep2 vs rep3, etc.) and then averaged the results. For the TS dataset, there were at most only two replicates, simplifying the analysis.

The number of alleles (Na), expected heterozygosity (HE) and observed heterozygosity (HO) for each locus were calculated using the software GenAlEx, and overall inbreeding (FIS) was estimated in the program INEST 2.0 (Chybicki and Burczyk 2009) using 2 × 105 iterations, with 50 iterations of thinning and a burn-in of 2 × 104 iterations. These estimates were obtained separately for the TS, FC and TS + FC datasets. Deviations from Hardy-Weinberg equilibrium (HWE) and linkage disequilibrium (LD) were calculated in genepop 4.3 (Rousset 2008) using a dememorisation of 1 × 104, with a batch length of 5 × 104 and batch number of 2000 for all datasets. The probability (p) values from the multiple HWE and LD tests were corrected using a false discovery rate (Benjamini and Hochberg 1995).

Kinship relations were assessed for the TS, FC and TS + FC datasets using the genetic parentage analysis software COLONY 2.0.5.9 (Jones and Wang 2010). In the analysis, we assumed a monogamous mating system, because previous ecological studies provided support for monogamous behaviour in the Cabrera vole (Fernández-Salvador et al. 2001; Pita et al. 2010; Gomes et al. 2013). Also, preliminary analyses in COLONY considering a polygamous system failed to provide consistent results among different runs (results not shown). Another relevant parameter in this analysis is the proportion of sampled parents in the dataset. Although this could not be estimated directly from our data, preliminary analyses showed that different proportions produced the same results (results not shown). Therefore, we used a default value of 0.5 for both parents in the kinship analyses, as suggested by the authors (Jones and Wang 2010).

To aid in kinship analysis, we compared the cyt-b haplotypes among individuals and used that information for excluding maternity and sibship in the input for COLONY. We also excluded as potential parents any juveniles identified during live-trapping. The ‘very long run’ option was used in COLONY, and six independent runs were performed by changing the seed numbers, with 20 replicates for each run. Only kinship relations consistent in four out of six runs with a likelihood of >95% were considered to be valid. For each dataset, we used the kinship results to detect possible inter-patch dispersal, by evaluating if a pair of related individuals (dyad) was found in different habitat patches.

We tested for sex-biased dispersal in our population using the software COANCESTRY 1.0.1.5 (Wang 2011) for all datasets. We estimated and compared the mean relatedness coefficient among males and females, using the ‘Queller and Goodnight’ estimator (Queller and Goodnight 1989). Moreover, we calculated the mean assignment index correction (AIc) values for each sex and tested for differences between sexes with a non-parametric Mann–Whitney U test in GenAlEx (Mossman and Waser 1999).

Sampling costs and logistics

The costs of live-trapping and gNIS were estimated for field and laboratory work as the sum of human resources (field and laboratory technicians), fieldwork expenses (lodging, fuel, sample collection material, etc.) and laboratory work costs (extraction kits, PCR, genotyping runs, etc.). Base values for human resources were obtained from Fundação para a Ciência e a Tecnologia (FCT) stipend values. We included the material that was especially ordered for this study (collection tubes, gloves, extraction kits, etc.). Equipment that is not specific to this study, such as vehicles and Sherman traps for field work, or thermocycler machines for laboratory work, were not considered for estimating costs. The costs and time associated with field preparation (assessment of suitable patches, monitoring for presence signs before trapping, etc.) and laboratory workflow optimisation (extraction kit trials, primer optimisation, etc.) were also not considered in the final estimates. The total costs were used to calculate the costs per genotype produced, per identified individual and per capture event (including recaptures).

Results

Genotyping success

We obtained complete microsatellite genotypes for all tissue samples (TS dataset), with amplification success among PCR replicates varying between 21.5 to 100% for multiplex reactions, and 50 to 100% for individual reactions (Online Resource 2). No evidence of genotyping errors was found in the TS samples. The cyt-b sequencing of the 31 samples in the TS dataset confirmed the species identification and revealed three different cyt-b haplotypes with four polymorphic sites and an estimated haplotype genetic diversity of 0.667 ± 0.030 (GenBank accession number: KY380621, KY380624, KY380629; see Barbosa et al. 2017). The amplification of the sex-linked introns revealed that the sex of 5 reproductively inactive individuals was misidentified in the field. Two individuals among those captured had the same genotype and cyt-b haplotype, even after re-extraction and re-amplification for all markers. Given the high probability that these individuals were identical twins, only one of them was used in genetic diversity and kinship analysis.

From the 323 selected faecal samples (FC dataset), we obtained consensus microsatellite genotypes for 115 of the samples (35.6%), all confirmed by cyt-b sequencing to be from M. cabrerae. From the cyt-b sequencing of 54 of the 208 discarded samples, we obtained 53 sequences of M. cabrerae, and one of the southwestern water vole (Arvicola sapidus). Only one cyt-b haplotype was found in all the M. cabrerae faecal samples analysed (n = 168), as the small cyt-b fragment amplified in these samples is located in a conservative region of the complete gene for M. cabrerae, and the three haplotypes detected in the TS dataset could not be differentiated in that region. From the total of 208 samples discarded during microsatellite genotyping (Fig. 2), six had three or more alleles for at least two loci, showing evidence of different individuals using the same latrine (conspecific contamination). The error rates were low for the 11 loci among the 115 genotyped samples, with dropout rates ranging from 0 to 1.8% and no evidence of false alleles (Table 1).

Genetic diversity

The genetic diversity and locus size ranges showed little variation among the three datasets (TS, FC and TS + FC, see Table 1). The Na per locus ranged from 2 to 12, HE varied between 0.20–0.93 and HO varied between 0.28–0.93. The overall FIS value was low in the three datasets (TS: 0.036; FC: 0.016; TS + FC: 0.005).

Only one species-specific locus (Mc07) in one dataset (the TS) showed significant (p < 0.05) departure from HWE due to heterozygote deficiency. We observed that all males showed only one allele for this locus, and a BLAST search (via http://blast.ncbi.nlm.nih.gov) against the assembled genome of the prairie vole (Microtus ochrogaster) retrieved the highest similarity to a genomic scaffold (NW_004949134) associated with the X chromosome. Given the likelihood that the locus Mc07 is on the X chromosome, we removed this locus from further analyses, but used the Mc07 genotype information to exclude potential paternities and maternities before running COLONY to improve performance of the software. LD was found in single comparisons in both the TS (Mc30-Mc34, p value = 0.000) and the FC (Mar03-Mar113, p value = 0.000) datasets. LD analyses were repeated using only unrelated individuals (according to the output from COLONY, see below), and these instances of LD disappeared, suggesting that the initial results were caused by related individuals sharing a higher number of alleles.

Kinship and dispersal

There were 81 capture events from 31 individuals during live-trapping in nine patches, with 50 recaptures observed in six patches. From the 115 genotypes obtained from the faecal samples (‘capture events’), we identified 65 individuals (‘captures’) in 14 patches, with 50 ‘recaptures’ observed in 11 of these patches (Table 2). In both the TS and FC datasets, none of the individuals dispersed outside the habitat patch (inter-patch dispersal). When pooling the 65 individuals (FC dataset) with the 31 tissue samples (TS dataset), we identified a total of 87 individuals (TS + FC dataset, Fig. 3a) in 15 habitat patches, with 109 ‘recaptures’ observed in 13 patches (Fig. 3b). Of these, 22 were ‘recaptures’ in the FC dataset of nine live-trapped individuals (TS dataset). The comparison of the consensus genotypes obtained for these nine individuals in the TS and FC datasets showed that only two of 22 genotypes in the FC dataset were different (one genotype had missing data at one locus and the other displayed dropout at two loci). From the overall ‘recaptures’ we identified one inter-patch dispersal event by a male to a nearby patch separated by 249 m (Fig. 4a).

Table 2 Summary statistics obtained from live-trapping and genetic non-invasive sampling (gNIS)
Fig. 3
figure 3

Map of the study area, showing the location of individuals at: a first capture (white circles); b recapture (grey circles); combining data from both live-trapping and genetic non-invasive sampling (gNIS)

Fig. 4
figure 4

Location of females (white circles) and males (dark grey circles) identified from the 31 tissue and 115 faecal samples of Microtus cabrerae. Inter-patch dispersal events inferred by: a capture-recapture (TS + FC ═══) and kinship relations (TS + FC □ □ □) using all types of samples; b kinship relations for tissue (TS □□□) and faecal samples (FC □ □ □)

The parentage analysis in COLONY for the TS dataset revealed four related pairs of individuals (dyads), adding to the already described identical twins (total of five dyads, involving seven individuals out of the 31 individuals genotyped, 22.58%). These seven individuals clustered in three family groups (Online Resource 4), all consisting of siblings. Two siblings were found in two different patches at a distance of 177 m, corresponding to at least one possible inter-patch dispersal event (Table 2, Fig. 4b, Online Resource 4).

For the FC dataset, we identified 27 dyads in COLONY, involving 29 out of the 65 individuals genotyped (44.62%). We found 10 family groups among the 27 dyads, with two instances of paternity (father-offspring), four instances of maternity (mother-offspring) and 21 pairs of siblings (Online Resource 4). There were a minimum of four possible inter-patch dispersal events (91-245 m) detected in three family groups (Table 2, Fig. 4b, Online Resource 4).

For the TS + FC data, the genetic parentage analysis in COLONY recovered a total of 42 dyads (including the identical twins; five of these dyads were found in the TS dataset and 19 found in the FC dataset), involving 48 individuals out of the 87 genotyped (55.17%). We identified 18 family groups, in which five dyads involved paternity, three involved maternity and the remaining 34 were siblings (Online Resource 4). We found a total of six possible inter-patch dispersal events (121–377 m), including that already detected from the TS dataset and another from the FC dataset (Fig. 4a, Online Resource 4).

In the TS dataset, we found a significant difference (mean difference = −0.038, 95% confidence interval, CI = −0.036 to 0.036) in the mean relatedness (MR) among males (MR = −0.010, SE = 0.02) and females (MR = −0.048, SE = 0.03) in COANCESTRY, but no evidence of sex-biased dispersal in GenAlEx (Mann-Whitney U test: Z = 1.721, p = 0.085, two-tailed). For the FC and TS + FC datasets, no significant differences were found in the mean relatedness (FC: mean difference = −0.002, 95% confidence interval, CI = −0.018 to 0.018; TS + FC: mean difference = −0.005, 95% confidence interval, CI = −0.013 to 0.013), among males (FC: MR = −0.019, SE = 0.03; TS + FC: MR = −0.012, SE = 0.03) and females (FC: MR = −0.021, SE = 0.03; TS + FC: MR = −0.017, SE = 0.03) in COANCESTRY, nor in the mean AIc values (FC: Z = −0.364, p = 0.716, two-tailed; TS + FC: Z = 0.278, p = 0.393, two-tailed) obtained with GenAlEx.

Sampling costs and logistics

For live-trapping, a total of 21 days were spent in the field for capture-recapture sessions, representing a cost of 9224.00€ (Table 3). In the laboratory, a total of 37 h were spent analysing the 31 tissue samples (bench work and analysis), with a final cost of 1194.00€. These costs include all replicates and re-extractions done to confirm the genotypes and error rates. For gNIS, a total of five days were spent in the field for faecal sample collection, representing a cost of 2130.00€ (Table 3). A total of 118 h were spent in the laboratory for genotyping the faecal samples, with a final cost of 9722.00€. Considering the results obtained by each method, for live-trapping the estimated cost per individual (and respective genotype) was 336.06€. However, the costs dropped to 128.62€, when we consider the cost per capture event, since animals were tagged with a PIT tag which allowed recapture information to be obtained without genotyping. For gNIS, each genotype corresponds to a capture event, so the estimated cost per genotype and ‘capture’ event was 103.06€, while the cost per individual increased to 182.34€.

Table 3 Summary of the costs (in euros) of field and laboratory work for live-trapping and genetic non-invasive sampling (gNIS), costs per genotype, per individual and capture event (including recaptures) produced in each method. For the time spent, we counted each day used on the field during capture-recapture sessions and sample collection; for the laboratory, we summed the bench time spent for each reaction (in hours), without considering the optimisation period for the molecular markers

Discussion

Workflow and genotyping success

The easy identification of M. cabrerae faeces in our study area allowed for a targeted collection of faecal samples, thereby yielding a species identification accuracy of 99.4%. The only misidentification was a sample belonging to A. sapidus, which is a larger vole that occurs in the same habitat types (Pita et al. 2016), and from which the faeces of juveniles may be mistaken for those of adult M. cabrerae (Garrido-García and Soriguer 2014). This result ensures that failure in microsatellite genotyping was mostly due to low DNA yield rather than non-target sampling. There was also a low rate of conspecific contamination (1.9% of 323 samples), which likely resulted from our strategy to avoid collecting faeces from large and potentially communal latrines.

The genotyping success rate in gNIS (35.6%) was similar to nuclear amplification rates from another study with rodents (43%, Barbosa et al. 2013), but smaller when compared with other herbivorous mammals (e.g. Lepus americanus: 54–69%, Schwartz et al. 2007b; North African ungulates: 80%, Silva et al. 2015). This presumably reflects faecal pellet size and the impact that has on quantities and accessibility of the epithelial cells that provide DNA, with the smaller pellets of rodents resulting in a smaller quantity of DNA that can be extracted per sample. Besides faecal pellet size, the low genotyping success rate might also be due to the rapid DNA degradation owing to the dampness of the grassland habitat in which samples were collected. This is supported by the low genotyping success rates observed in other mammals inhabiting damp habitats (e.g. otters in streamside habitats: Lutra lutra: 41–46%, Prigioni et al. 2006, Lampa et al. 2013; Lontra canadensis: 24%, Mowry et al. 2011), or in mammals for which faeces were exposed to rainy conditions (Odocoileus virginianus: 28%, Goode et al. 2014). The presence of PCR inhibitors—which are common in herbivore faecal samples (Huber et al. 2002, Wehausen et al. 2004)—may also have contributed to reduced genotyping success, even in fresh samples with high DNA yield. In our study, the inhibitors were minimised by extracting only intact faeces and by avoiding vortexing during the digestion step, thus minimising the release of plant content and its inhibitors during the extraction protocol. However, novel extraction protocols may help decrease the PCR inhibitors while improving the DNA amplification rates, by removing inhibitors during the extraction process (Costa et al. 2017), or even reduce them during the collection process (Ramón-Laca et al. 2015).

In contrast to nuclear DNA, the mitochondrial gene cyt-b was successfully amplified in all reactions (n = 169). The higher success in mitochondrial DNA amplification was expected and is similar to that found in other studies, due to the much higher copy numbers of mitochondrial DNA molecules (Beja-Pereira et al. 2009).

When analysing non-invasive samples, it is important to screen for DNA quantity and quality, to avoid unnecessary laboratory costs on samples that will likely show high genotyping error rates and low amplification success with nuclear genes (Beja-Pereira et al. 2009; Lampa et al. 2013). Mitochondrial genes are often used in quality control owing to their greater amplification success rates because of high copy numbers in the cells. However, the higher success in mtDNA amplification may not reflect the quantity and quality of the nDNA present in the faecal samples. Therefore, we chose a small subset of 3 microsatellites (‘Mix 1’) for quality control, following initial optimisation tests where we observed that samples which failed to amplify ‘Mix 1’ also failed to amplify the remaining eight microsatellite loci and sex-linked introns (Online Resource 1). By creating a workflow where the faecal samples were initially screened for these three loci, we were able to restrict the remaining steps to the best quality samples, thereby reducing costs associated with genotyping (Fig. 2 and Online Resource 3). Future modifications to our workflow could include incorporation of real-time PCR (qPCR), with greater precision than simple PCR for quality control (although more expense as well). Another improvement with qPCR could be species specific primers located in the sex chromosomes, allowing species identification and sexing in a single reaction (Moran et al. 2008; O'Neill et al., 2013).

Overall, therefore, by taking a variety of methodological approaches aimed to maximise genotyping success in gNIS, we were able to establish a stepwise workflow starting from field sample collection to laboratory procedures and data analyses. Furthermore, we ensured low genotyping errors with a careful selection and optimisation of the loci for gNIS and the use of technical replicates. High genotyping error rates can result in incorrect genotypes, with subsequent errors in individual and population assignments and downstream analyses (Lampa et al. 2013). Therefore, we highlight the importance of establishing standardised protocols, including technical replication, to reduce bias (including possible human errors) during laboratory work and genotype scoring.

Genetic diversity, kinship and dispersal

Our results showed that either the set of 20 or 11 microsatellites (in tissue and faecal samples, respectively) are highly informative for estimating the genetic diversity and kinship in M. cabrerae. The estimate of average genetic diversity was similar when considering both sets of microsatellites, demonstrating that the smaller set (11) used for faecal samples was sufficient to provide accurate diversity values (Table 1). The metapopulation studied showed moderate to high levels of genetic diversity, with low inbreeding rates.

Overall, the use of gNIS doubled the number of individuals detected (n = 65) in comparison with live-trapping (n = 31). The higher number of individuals identified from the TS to the FC and TS + FC datasets also resulted in an increase in kinship relations detected (Table 2, Online Resource 4). Interestingly, we observed inconsistencies among the kinship relations detected with the FC and TS + FC datasets. The greater sample size in TS + FC might have increased the accuracy of the COLONY analysis, resulting in an overall consistency of the runs for this dataset (only 4 out of 6 runs were consistent in FC). Also incorporated into the analyses of the TS and TS + FC datasets was information drawn from live-trapping, such as the individual’s age class, and additional genetic data obtained from tissue samples (cyt-b haplotypes and genotypes from the X-linked locus). These parameters allowed the exclusion of unlikely dyads from the analysis in TS and TS + FC datasets in comparison to the FC dataset. Generalising from our findings and those of other studies (see Harrison et al. 2013), it is clear that the accuracy of kinship assignments can be improved by increased sampling effort, but this comes at increased cost per unique individual, because greater sampling leads to increased recapture rates. It is also clear that incorporation of a highly variable mitochondrial gene and sex-linked microsatellites is of value in kinship assignments (but again increasing the overall cost). Even better for precision of kinship assignments would be to incorporate high throughput sequencing to allow the capture of whole mitogenomes from faecal samples (van der Valk et al. 2017).

Despite the differences in kinship analyses results between datasets, dispersal distances obtained in the three datasets had overlapping results (TS: 177 m; FC: 91–245 m; TS + FC: 121–378 m). This allowed unseen dispersal events in the landscape to be detected with a maximum dispersal distance of 378 m. This maximum distance is however smaller than those previously detected through telemetry (448 m, Pita et al. 2010) or inferred from stochastic patch occupancy modelling (median: 1147–4837 m, Mestre et al. 2017), likely due to the small size of our study area.

We found no significant differences in sex-biased dispersal in all datasets, and the comparison of mean relatedness among sexes showed no evidence of a philopatric sex. The exception was the TS dataset, in which females were significantly less related than males. However, this was probably an artefact of small sample sizes, as other analysis using the same dataset did not provide evidence for sex-biased dispersal. Overall, lack of philopatry and sex-biased dispersal may be taken as an indication of social monogamy in mammals (Perrin et al. 2000; Lawson Handley and Perrin 2007; Le Galliard et al. 2012; Brom et al. 2016), which is in line with previous observations on the mating system of the Cabrera vole. In fact, social monogamy was previously suggested based on the lack of sexual dimorphism in the species (Ventura et al. 1998), as well as through studies on ecology and home ranges (Fernández-Salvador et al. 2001; Pita et al. 2010; Gomes et al. 2013). However, further research is required to confirm the genetic monogamy of M. cabrerae.

Effectiveness and cost

Using gNIS we were able to detect more individuals with less field effort than conventional live-trapping, which makes the method more efficient timewise for monitoring programs. In fact, using gNIS we detected on average 13.0 different individuals/day (23.0 ‘capture’ events/day, including ‘recaptures’), while live-trapping yielded only 1.5 different individuals/day (3.9 capture events/day) (Table 3). As a consequence, the final costs per obtained genotype, including total fieldwork and genetic analyses, were three times higher when using tissue samples obtained from live-trapping (336.06€) than when using gNIS (103.06€). However, the costs per ‘capture’ event were only slightly higher for live-trapping (128.62€) than for gNIS (103.06€). The higher costs of traditional tissue sampling were associated with the low capture rates observed during live-trapping. In contrast, faecal samples were easier to collect, took less time in the field and provided larger sample sizes, which resulted in more cost-effective field logistics. Thus, even though the laboratory analyses were cheaper and less labour intensive for tissue samples than for gNIS, because the good DNA quality of the tissue samples enables higher genotyping rates with less replicates per locus, gNIS is more cost-effective when considering the price per genotype, because a higher number of individuals were identified. The higher sample sizes obtained with gNIS thus produce better estimates of attributes based on genetic data of major applicability to conservation and monitoring programs, including sexing, kinship, dispersal and social behaviour.

The threshold at which the advantages of gNIS surpass those of live-trapping must be carefully assessed on a case-by-case basis. Based on the research questions and the required information, researchers should consider the time needed in the field for collecting samples and the number and type of samples to be analysed in the laboratory. These factors will impact the costs associated with each sampling strategy, but also the amount of information obtained. For instance, the cost-effectiveness of our gNIS study was high because the faecal samples of M. cabrerae were easy to identify and collect in the field. Also, the small latrines of M. cabrerae seem to be used almost always by a single individual, which makes genotyping easier than if there was conspecific or interspecific contamination. Population density may also affect the cost-effectiveness of gNIS. This is illustrated by two studies with lagomorphs evaluating the costs associated with each sampling method, with results similar to ours obtained for the endangered New England cottontail (Sylvilagus transitionalis) (Kilpatrick et al. 2013), while the cost-effectiveness of gNIS did not surpass that of live-trapping in high density populations of the snowshoe hare (Lepus americanus) (Cheng et al. 2017). Therefore, for abundant small mammal species with high capture rates, we suggest that live-trapping may still be the most cost-effective monitoring technique, as gNIS would not bring any further advantage for field and laboratory logistics, sample size and data analyses. However, this may change in the near future due to the ever declining costs and increasing performance of molecular analysis.

While dedicated laboratory facilities help during non-invasive amplification (e.g. separate room, UV lamps - see Beja-Pereira et al. 2009 and Lampa et al. 2013 for a detailed review), new high-throughput sequencing techniques could also improve the amount of data obtainable through gNIS, and avoid the issues encountered with microsatellites (Guichoux et al. 2011, Shafer et al. 2015). However, microsatellite markers are still widely used in gNIS, as the high-throughput methods require large quantities and good quality of DNA, hampering the implementation of widely used techniques such as genotyping-by-sequencing (Russello et al. 2015). Nevertheless, high-throughput sequencing has been applied for microsatellite genotyping of faecal samples in brown bears, and provided a higher amplification success with lower genotyping error rates than traditional Sanger sequencing (De Barba et al. 2017). High-throughput methods applied to microsatellites can help reduce the cost and time spent on the laboratory, as well as overcome the technical difficulties present in allele scoring, providing a second-life for microsatellites and improving the cost-effectiveness of gNIS for monitoring.

Conclusion

gNIS has been increasingly used in recent years, with many of its drawbacks being overcome by careful sample collection procedures and the optimisation of laboratory work, namely extraction protocols, use of species-specific genetic markers, and design of a proper workflow for increasing genotyping success. While rarely applied to small mammals, our study and the companion paper by Sabino-Marques et al. (2018) have shown that it is possible to use gNIS to monitor these species when presence signs are easily observed, with a clear overall cost-effectiveness compared with traditional live-trapping approaches. In particular, we found that using gNIS provides more information at reduced field effort and lower cost, thus allowing more precise estimates of population parameters such as genetic diversity, inbreeding, dispersal rates, and kinship structure, among others. Furthermore, this information can be obtained without the need to capture, handle and manipulate the individuals (tagging them or taking biopsies), which can be a great source of stress or even mortality (Powell and Proulx 2003; Pauli et al. 2010). Therefore, we propose that gNIS may generally provide a practical and ethically preferable alternative to live-trapping for elusive or rare species with low capture rates, reducing logistical difficulties and expenses. gNIS is thus a promising tool for monitoring small mammals, and we strongly suggest the consideration of this approach for threatened and rare species.