Introduction

The predominately used markers for human identity testing are the autosomal short tandem repeat (aSTR) loci. These loci have relatively low to no positive predictive power to disclose personally identifiable information (PII) or personal data, such as health data or personal traits, about an individual. Personal data refer to “any information relating to an identified or identifiable natural person... if it can be directly or indirectly be used to identify a person or persons” [1]. The aSTRs can be (and are) used for kinship analyses (almost exclusively for first degree relative associations), and they do have a low predictive power for population affinity [2,3,4]. The same features of low to no positive predictive power regarding personal data can be attributed to Y-STRs. The Y-STRs can be more informative for inferring population affinity [5, 6] and extending the reach of kinship along the paternal line [7,8,9,10,11] than can be achieved with aSTRs. The maternal line counterpart, the mitochondrial DNA (mtDNA) genome, may provide some genetic information about the health and risk of an individual, although the positive predictive value still is low, and mechanisms for deleting known disease-associated variants have been proposed for forensic analyses [12, 13]. The value of mtDNA for estimation or prediction of population affinity and extended kinship is similar to that of Y-STRs [14,15,16,17]. One could take the position that these risks are nominal, are well managed in the forensic and medicolegal communities, and thus may or may not need to be disclosed when obtaining informed consent to acquire reference samples from individuals to perform applicable forensic casework.

Principles and foundations of informed consent

Informed consent has its origins with the Nuremburg Code [18] (although the principles may have been developed earlier (see [19])) and has been codified by the Belmont Report [20], the Declaration of Helsinki [21, 22], many agencies and organizations (for example, see [23, 24]) and more recently the European Society of Human Genetics addressing direct to consumer (DTC) testing [25], to name a few. Additionally, these principles have been applied broadly to research on human subjects beyond medical research [1]. All espouse basic ethical principles that should be considered when conducting biomedical and behavioral research involving human subjects. These principles, outlined in the Belmont Report, are (1) respect for the individual to make adequately informed judgments freely (i.e., voluntarily and with autonomy) as well as to protect children and those persons with diminished capacity to effect autonomy; additionally worth noting is that as information/technology changes the individual (i.e., donor) may need to be updated to effect sound judgment decision processes; (2) beneficence particularly in the concept of do no harm, while maximizing potential benefits (taking note to avoid overestimation of benefits) and minimizing potential harm considering individual and society levels; and (3) justice so that all are treated fairly or equally.

These principles form the foundations of informed consent which in itself is grounded on three fundamental elements: information, comprehension, and voluntary participation. Informed consent procedures and supporting documents should provide sufficient information so participants can be made aware of the purpose and the research to be performed and the associated benefits and risks. The principle of transparency requires that information be conveyed in a comprehensible manner [26], and it falls on the researcher to ascertain that the potential participant understands the information. Risks should be well-described, and as risks become more severe, the requirement that they are comprehended should be rigorously followed. The agreement by the person to partake in the research study must be voluntarily given and free from coercion and undue influence. The participants should be provided ample opportunity to ask questions, seek to document the opportunity, and be allowed to withdraw at any time from the research. Foremost, personal data should be safeguarded to protect the integrity, dignity, and rights of individuals [1, 26]. Consent must be informed following the above principles for establishing a legal basis for processing samples and data [26].

While the principles and practices of informed consent were developed with medical and behavioral research in mind, they have been adopted willingly in the forensic science arena primarily to acquire reference samples from family members to assist in identifying missing persons (although some countries also obtain informed consent for reference samples from convicted offenders; see [27]). Thus, the forensic science community would appear to have embraced that the same principles of informed consent for medical research on humans also should be applied for practical work for human identification. Although the forensic community has been proactive with informed consent in missing persons and human remains cases, one could envision a sort of waiver of the consent requirement because (a) no foreseeable harm is expected to result from the sampling and analyses, and (b) sampling is permitted, e.g., by law or other regulations, or (c) if an ethical review committee has approved the non-disclosure of certain information [28].

There are examples of consent forms, frequently asked questions (FAQs) and information on processes, and/or recommended policies from a number of sources such as International Committee of the Red Cross (ICRC) [29], Interpol’s I-Familia [30, 31], Armed Forces DNA Identification Laboratory [32, 33], Budowle et al. [34], Australian Federal Police-National DNA Program for Unidentified and Missing Persons [35], and FBI’s Missing Persons Program [36] to name a few. Various aspects of informed consent for missing person identification have been summarized well by Katsanis et al. [37]. The information in these and other documents form a good basis to follow and need not be repeated herein. However, depending on the document(s), risk disclosure may be minimal or not stated, likely because the chance and/or severity of risk to an individual(s) were considered nominal at best. Additionally, as Katsanis et al. [37] point out “Unlike biomedical research requirements, the description of risks to the participant is not required by law.”

Enhanced technical capabilities

With advances in molecular biology technologies, data mining, and access to metadata, it is important to assess whether the past informed consent process and in particular associated risks are concomitant with these increased capabilities. In more recent years, genetic markers (single nucleotide polymorphisms (SNPs)) have been used to predict externally visible traits, such as eye color, hair color, skin pigmentation, and freckles, to support investigative leads [38,39,40,41,42,43]. This technology, which is known as forensic DNA phenotyping [42], can be considered the forerunner to predict some personal information about an individual(s). Predicting these visible traits may not be considered a privacy risk as the features are readily identifiable without genetic testing, and an evidence sample has no rights [38]. Additionally, the use of such data can reduce bias by re-focusing investigations away from minority or vulnerable populations ([38], see [44] for more in-depth discussion on defining vulnerable populations), similar to how a Y-STR haplotype was used in the Vaastraa case to indicate that the donor of biological evidence was not from an immigrant population but from Western Europe [45]. However, some have asserted that forensic DNA phenotyping (visible features and particularly with biogeographical ancestry or ancestry informative markers [46, 47]) could contribute to the risk of stigmatization [48,49,50,51]. Beyond technical issues of, for example, causative versus non-causative SNPs, there are aspects that may be considered regarding the use, maintenance, and extended applications of such forensic predictive data that extend to the entire legal system to include “ethical principles such as autonomy, justice, dignity, confidentiality, and solidarity; legal principles such as due process and proportionality; and democratic values such as equality, transparency, and pluralism” as well as privacy [50]. Indeed, some of these aspects are integral to a proper informed consent and accentuate the need moving forward to enhance the informed consent process.

Forensic genomics capabilities have expanded to where it is now feasible to sequence an entire human genome by massively parallel sequencing, scan the genome with microarrays, and/or perform targeted sequencing of a relatively large number of loci or entire genes of reference and evidence samples [52,53,54,55,56,57,58]. Thus, vast amounts of genetic data can be generated at an individual or population level and potential access to such data may bring substantial benefits. However, access to the data (and samples) also pose greater risks than were not conceivable with STRs and mtDNA to those individuals who have been typed as well as under some certain circumstances the relatives of the genetically analyzed individuals. Consideration and disclosure of risks are important and should not be thought of an obstacle to dissuade participants but instead to ensure that they are well-informed to make autonomous decisions. Additionally, good stewardship can serve to protect the overall use of the technology for the intended benefits afforded.

It is not easy to predict future capabilities and associated efforts and thus previous informed consents should not be condemned for not being sufficiently informative about current advances. But all should be mindful of possible limitations of the use of legacy samples and data. Depending on the language in previous informed consents, it may be necessary to modify or expand documentation and/or processes in a continuous manner to address the enhanced genetic tools and risks of today [59]. Also, the donors of samples that were collected previously and archived may not be proper to use, given at least the spirit of informed consent. However, proportionality principles may tip the balance otherwise. Proportionality in this context could be that the actions proposed or taken should be weighed between the good that can be generated and the harm that may be caused. For example, it would be difficult for law enforcement to unlearn or ignore investigative leads, if the donor subsequently withdrew consent. It would be incumbent upon those who maintain the samples to reach back to update and likely obtain a new informed consent, if feasible or to consider the need to do so based on proportionality (a complex issue in itself for legacy samples and legacy data).

Casework areas impacted by informed consent

There are at least four forensic scenarios in which informed consent may need to be modified and augmented based on these new capabilities: (1) reference samples from family members in missing persons or unidentified human remains cases; (2) targeted analysis of an individual(s) during forensic genetic genealogy (FGG) cases to reduce an investigative burden; (3) donors who provide their samples for validation studies (to include population studies and entry into databases that would be applied to forensic statistical calculations) to support implementation of procedures and operations of the forensic laboratory; and (4) family members that may contribute samples or obtain genetic information from a molecular autopsy [60]. While there are similar requirements for informed consent among these four scenarios, the latter (#4) likely should include a need for genetic counseling, informing relatives of potential harm (also applies to #1), and discussion about disposition and use of postmortem tissue samples that may be collected during and stored after autopsy [61].

While the purpose and benefits with advanced forensic genomic data may still be the same for these four areas as they have been since their inception, the informed consent clearly would have to be updated to describe the new technologies and how they are to be used to include the aims of use (or project), scope of use (or project), and type of data (see [26, 32]). The areas of informed consent that likely would require more thought are risks to the individual, risks to family members, perhaps risks to society, and particularly the issue that sample anonymity may be difficult to maintain and how readily easy it has become to mine genomic data and determine associated medical risks, biogeographical ancestry, kinship, etc. with these capabilities. While some risks may be considered low probability, but as conveyed in the Belmont Report [20], as the severity of the risk increases, the disclosure of the risk should be considered part of the information provided to the donors. One could consider disclosing reasonable risks versus low likelihood risks (possibly because the mechanisms in place by the laboratory system substantially reduce the chance of the particular risk from occurring) and disclose only those risks that are more likely to occur. However, with the potential of security compromises [62] and relative ease to mine data and determine source of an anonymized sample, a more exhaustive risk list with probability of occurrence (either qualitatively or quantitatively) may be more prudent to provide with informed consent. Katsanis et al. [37] advocate that it “is tantamount in bioethical principles for informed consent, allowing a potential participant to weigh the pros and cons of participation.” We concur and strongly urge those obtaining informed consent to put more effort into risk and benefit assessments to adequately inform sample donors.

Informed consent process

The informed consent process should cover (1) purpose for collection of samples; (2) process to analyze the samples (to include type of data); (3) benefits (to donor, target, family, community, etc. as applicable); (4) risks (to donor, target, family, community, etc. as applicable); (5) access to data/reports by the donor; (6) sample disposition; (7) removal of data process (i.e., expungement); (8) process to ask questions/assessment of comprehension; (9) follow-up processes; and (10) voluntary, signed, and dated consent (Table 1). These topics need to be conveyed in lay terms to achieve comprehension.

Table 1 Major topics for consideration in the informed consent form and/or FAQs

Our goal is to raise awareness. The circumstances can be quite varied; but some existing consent forms and FAQs already form a good foundation that may only need to be updated or expanded accordingly. The information could be contained within a consent form. But given the degree of explanation of the purpose, the technology (and its limitations), benefits, risks, and accompanying processes placing such information in the consent form may become unwieldy. An accompanying FAQs document could contain most of the necessary information. However, since ensuring comprehension is an important aspect of proper informed consent, it may be moot about separating consent documentation and information or containing all in one document. We leave it to those involved in obtaining informed consent to determine which approach is more practical for their systems. Instead, Table 1 lists the above ten areas and some topics/concepts for consideration to expand on to be fit for purpose. The topics are not exhaustive but attempt to cover major topics.

Risk is likely higher with acquisition of more genetic data and thus important to consider when obtaining informed consent. The laboratory or whomever seeks consent should determine if occurrence of risk is low, moderate, or high and whether the impact would be low, moderate, or severe. Such assessments can help guide whether the risk should be disclosed or better yet identify measures to mitigate the risk. Below we discuss some areas of risk (not exhaustive) and in some cases what may be ways to mitigate the risks. While it is important to inform the participant about the potential risks, especially those risks that are highly likely and those that may have severe consequences, if they occur, the steps taken to mitigate risk could be perceived as a benefit.

Privacy and security

The European Union’s General Data Protection Regulation (GDPR) [64], ICRC [26, 29], Budowle et al. [34], and I-Familia [30] stress that privacy by design and intended use are requisite for the use case. It is important to emphasize that the samples will be used for the intended purpose(s). Also, what may happen with the data obtained that are not requisite for the intended purpose should be disclosed by specific, blanket, broad information, tiered, presumed with explicit opt out opportunity, waived, or dynamic consent information [59] (see Table 2 for general privacy concepts). For example, it would be prudent to inform donors of reference samples for identifying relatives that their profiles may be searched in specific databases which may reside in another country, if that process is intended [30, 31, 34]. Another example would be during a disaster victim identification, where it may be determined that a father is not biologically related to one of his children. This finding is not directly relevant to the identification of the victim, and the donor(s) of the sample(s) should be made aware of the policy if such information will be or will not be disclosed (for example see [30]). We strongly advocate the latter to maintain the principle of intended use only, as well as privacy considerations.

Table 2 General data privacy concepts for best practices. Note that the concepts are not an exhaustive list but are provided for insight/examples into the processes

Data security is critical for effective data protection [26]. Security measures, or more appropriately lack thereof, can be considered a serious risk, should be a major focus for handling of samples and data, and should be discussed when obtaining informed consent. Physical and IT measures and the degree that they are implemented impact risks (see Table 3 for general security measures). These measures may include training, access rights, physical security of the facility or data storage sites, password protection, encryption, data sharing agreements, destruction of personal data, data management, and retention procedures, to name a few [26]. De-identifying samples (and data) has been one of the primary ways to maintain sample anonymity. However, with technology today, the genetic data may be and likely will be sufficient to identify a donor [52]. Indeed, FGG is based on the use of genomic data to identify relatives of the sample donor and with public records triangulate to the donor [54]. Y-STRs alone have been used to do the same thing [11, 53]. Thus, the risk of identifying the donor (even if the genetic profile is anonymized in the traditional sense) is high. Of course, the risk could be reduced by not disclosing the genetic data or minimizing the data disclosed (part of a mitigation process).

Table 3 General data security measures to mitigate risk. Note that the measures are not an exhaustive list but are provided for insight/examples into the processes

There can be risks with public (see residual newborn bloodspot discussion below) and private sector groups who may exploit the samples and data for purposes not originally consented. If there are potential conflicts of interest with additional parties involved, those risks should be conveyed. Currently, most FGG genomic data are in the hands of the private sector (at least for now) but capabilities have been developed to transfer the methodology into the operational forensic laboratory [57, 65, 66]. National DNA databases have built secure measures to reduce unauthorized access, and, for example, there has been no report of a breach of CODIS. However, a private lab was able to gain access indirectly to CODIS profiles by convincing a government lab to search the National DNA Index System on its behalf [67]. Some of the private providers of FGG (either typing service or owners of databases) have continued to evolve to inform donors and users primarily about legal obligations, security, and privacy. For example, in its Terms of Service and Privacy Policy, GEDmatch.com [68] addresses the purpose of the database and its tools, and the acceptable samples for searching by law enforcement are from biological evidence from a violent crime or for identification of human remains. Users also are required to actively opt-in to sharing data for searching by law enforcement for violent crime investigations. Additionally, users agree they will not upload raw data in GEDmatch.com in violation of the intended use of the platform and that raw data will not be disclosed. Explicit guidance on risk minimization includes the following: the requirement to disclose data under legal obligations; privacy as a balance between risk of linking data to an individual and the need (i.e., purpose) to share information; conversion of raw data to a compressed binary format via a process called tokenization (which is not encryption in the usual sense but is an attempt to limit the ability to reverse engineer an originally uploaded file); deletion and archiving of data; the risk of hacking and decoding of data; access of data by GEDmatch personnel and volunteers on a need to know basis; use of data for internal research; the data will not be shared with third parties for research; and notification requirement if data are shared with third parties. Privately maintained data likely will vary substantially regarding security measures to protect data, and private entities should embrace informed consent and describe the use of acquired data for intended purposes.

Anonymity or de-identification

Given sufficient genetic data, computational power, and accessible metadata, maintaining anonymity may no longer be achievable. Indeed, FGG and familial searching [69] demonstrate that a person can be identified with current tools. Thus, it may be necessary to move from the concept of anonymity to that of pseudonymization. Pseudonymization is defined as “the Processing of Personal Data in such a manner that the Personal Data can no longer be attributed to a specific Data Subject without the use of additional information, provided that such additional information is kept separately and is subject to technical and organizational measures to ensure that the Personal Data are not attributed to an identified or identifiable natural person” [26]. The European Union’s GDPR has endorsed pseudonymization as a measure to protect personal data (Article 4(5), Under Article 4(5) [64]). The capability to anonymize or better yet pseudonymize personal data should be conveyed to the human subject.

Measures to preserve privacy of genomic data are more complicated and require cryptographic techniques for data encryption and protection. Homomorphic encryption is one example of a mechanism to enable data analyses for forensic purposes and maintain pseudonymization [70]. Audits should be undertaken on a regular basis of holders of genomic data for forensic purposes to confirm that the data are secure, data are expunged, and the samples are used for only intended purposes according to policies or contracts in place. It also is important to disclose if audits are performed and relevant findings will or will not be disclosed. At a minimum, genomic data searched in private databases should be used for the intended purpose only and should be expunged when the search is complete (after an adequate time frame). Additionally, all new developments or capabilities should be evaluated via a Privacy Impact Assessment (PIA), or equivalent, to ensure that new feature(s) being included in the software accounts for privacy by design. PIA assessments also evaluate whether new capabilities may unintentionally introduce security vulnerabilities into the software. Regular database penetration testing strategies as part of a robust information security management system can help highlight security risks as well options for mitigation. Interestingly, how security risks are addressed could be perceived as a benefit (at least compared to other options for donor participation) if the measures taken provide more confidence of protection of personal data.

Disclosure and disposition policies

Disclosure and mechanisms to enable or restrict disclosure of genetic data are critical issues. For example, one of us (BB) when becoming Director of the Center for Human Identification became aware that DNA profiles were placed in reports. The original intent may have been to be informative and helpful but may not have considered privacy issues and statutes (which can impact donors such as victims as well as the laboratory). As stated above, STRs (and mtDNA to a slightly greater extent) might have been considered low risk from a privacy perspective. But under the Federal DNA Identification Act 34 U.S.C. § 12592 and Texas Government Code § 411.153, unauthorized disclosure is prohibited and is a state jail felony. The practice of placing DNA results in reports was stopped. Presumably, similar legislation exists in other jurisdictions. Some jurisdictions still may include DNA profiles in reports and may be legally permissible to do so. However, even if not stated in a report, the data are still maintained in case files. One possible way to protect the privacy of individuals is to encrypt the DNA profiles in a manner that the genetic data are not disclosed but in a format that still allows for direct or relationship comparisons. A judge could order disclosure of the genetic data. Possibly a protective order could be placed on the genetic data; but once disclosed, the data are not under control of the laboratory, and thus there is a risk that the data are no longer secure, may be shared with others, may be used for purposes other than it was intended, etc. Disclosure policies and risks should be described to explain who may have access, under what general conditions, and in what data format. Additionally, the fact that disclosure of data can occur regardless of the laboratory’s efforts should be disclosed to the potential participant.

Disposition and particularly retention of the sample(s) and/or profile(s) can be considered a risk. Risk attributes with raw data, especially dense SNP data, are much higher than with STR markers. As long as samples and/or data are maintained, someone unauthorized or even an authorized person may gain access to the samples and/or data and use them beyond intended purposes. Duration of maintenance of the samples and/or data should be described. Policy and/or legislation may dictate the status of samples and profiles. One could see the value of maintaining the DNA sample ad infinitum for re-testing with new technologies or confirmation purposes as well as having as complete a genetic data set per individual for population analyses. Alternatively, there is potential risk that maintaining a sample may allow access for unintended or unauthorized purposes. So, disposal and expungement policies may be desirable. For example, there would not seem to be a need to maintain reference samples and profiles once an identification of human remains is made. One could consider maintaining the reference profiles if the remains were partial. But if an identification was made, then the profile from the remains could serve as a direct comparison profile for additional remains that may be recovered. This approach assumes that the identification is made with a high degree of confidence.

Third-party usage

Another challenge to the protection of personal data (and foremost in maintaining the integrity and dignity of individuals) is that the data generated, for example, for a population study to establish allelic frequency data, may be submitted to third party databases to support forensic applications. The two most noted examples are the Y-STR Haplotype Reference Database (YHRD) [71] and the European DNA Profiling Group’s Mitochondrial DNA Population Database (EMPOP) [72] databases. These databases (and others, for example see [73, 74]) are invaluable resources that support the forensic DNA community. However, the good intentions of the operators of these databases and others can be marred if those in the supply chains of data do not obtain proper informed consent or do not consider in their informed consent (or by permissible legislation) first, the vulnerability of individuals or certain subpopulation groups such as Australia’s Aboriginal and Torres Strait Islander people, Native Americans, Saami in northern Europe, and Roma (see, for example, [75]). For databases such as EMPOP and YHRD that collect data based on population affinity, such vulnerable groups need to be represented, if possible, to perform statistical calculations that likely will be overstated for such groups if not adequately represented. As described in a Report of the International Bioethics Committee of UNESCO [44], “Vulnerability is an inescapable dimension of the life of individuals and the shaping of human relationships. To take into account human vulnerability acknowledges that we all may lack at some point the ability or the means to protect ourselves, our health and our well-being.” Additionally, “While some groups of people can always be considered vulnerable because of their status (e.g. children), others may be vulnerable in one situation but not in another. Therefore, vulnerability cannot be considered as a one-off concept.” Thus, it is incumbent upon scientists to assess vulnerability in each study undertaken. Criteria to consider for assessing vulnerability include the following: rights of every human being; marginalization of peoples; social and educational conditions; gender discrimination; personal liberty; hierarchical relationships; war; exploitation; to name a few [44]. A second consideration is the potential use of the data by a third party (i.e., the databases and the forensic community that rely on the database data). Both of these issues affect autonomy and especially once the personal data are shared. While the operators of the databases have a responsibility to ensure that proper informed consent has been obtained and already take measures to protect the data once in their possession, those scientists undertaking the population genetic studies can effect a better process by having proper and effective informed consent. Recognizing what constitutes a vulnerable population and how to convey in a comprehensible manner the intent, goals, processes, risks, and benefits, as well as engagement with the individuals, can go a long way to empower individuals with the autonomy to make decisions whether to contribute or not to a study without coercion. Regarding obtaining family reference samples from outside the country to identify human remains found within the country, Budowle et al. [43] (see appendix 5 FAQs) require that informed consent cover that the donor understands and agrees that the sample may be searched in national and non-national databases. It should be noted that population data that are published are intended to be used further than for just the sake of publication. In a similar vein, it may be inferred or those who perform population studies believe it to be inferred that data will be placed in certain third party databases and since publication of population data is conditioned on submission to respective databases, going forward, perhaps this third party aspect should be expressed explicitly. This recommendation is particularly critical with dense SNP data where anonymity may no longer be maintained. Lastly, while researchers collecting samples for forensic genetic analyses are obviously the main persons responsible for collecting informed consent and providing the necessary information in an unbiased and non-coercive way, journals publishing the data also can serve as gatekeepers. Most journals have policies that address informed consent predominately relying on the researcher to affirm conditions have been met. Further review of the policies may be worth considering given genomic advances and the blurred line between legally allowed and vulnerable populations.

Considerations in medicolegal autopsy

A medicolegal autopsy may require collection of reference samples for human identification purposes, and the same informed consent approaches described above should apply. However, a molecular autopsy to assist in determining cause and/or manner of death, predominately in unexplained or sudden unexpected deaths, relies on genetic diagnostics [76,77,78]. In this medicolegal area, genetic markers with positive predictive power are employed and that information may impact (i.e., be a risk or reduce harm) family members of the deceased. Family members, a priori, would have increased risk based on sharing of genomes. Also, some family members may be requested to provide a reference sample(s) to better interpret the potential risk of observed genetic variants. Sajantila and Budowle [60] described the risks about information disclosure as the medicolegal context (requests performed under judicial mandate or request) is different than for standard clinical work.

The risks include “(1) genetic information of deceased individuals resides in files of judicial institutions; (2) there is no institutional body within the judicial infrastructure who would naturally meet the medical/genetic consultation needs of the relatives, even if the relatives were to seek advice; and (3) the genetic information in the reports can be elicited to the public during court proceedings.” The risks to privacy are greater (especially with the ease of genetic data analyses and that a molecular autopsy would point directly to a gene(s) and specific genetic variants) and should be included in the informed consent documentation with a greater need by medicolegal personnel to ensure the information is comprehended by sample donors and potentially impacted family members. Additionally, although not part of informed consent, these genetic data may be disclosed publicly and/or reported to family members. There is little or no guidance, i.e., genetic counseling, to such individuals regarding the risks associated with the genetic findings.

This scenario of potential risk to family members gained during a molecular autopsy confounds the personal data integrity process, and the principle of proportionality applies here. We do not attempt to address the issue herein but intend to raise awareness of the complexities of genetic data obtained in the medicolegal context. As stated by Fellmann et al. [61], the focus of a medicolegal investigation is to ascertain whether the cause of death can be assigned to an underlying disease or if there are any legal implications, i.e., determining if the cause of death is natural as opposed to unnatural. Informing family members of the underlying cause of death is not necessarily part of the aim of the medicolegal investigation. These authors further point out that if the cause of death may be attributed to a genetic condition, there now is knowledge that relatives may be at increased risk and that knowledge could be particularly relevant if there is a treatment that may prevent family members from succumbing. Therefore, an autopsy may go beyond determining cause of death, and practitioners should consider informing family members that they may have an increased risk of developing the genetic disease (especially if that risk may be high). Informed consent is usually not required for postmortem investigations. It is not possible to obtain consent from a deceased individual and yet some relatives may have a vested interest in being informed about the genetic results obtained from the deceased individual. Fellmann et al. [61] advocate that if serious harm might be prevented through providing this knowledge to family members, proportionality would support disclosure as opposed to obligations to maintain confidentiality. We concur. However, access to postmortem tissue samples, who will be informed, and when and what information will be given are challenging issues. It will take much more thought than merely raising herein awareness of the complexities of informed consent and subsequent data sharing in the molecular autopsy context.

Providing samples for validation studies

Often, laboratory personnel donate their DNA to support validation studies that are necessary for a quality system. One could submit that in countries where disclosure of such data is possible or expected to meet legal obligations and/or maintain transparency, there could be risk of access to the donors’ genetic data. It may seem that under such legal systems, consent is inferred as anyone employed in those forensic laboratories should be aware to some degree of disclosure requirements. However, it may be prudent to obtain informed consent from donors to validation studies as they may not fully appreciate the ramifications of disclosure and subsequent risks which are much greater with for example dense SNP data, as well as to ensure that agreement is not considered coercion. An alternate source of samples for research and validation studies has been from casework. Nonprobative samples often have been used for validation studies, for example samples from victims. If not already obtained, it may be prudent, going forward, to obtain informed consent to use such samples. In contrast, some jurisdictions may interpret use of these samples obtained without informed consent is legally and/or ethically permissible (see [79] as an example of use of vulnerable population samples without informed consent, but legally permissible, for research and validation studies).

Risk from other sources

The ICRC [29] recommends disclosing data to donors who request the information, which is similar to policies of the DTC companies. With access to data, accuracy of results can be verified or challenged. There is merit in this approach as it could garner more participation and greater transparency. Alternatively, this approach could be considered undue influence prompting people to participate without fully considering the associated risks. The data are complex, and as such there may be a tendency to consider access to the data to be a low risk because of the belief that a bioinformatician is required to interpret the genomic data. A number of students today, however, are being trained on bioinformatics and thus the pool of experts is increasing and likely will continue to increase. Moreover, genomic data mining is feasible by untrained individuals. There are online tools that allow individuals independently to upload their genomic data and determine genetic variants that pose health risks, determine bioancestry, and find relatives (e.g., [80,81,82]). These tools can be quite empowering and, in some cases, quite helpful (e.g., finding an adopted person’s biological parents, although in some cases the parent(s) may not want to be located). However, the untrained may (and likely will) misinterpret the findings which can lead to false positives, false negatives, and over- or underestimation of the genetic risk [82]. Importantly, the emergence of off-the-shelf tools and automation workflows likely will reduce the need for bioinformatics or even genetic training. If genomic data are shared with the participant, proper cautions and disclaimers by the laboratory should be in the informed consent. Of course, sharing the data which then are subsequently used by the donor for another purpose could be interpreted as facilitating use for an unintended purpose (balanced against the autonomy of the donor). Since the genomic data are intended for forensic purposes only, it may be more prudent to not provide to the donor the raw data or profile. Access or no access to data and accordingly the associated risks should be part of informed consent.

There is little discussion about the risk of bad actors, likely because most of us believe the risk to be low and predicting such activity is difficult. After all, ultimately, trust and integrity are essential to the profession, and violation of these principles is considered reprehensible. However, there have been fraudulent or corrupt individuals, most of which are single individuals (e.g., see [83]), although not always by a single actor (e.g., see [84]). Although such occurrences are unlikely, the consequences often are quite severe. Completely eliminating the risk of a single bad actor may not be feasible. Security risks and measures to reduce bad actions can be instituted and possibly described during the informed consent process.

Risk assessment

In order to determine risk and possible severity, a risk analysis should be part of the laboratory, criminal justice system, and/or agencies processes. It is important to convey risks and consequences to properly inform donors. While it is not the purpose of this paper to instruct on how to perform a risk assessment, it is important to stress herein that risk be assessed to address the likelihood of an event and the consequences that may occur [26]. For risk assessment, one should consider (1) the risk and under what circumstances the risk may occur; (2) the consequences should the risk occur; (3) the likelihood of the consequences to occur; and (4) what processes or mechanisms are in place or need to be put in place to control or minimize the risk [85]. These assessments are part of a good quality assurance program to address root cause analysis, corrective action, and preventive action [86]. One approach is a probability/consequence matrix method to assess the overall severity of the contributing factors to each potential risk [87].

Access to samples collected for other intended purposes

Throughout this paper, informed consent has been discussed in light of applications that forensic geneticists and medicolegal scientists may have more direct interactions. Perhaps there is a fifth scenario to consider in which another sample resource that may be used for law enforcement purposes that may require discussion on informed consent. That resource is the vast medical and clinical samples maintained in the healthcare arena. As an example, many countries require newborn screening, within the first 48 h, for a number of medical and genetic disorders [88, 89]. These analyses are legally mandated, and there seems to be little controversy (and possibly little awareness) regarding sample collection and the analyses. However, the residual blood spots (RBS), also known as Guthrie cards, have been retained for years creating a substantial biological resource that may be exploited for other purposes. Indeed, these samples have been used successfully, for example, for disaster victim identification [90, 91]. Very recently, the New Jersey State Police subpoenaed the State’s Newborn Screening Laboratory to obtain a specific blood card to hone in on the donor’s father during an FGG investigation [92, 93]. The lead appears to have been successful; however, a lawsuit has been filed claiming that law enforcement violated its constitutional obligation of demonstrating probable cause to obtain a warrant, as well as there was no informed consent regarding the use of the samples [93]. In a similar fashion over a decade ago in the case of Beleno v. Texas Department of State Health Services (No. 5:09-cv-00188-FB (W.D.Tex., San Antonio Division filed Mar. 12, 2009)), five families filed a lawsuit claiming that the State Agency and Texas A&M University violated federal and state law by using the legally mandated newborn blood spots for unspecified research without the parents’ knowledge or consent [94]. Apparently, the samples were provided to various agencies for research, one being AFDIL to generate a population database [88, 94]. The issue of informed consent was side-stepped in the case, but one consequence was that between 4 and 5 million blood cards that existed at the time were destroyed. The storage and use of RBS samples for secondary purposes raise socio-ethical and public policy issues around privacy, transparency, and autonomy. In the absence of clear guidelines around the use of RBS, common positions held by the research community are (a) RBS samples are considered community property since everyone benefits from the research, and (b) anonymizing the de-identified data is sufficient to protect the privacy of the donor. However, with genomic capabilities today, anonymization may no longer be feasible. In cases where RBS samples are used for the specific forensic purpose of identification, one could envision requiring informed consent from the parents (on behalf of the child) as well as the ability to “opt-in” to forensic use. We are not weighing in on whether these samples can be used for purposes beyond the intended purpose, whether use is legally permissible (although note that such samples have been used legally under exigent circumstances, such as a disaster (see [90])), or whether informed consent is required for newborn screening (all of which shall be left to other discussions); we instead are raising awareness that such samples exist (to include many other medical samples such as biopsy samples or samples collected for whole genome sequencing studies [95] and for that matter the non-medical collection of samples by the military) and may be sought to support criminal and humanitarian investigations. Laboratories should seek guidance on the usage of such samples and be aware that they may not have been obtained with informed consent before undertaking analyses as the consequences could be severe. We strongly advocate that the principle of intended use is maintained, and proactively looking forward with the option to actively opt-in to secondary use of a sample in order to preserve privacy.

Conclusion

Our goal is to raise awareness and facilitate discussion on these important and fundamental aspects of personal data, the role of informed consent, and the need to update the process to be concomitant with current technological capabilities and the times. “The development of new technologies allowing for easier and faster Processing of ever-increasing quantities of Personal Data in an inter-connected world has given rise to concerns about the possible intrusion into the private sphere of individuals” [26]. We are not suggesting that every issue raised herein must be addressed in every informed consent. Instead, we urge that forensic scientists and their law enforcement/legal system counterparts consider these various issues to ensure that a proper informed consent is met relevant to the intended work. While an in-depth informed consent may be perceived as cumbersome, the benefits in the short and long term are individuals will be protected and allowed to make sound decisions (i.e., autonomy) and the use of powerful technologies to investigate crime and identify missing persons and human remains will be sustained. There have been forays recently with using an electronic informed consent (i.e., e-consent), for example, in clinical research and in biobanks [96, 97]. An e-consent makes use of digital technologies incorporating multimedia to convey information and obtain informed consent. There are some indications that e-consent could enhance the informed consent process, particularly for format, subject understanding through greater interactivity, storage, and potential streamlining. While there are some issues to address, either generally or on a case-by-case basis, such as security, legal validity, acceptability by ethics boards, computer literacy with some groups, trust, privacy, and cost, to name a few [98], some guidance for developing and enacting an effective e-consent has been provided [99]. The forensic community should consider investigating e-consent as it may facilitate the task of fully meeting informed consent.

As with all use of genetic data, balance is an important aspect. Discussed herein are examples in which samples may be used legally even though informed consent may not have been obtained and a scenario in which law enforcement may have obtained information and the donor may choose to withdraw consent after the fact. Thus, there may be situations where informed consent, at least in spirit or by intent, may not be necessary or overridden for the greater good. Within the European Economic Area, the entering into force of the GDPR saw a distinct move away from the privileged status of informed consent as the basis for processing personal data. This action does not, however, mean that informed consent should not be used when genetic data are collected from the donor. To the contrary, it is a prerequisite for ethical use of the data as stated, for example, in the WMA Declaration of Taipei version [100]. In terms of the GDPR, it is possible to process genetic data based on law which provides for the purpose and further conditions of the use and eventual possibilities for storing the data (Art. 6.1. (e) or 6.1. (c) together with Art. 9.2. (g)). Genetic data still may be processed on the basis of consent (6.1. (a) and 9.2 (a)), which has to conform to the requirements of the GDPR (Art. 7). The GDPR also opens the door for a broad consent presupposing that it keeps within recognized ethical standards of scientific research (Recital 33). A dynamic electronic consent, in particular, is recommended by the European Data Protection Board.

Vulnerability should be considered and addressed to obtain a valid consent. The social, cultural, and religious norms of the potential vulnerable group of which the human subject(s) belongs should be considered, and these individuals should be treated with dignity and respect. While individualized informed consent is desired, there can be circumstances that may make it difficult to meet this goal. For example, in a mass disaster where there are exigent circumstances and a chaotic environment, it may be more effective to inform the public and/or those directly impacted individuals through media so that they may have an exigent opportunity to understand and agree to intended use(s), processes, risks and benefits of the sample, subsequent generated data, and other personal data that may be gathered. All activities need to be weighed under the principle of proportionality [26]. Finally, we all must hold ourselves accountable to the principles of informed consent and actions or consequences that may ensue with the samples and data obtained. If we do so, we can protect the individuals, give respect and dignity to the individuals, and ultimately contribute effectively to our mission of applying forensic science to support criminal and humanitarian investigations.