Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Characterizing the neighborhood risk environment in multisite clinic-based cohort studies: A practical geocoding and data linkages protocol for protected health information

  • Ariann Nassel ,

    Contributed equally to this work with: Ariann Nassel, Marta G. Wilson-Barthes

    Roles Conceptualization, Methodology, Writing – original draft, Writing – review & editing

    Affiliation Lister Hill Center for Health Policy, School of Public Health, University of Alabama at Birmingham, Birmingham, Alabama, United States of America

  • Marta G. Wilson-Barthes ,

    Contributed equally to this work with: Ariann Nassel, Marta G. Wilson-Barthes

    Roles Conceptualization, Methodology, Writing – original draft, Writing – review & editing

    marta_wilson-barthes@brown.edu

    Affiliation Center for Epidemiologic Research, Department of Epidemiology, Brown University School of Public Health, Providence, Rhode Island, United States of America

  • Chanelle J. Howe,

    Roles Conceptualization, Funding acquisition, Methodology, Writing – review & editing

    Affiliation Center for Epidemiologic Research, Department of Epidemiology, Brown University School of Public Health, Providence, Rhode Island, United States of America

  • Sonia Napravnik,

    Roles Conceptualization, Methodology, Writing – review & editing

    Affiliation Division of Infectious Diseases, Department of Medicine, School of Medicine, Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America

  • Michael J. Mugavero,

    Roles Conceptualization, Methodology, Writing – review & editing

    Affiliation Division of Infectious Diseases, Department of Medicine, Center for AIDS Research, University of Alabama at Birmingham, Birmingham, Alabama, United States of America

  • Deana Agil,

    Roles Methodology, Writing – review & editing

    Affiliation Division of Infectious Diseases, Department of Medicine, School of Medicine, Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America

  • Akilah J. Dulin

    Roles Conceptualization, Funding acquisition, Methodology, Writing – original draft, Writing – review & editing

    Affiliation Center for Health Promotion and Health Equity, Department of Behavioral and Social Sciences, Brown University School of Public Health, Providence, Rhode Island, United States of America

Abstract

Background

Maintaining patient privacy when geocoding and linking residential address information with neighborhood-level data can create challenges during research. Challenges may arise when study staff have limited training in geocoding and linking data, or when non-study staff with appropriate expertise have limited availability, are unfamiliar with a study’s population or objectives, or are not affordable for the study team. Opportunities for data breaches may also arise when working with non-study staff who are not on-site. We detail a free, user-friendly protocol for constructing indices of the neighborhood risk environment during multisite, clinic-based cohort studies that rely on participants’ protected health information. This protocol can be implemented by study staff who do not have prior training in Geographic Information Systems (GIS) and can help minimize the operational costs of integrating geographic data into public health projects.

Methods

This protocol demonstrates how to: (1) securely geocode patients’ residential addresses in a clinic setting and match geocoded addresses to census tracts using Geographic Information System software (Esri, Redlands, CA); (2) ascertain contextual variables of the risk environment from the American Community Survey and ArcGIS Business Analyst (Esri, Redlands, CA); (3) use geoidentifiers to link neighborhood risk data to census tracts containing geocoded addresses; and (4) assign randomly generated identifiers to census tracts and strip census tracts of their geoidentifiers to maintain patient confidentiality.

Results

Completion of this protocol generates three neighborhood risk indices (i.e., Neighborhood Disadvantage Index, Murder Rate Index, and Assault Rate Index) for patients’ coded census tract locations.

Conclusions

This protocol can be used by research personnel without prior GIS experience to easily create objective indices of the neighborhood risk environment while upholding patient confidentiality. Future studies can adapt this protocol to fit their specific patient populations and analytic objectives.

Introduction

In the United States (US), disadvantaged and socially disordered neighborhoods–those characterized by structural racism that leads to scarcity of resources, poverty, low high school completion rates, racial minority segregation, and/or violence–can have direct and indirect negative effects on health [15]. For example, individuals residing in disadvantaged and disordered neighborhoods are more likely to experience multiple stressors [6, 7] and increased risk of chronic disease such as heart disease, hypertension, and HIV [3, 812] compared to residents of more advantaged and less disordered neighborhoods. Despite the well-established and growing literature surrounding neighborhood effects on health [13, 14], the methods that drive this research vary widely and continue to change over time [1519]. Thus, there is motivation to develop new, accessible and reproducible tools that perform consistently across studies and can be adapted to diverse research objectives.

Geocoding (assigning a latitude and longitude coordinate to a postal address) is one available tool that is increasingly used in public health [20, 21] to help researchers characterize neighborhood environments and assess how individual-level health outcomes vary due to differences in these environments. However, challenges can arise when using geocoding in research studies that rely on participants’ protected health information (PHI), which includes residential address information [22]. Some of the current geocoding approaches rely on sharing de-identified health data outside of the setting in which the data were originally generated so that external mapping experts can conduct in-depth analysis [23]. Alongside potentially incurring additional costs to a project, this “out of house” data sharing approach can create opportunities for data breaches when residential address locations can potentially be identified from published maps via techniques such as reverse geocoding [24], spatial re-engineering [25] and digital scanning [25]. Other geocoding approaches rely on online, cloud-based services [26] such as Google Maps, ArcGIS Online, Census Geocoder, or OpenStreetMap [27, 28], which can create additional opportunities to unlawfully access PHI if cloud-based geocoding is conducted behind an identifiable Internet Protocol (IP) address.

Protecting geocoded patient address data can present additional challenges during multisite clinic-based cohort studies. Multisite studies must frequently choose between performing geocoding and data linkages “in-house” to protect participants’ PHI or involving the external expertise of a geoscientist with specific knowledge of geographic information systems (GIS). For many institutions, the former “in-house” approach will be limited to the GIS knowledge of research personnel [23, 29]. For the latter approach, involving an external geospatial collaborator can require lengthy approvals from the governing Institutional Review Board (IRB) which can delay timely data collection [23]. An external GIS specialist may also be unfamiliar with a site’s specific patient population(s), which can introduce errors during data merging and analyses. Lastly, ensuring comparable levels of geocoding accuracy can be difficult if or when the same geocoding techniques are not uniformly applied across sites.

To address these challenges, we present a publicly available geocoding protocol for characterizing the neighborhood risk environment that can be implemented during multisite clinical cohort studies working with PHI. The protocol can be understood and implemented by staff without prior GIS experience to geocode patients’ residential addresses, link these addresses to crime and socioeconomic data, and create anonymized coded census tracts to construct objective indices of the neighborhood risk environment. The process of anonymizing census tracts via this protocol can enable analyses to be performed at the individual-level while ensuring a patient’s PHI is not linked to census-designated geoidentifiers. This protocol is informed by our prior experience [30] geocoding and linking patient data from two HIV clinic-based cohorts in the United States [3133] and offers a standardized, user-friendly tool that can be adapted for future studies.

Materials and methods

This protocol follows two main phases based on our work [30]. Phase 1 includes (1) geocoding patients’ residential addresses and joining geocoded addresses to census tracts, and (2) abstracting indicators of neighborhood crime and socioeconomic disadvantage at the census tract level. Phase 2 includes (1) linking the aforementioned patient- and neighborhood-level data by census tract, and (2) creating coded census tracts to anonymize the census tract within which a patient lives. All data in this protocol are illustrative and do not include any actual patient information. Fig 1 summarizes the geocoding and data linkages process.

thumbnail
Fig 1. Protocol flow diagram.

Visual depiction of each phase of the geocoding and data linkages protocol. ACS: American Community Survey.

https://doi.org/10.1371/journal.pone.0278672.g001

The full geocoding and data linkages protocol is provided in S1 File and publicly available on the protocols.io platform at dx.doi.org/10.17504/protocols.io.b3dvqi66 [PROTOCOL DOI].

Phase 1: Geocoding and data abstraction

1A. Geocoding residential addresses and joining geocoded addresses to census tract boundaries.

Objective: Geocode patients’ residential addresses and link the geocoded address locations to US census tracts

Requirements: Esri ArcGIS 10.5.1 or later; StreetMap Premium Software for Esri ArcGIS 10.5.1 or later (annual license required)

Detailed instructions: Part 3 of S1 File

To begin implementing this protocol, residential address information should be abstracted from patient medical records and saved locally (e.g., on a secure server protected by the clinic’s firewall) as an Excel Comma-Separated Values (CSV) file. Address information should contain, at minimum, the street address, city, state, and zip code for each patient record. Parts 1 and 2 of the protocol (S1 File) provide instruction for formatting residential address information and preparing to geocode.

Clinic-based research staff should use the latest available version of ArcGIS Desktop and StreetMap Premium (Esri, Redlands, CA) to geocode patients’ residential addresses within a computing environment that meets security requirements for storing PHI [21, 34]. If an address cannot initially be geocoded, a rematch should be run in ArcMap and a suggested address with a minimum match score of 85% should be selected [35, 36]. The match score indicates how closely a residential address matches the geospatial reference data in ArcGIS, where a score of 100% indicates a perfect match. Addresses outside of the study state or Post Office Boxes cannot be assumed to approximate a patient’s neighborhood environment and should not be geocoded.

The geographic coordinates for each patient’s residential address should then be joined to a data layer of census tract boundaries by performing a spatial join in ArcMap. Geocoded addresses linked to census tract boundaries should be abstracted from the ArcMap Attribute Table and reformatted in Excel to include the minimum data fields required to for census tract identification.

1B. Abstracting crime indicators from Esri’s business analyst online.

Objective: Ascertain neighborhood crime risk indicators for census tracts containing geocoded patient addresses

Requirements: Esri Business Analyst Online version 5.82 or later

Detailed instructions: Part 4 of S1 File

Indicators of neighborhood crime risk should be obtained at the census tract level using the latest available version of ArcGIS Business Analyst Online (Esri, Redlands, CA), a web-based program offering additional de-identified census-level data (e.g., crime, business, sociodemographic). In this example protocol, two neighborhood crime indices–a Murder Rate Index and an Assault Rate Index–are obtained from Business Analyst because of their known associations with HIV outcomes [3740]. Business Analyst derives its data from Applied Geographic Solutions (AGS) CrimeRisk© indices (Applied Geographic Solutions, 2020B) [41, 42] which are based on FBI Uniform Crime Reports and local police departments. Other crime indices include, but are not limited to, Rape, Burglary, and Motor Vehicle Theft Indices and composite indices of Personal, Property, and Total Crime. Alternative crime indices can be selected from Business Analyst to reflect a study’s specific neighborhood environment.

Within Business Analyst, the geography should be set to census tracts and each census tract within the study state should be selected. Using the SmartMap Search feature, data for the relevant crime indicators should be selected and downloaded as a CSV file. Downloaded data will contain a numeric census tract identifier and data for the relevant index. This process will need to be repeated in stages because Business Analyst does not permit data abstraction for all census tracts at one time.

1C. Abstracting socioeconomic indicators from data.census.gov.

Objective: Ascertain neighborhood socioeconomic data for census tracts containing geocoded patient addresses

Requirements: American Community Survey Five-Year Estimates from the US Census Bureau

Detailed instructions: Part 5 of S1 File

Census tract level-indicators of socioeconomic disadvantage should be obtained using the US Census Bureau’s American Community Survey (ACS) 5-year estimates that most closely align with the study period (i.e., the most recent 5-year estimates for prospective studies or earlier 5-year estimates for retrospective studies) [43]. Compared to single-year and three-year estimates, ACS 5-year estimates provide increased statistical reliability of the data particularly for smaller geographic areas and subpopulations [44]. In this example protocol, three socioeconomic indicators–unemployment, high school education, and poverty–are obtained from ACS based on their known associations with HIV outcomes [3740]. Alternative socioeconomic indicators can be selected from the more than 20,000 variables available in ACS [43].

Using the Advanced Search option in data.census.gov, the relevant Table ID number should be entered and all census tracts within the study state should be selected and downloaded as a CSV file. Identifying the Table that contains study-specific variables can be done by typing the subject area of interest into the Explore Census Data search box at data.census.gov. Downloaded data will contain a numeric census tract identifier, a census tract label, and data for relevant socioeconomic indicator(s). In this example protocol, only percentage estimates of each indicator are preserved for data linkages but other formats (e.g., total or median estimates) can be selected from the downloaded data.

Phase 2: Data linkages & anonymizing census tracts

2A. Linking geocoded addresses and abstracted neighborhood risk data by census tract.

Objective: Link abstracted socioeconomic and crime risk data to each census tract containing a geocoded patient address

Requirements: Macro-enabled Excel File (S2 File)

Detailed instructions: Part 6 of S1 File

In this step, four CSV files from Phase I should be linked using the Macro-enabled Excel file (version 2019) developed for this protocol. (S2 File) The first file will contain geocoded patient address information, the second file will contain census tract-level crime risk data, the third file will contain census tract-level poverty and unemployment data (or other study-specific socioeconomic data), and the fourth file will contain census-tract level education data (or other study-specific socioeconomic data). The geoidentifiers and associated data in these four files should be copied into the relevant sheets in the Excel file and the customized Macros–“ACSMatcher” and “CrimeMatcher”–should be run to link all data by census tract. Census tracts containing geocoded addresses may include missing data for some or all neighborhood risk indicators depending on the data available in ACS and Business Analyst at the time of abstraction. Though not required for this protocol, users wishing to edit or reference the coding of the ACSMatcher and/or CrimeMatcher Macros can do so by clicking “Macros”, “View Macros” and then “Edit” in S2 File.

2B. Creating coded census tracts.

Objective: Anonymize census tracts that are linked to neighborhood risk data and contain geocoded patient addresses

Requirements: Macro-enabled Excel File (S2 File)

Detailed instructions: Part 7 of S1 File

To minimize the risk of patients being identified from their census tract information, census tracts that have been linked to socioeconomic and crime risk data will need to be stripped of their census-designated geoidentifiers and assigned a randomly generated number to create anonymized “coded” census tracts. First, for each study record in the Excel file, a random digit should be generated using Excel’s “RAND” function. Second, each randomly generated digit should then be ranked using Excel’s “RANK” function to convert random digits to whole integers. Third, a unique site identifier should be attached to each ranked digit using Excel’s “CONCATENATE” function. Assigning a site-specific identifier to the coded census tract prevents duplication of coded census tracts across study sites. Last, each record should be stripped of its original census tract identifiers so that only the coded census tract is preserved. Since multiple patients may reside within the same census tract, Excel’s “Conditional Formatting” function should be used to ensure duplicate census tracts within each clinic site are assigned the same coded census tract identifier. Fig 2 diagrams the process for creating coded census tracts.

thumbnail
Fig 2. Process for creating coded census tracts during multisite study.

Process for stripping census tracts containing geocoded patient address information of their census-designated identifiers and assigning a randomly generated number to create anonymized “coded” census tracts. The design of Fig 2 was adapted from a figure published by Brokamp et al., 2018 in the open-access journal J Am Med Inform Assoc [51].

https://doi.org/10.1371/journal.pone.0278672.g002

Creating coded census tracts ensures that (1) a single residential address cannot be identified during analysis that occurs outside of the clinic setting where the data were originally generated, and (2) the data layer cannot be mapped or visually displayed.

The Excel file containing a study ID, coded census tract, and the linked socioeconomic and crime risk data for each record can then be sent from each clinic site to the coordinating institution for analysis using IRB-approved (if necessary) data sharing platforms. The coordinating institution can use the study ID assigned to each participant’s record to merge coded census tract data from multiple sites.

Results

Successfully implementing this protocol will yield indices of neighborhood crime and generate an index of neighborhood disadvantage at the census tract level (Table 1).

thumbnail
Table 1. Resulting indices of the neighborhood risk environment in example multisite study.

https://doi.org/10.1371/journal.pone.0278672.t001

The Neighborhood Disadvantage Index represents a summation of the census-tract level z-scores for each socioeconomic variable that have been normed to the national level. For example, a Z-score of 1 indicates one standard deviation greater disadvantage compared to the US. The coordinating institution can construct the Neighborhood Disadvantage Index once all site data have been merged. (S3 File provides step-by-step instructions for how to construct the Neighborhood Disadvantage Index.)

Crime indices are directly abstracted from Esri’s Business Analyst. These indices provide an indication of the relative risk of a crime occurring measured against the overall risk for the US. When a study participant’s coded census tract contains a value greater than 100 for the relevant crime index (e.g., for the Murder Rate Index as in this example protocol), this denotes an increased relative risk of murder or assault compared to the US. Conversely, a value less than 100 denotes a lower relative crime risk compared to the national level.

Depending on the size of the study population, continuous index data can be transformed into risk categories (e.g., binary splits, tertiles) by using cut-points appropriate for the sample distribution [45].

Discussion

As population health and health equity research increasingly focus on health determinants beyond the individual-level, practical and implementable tools that characterize the neighborhood environment while protecting individual confidentiality are paramount. This protocol offers a free, publicly available and user-friendly resource for generating measures of the neighborhood risk environment during multisite studies that rely on PHI. We effectively implemented this protocol in our prior work to identify multilevel resilience resources among African American/Black (AA/B) adults living with HIV [30]. We found in this prior work that despite the majority of study participants residing in more economically disadvantaged and higher crime neighborhoods compared to the US, AA/B adults living with HIV still identified resilience resources that helped them engage in care.

Our protocol offers four key improvements over current approaches to geocoding and data linkages [23, 2628]. First, an ongoing concern during clinic-based studies is the confidentiality of PHI. A main advantage of this protocol is that it should satisfy any site-specific security or IRB requirements to obtain census tract-level data because all geocoding and data linkages occur at the cohort site, which eliminates the need for PHI to leave the clinic-cohort site. Even for census tracts containing only one geocoded address, our process for creating coded census tracts ensures that a patient’s exact residential location cannot be known or visually displayed while allowing census tracts to be distinguishable across multiple sites. A second challenge during multisite studies is ensuring that geocoding and data linkages are performed systematically within and between sites. The Macros developed for this protocol provide an automated process for linking neighborhood-level indicators to census tract identifiers regardless of the number of participants enrolled at a given site or number of sites performing the linkages. Use of the Macros can help reduce human error and cognitive burden. Third, this protocol draws from the cadre of established methods [20, 21] for geocoding and linking residential data with neighborhood information. Using simple, validated methods rather than more complex geocoding approaches increases the likelihood that research personnel without prior GIS training will be able to implement this protocol with minimal oversight. Strengthening the geoanalytic skills of on-site study staff who are directly involved in a research project negates the need to solicit GIS expertise from non-study staff, which can help conserve time and resources and prevent exposing PHI to a third party. Last, a key advantage of this protocol is the flexibility of the variable/indicator selection process. As previously mentioned, the example contextual markers of neighborhood disadvantage and crime were informed by our prior experience [30] working with HIV clinical cohorts and evidence demonstrating associations between these markers and HIV outcomes [3740]. However, the same data abstraction process can be used to select indicators of crime and neighborhood disadvantage that are study-specific.

The current protocol utilizes a Murder Rate Index and an Assault Rate Index derived from the AGC CrimeRisk© indices. Other studies have used these indices, including to examine associations between neighborhood crime and sexual risk-taking behaviors among Black men living with HIV [46] and to assess for interactions between neighborhood crime, childhood trauma and longer-term mental health outcomes [47]. This protocol also utilizes Census Bureau ACS 5-year estimates. Prior large-scale studies have utilized ACS 5-year estimates, including to investigate county-level disparities in COVID-19 cases and deaths among racial/ethnic minority groups across the US [48]. These and other examples [49, 50] demonstrate that this protocol’s design is relevant for the current field of neighborhood health effects research and can be applied readily in future work.

Alongside this protocol, other efforts have aimed to develop standardized protocols for contextualizing neighborhood environments in research. The Decentralized Geomarker Assessment for Multi-Site Studies (DeGAUSS) software is one example tool that can facilitate reproducible geocoding while maintaining patients’ confidentiality [51]. The software is free and does not expose PHI to an intermediary party. However, to the best of our knowledge, DeGAUSS was tested during an initial proof-of-concept study but has yet to be extended to research requiring the anonymization of census tracts. The Women’s Interagency HIV Study (WIHS) [52] has also developed several versions of its geocoding protocol for PHI [53, 54] to help evaluate relationships between neighborhood-level poverty and disease control among women living with or at risk for chronic illness [55]. The WIHS protocol relies on Federal Information Processing Standards (FIPS) codes to define geographies across 10 cohort sites. FIPS codes have been published in census products for over 30 years and are widely used in public health geocoding [56]. In comparison, the current protocol uses census-derived identifiers because not all statistical geographic areas are covered by FIPS codes and a census-derived identifier can encompass both FIPS and Census Bureau codes [56]. The geocoding protocol developed for the Jackson Heart Study (JHS) [57] has also demonstrated considerable accuracy as a geocoding tool, retrospectively geocoding nearly 99% of the JHS cohort after address data were not obtained at baseline. The JHS protocol relied on georeferencing participants to census block groups while the current protocol utilizes census tracts which contain a greater number of residents, thereby minimizing the risk of identifying enrolled patients [58].

A limitation of this protocol is its reliance on the Esri StreetMap Premium annual license. The license can incur additional study costs if the product is not readily available at the institution or if geocoding activities span more than one year. Researchers choosing to include crime risk data in the neighborhood risk indices could also incur costs if additional Business Analyst credits (i.e., the currency used by ArcGIS) need to be purchased to facilitate the download of large amounts of data. However, the number of credits consumed by downloading data from Business Analyst for this protocol is quite small (e.g., 10 credits needed for a single download compared to the 6,000–10,000 credits included with an annual license [59]). Affiliates of any institution that licenses Esri software should have access to all data (including census-tract level crime risk data) within ArcGIS Business Analyst Online for no additional fee. Another limitation is that patient address information is derived from medical records rather than direct self-report. Thus, the percentage of records that can be matched to census tract boundaries will be predetermined by the data available in patients’ medical records. This protocol is also limited to measuring the physical neighborhood environment using objective data from the US census and other sources (e.g., Business Analyst), even though self-report can more accurately characterize neighborhood exposures in some instances [60, 61]. Also, while the anonymization of census tracts serves to safeguard PHI, it prohibits patients’ locations from being mapped or visually displayed. For studies aiming to map patients’ census tract locations or perform spatial analyses of disease dynamics, other methods may be more appropriate [6264]. Finally, while this protocol has been successfully implemented by research personnel with minimal or no prior GIS experience [30, 65], other project teams may wish to identify a GIS expert prior to study start who is available to answer protocol implementation questions if or when they arise.

Conclusions

This step-by-step geocoding and data linkage protocol addresses some of the common challenges of working with PHI and offers an adaptable resource to help assess neighborhood-level health impacts during multisite studies. Such resources can support population health research and multilevel interventions that aim to mitigate adverse neighborhood effects on health.

Supporting information

S1 File. Geocoding and data linkages protocol for protected health information (v2021).

https://doi.org/10.1371/journal.pone.0278672.s001

(PDF)

S2 File. Macro-enabled excel file.

Macro-enabled Excel file that can be used to (1) Link census tracts containing patient geocoded addresses to indicators of neighborhood crime and socioeconomic disadvantage using the census tract geoidentifier, and (2) Assign randomly generated identification numbers to census tracts and strip them of geoidentifiers to maintain patient confidentiality.

https://doi.org/10.1371/journal.pone.0278672.s002

(XLSM)

S3 File. Instructions for neighborhood disadvantage index construction.

https://doi.org/10.1371/journal.pone.0278672.s003

(PDF)

Acknowledgments

The authors gratefully acknowledge Bernadette Johnson and Sarah Dougherty-Sheff for their involvement in testing this protocol during its development and implementing this protocol during prior research. We also thank Sean McMahon for his excellent research assistance.

References

  1. 1. Dulin-Keita A, Clay O, Whittaker S, Hannon L, Adams IK, Rogers M, et al. The influence of HOPE VI neighborhood revitalization on neighborhood-based physical activity: A mixed-methods approach. Soc Sci Med. 2015;139: 90–99. pmid:26164364
  2. 2. Williams DR, Collins C. Racial residential segregation: A fundamental cause of racial disparities in health. Public Health Rep. 2001;116: 404–416. pmid:12042604
  3. 3. Pickett KE. Multilevel analyses of neighbourhood socioeconomic context and health outcomes: a critical review. J Epidemiol Community Heal. 2001;55: 111–122. pmid:11154250
  4. 4. Dulin-Keita A, Casazza K, Fernandez JR, Goran MI, Gower B. Do neighbourhoods matter? Neighbourhood disorder and long-term trends in serum cortisol levels. J Epidemiol Community Health. 2012; pmid:20736487
  5. 5. Barber S, Hickson DA, Kawachi I, Subramanian S V., Earls F. Neighborhood Disadvantage and Cumulative Biological Risk Among a Socioeconomically Diverse Sample of African American Adults: An Examination in the Jackson Heart Study. J Racial Ethn Heal Disparities. 2016;3: 444–456. pmid:27294737
  6. 6. Estrada-Martínez LM, Caldwell CH, Bauermeister JA, Zimmerman MA. Stressors in Multiple Life-Domains and the Risk for Externalizing and Internalizing Behaviors Among African Americans During Emerging Adulthood. J Youth Adolesc. 2012;41: 1600–1612. pmid:22722890
  7. 7. Williams DR. Stress and the Mental Health of Populations of Color: Advancing Our Understanding of Race-related Stressors. J Health Soc Behav. 2018;59: 466–485. pmid:30484715
  8. 8. Freedman VA, Grafova IB, Rogowski J. Neighborhoods and Chronic Disease Onset in Later Life. Am J Public Health. 2011;101: 79–86. pmid:20299643
  9. 9. Ross CE, Mirowsky J. Neighborhood Disadvantage, Disorder, and Health. J Health Soc Behav. 2001;42: 258. pmid:11668773
  10. 10. Shacham E, Lian M, Önen N, Donovan M, Overton E. Are neighborhood conditions associated with HIV management? HIV Med. 2013;14: 624–632. pmid:23890194
  11. 11. Brown AF, Ang A, Pebley AR. The Relationship Between Neighborhood Characteristics and Self-Rated Health for Adults With Chronic Conditions. Am J Public Health. 2007;97: 926–932. pmid:17395847
  12. 12. Durfey SNM, Kind AJH, Buckingham WR, DuGoff EH, Trivedi AN. Neighborhood disadvantage and chronic disease management. Health Serv Res. 2019;54: 206–216. pmid:30468015
  13. 13. Oakes JM, Andrade KE, Biyoow IM, Cowan LT. Twenty Years of Neighborhood Effect Research: An Assessment. Curr Epidemiol Reports. 2015;2: 80–87. pmid:28133588
  14. 14. Schinasi LH, Auchincloss AH, Forrest CB, Diez Roux A V. Using electronic health record data for environmental and place based population health research: a systematic review. Ann Epidemiol. 2018;28: 493–502. pmid:29628285
  15. 15. Robertson C. Towards a geocomputational landscape epidemiology: surveillance, modelling, and interventions. GeoJournal. 2017;82: 397–414. pmid:32214618
  16. 16. Grubesic TH, Matisziw TC. On the use of ZIP codes and ZIP code tabulation areas (ZCTAs) for the spatial analysis of epidemiological data. Int J Health Geogr. 2006;5: 58. pmid:17166283
  17. 17. Miller HJ, Wentz EA. Representation and Spatial Analysis in Geographic Information Systems. Ann Assoc Am Geogr. 2003;93: 574–594.
  18. 18. Boscoe FP, Ward MH, Reynolds P. Current practices in spatial analysis of cancer data: Data characteristics and data sources for geographic studies of cancer. Int J Health Geogr. 2004;3: 1–14. pmid:15574197
  19. 19. Rushton G, Armstrong MP, Gittler J, Greene BR, Pavlik CE, West MM, et al. Geocoding in Cancer Research. Am J Prev Med. 2006;30: S16–S24. pmid:16458786
  20. 20. Kirby RS, Delmelle E, Eberth JM. Advances in spatial epidemiology and geographic information systems. Ann Epidemiol. 2017;27: 1–9. pmid:28081893
  21. 21. Carroll LN, Au AP, Detwiler LT, Fu T, Painter IS, Abernethy NF. Visualization and analytics tools for infectious disease epidemiology: A systematic review. J Biomed Inform. 2014;51: 287–298. pmid:24747356
  22. 22. States United. Health Insurance Portability and Accountability Act of 1996. Public Law 104–191. US Statut Large. 1996;110: 1936–2103. Available: http://www.ncbi.nlm.nih.gov/pubmed/16477734 pmid:16477734
  23. 23. Ajayakumar J, Curtis AJ, Curtis J. Addressing the data guardian and geospatial scientist collaborator dilemma: how to share health records for spatial analysis while maintaining patient confidentiality. Int J Health Geogr. 2019;18: 30. pmid:31864350
  24. 24. Brownstein JS, Cassa CA, Kohane IS, Mandl KD. Reverse geocoding: concerns about patient confidentiality in the display of geospatial health data. American Medical Informatics Association Annual Symposium 2005; 905. Available: http://www.ncbi.nlm.nih.gov/pubmed/16779192 pmid:16779192
  25. 25. Curtis AJ, Mills JW, Leitner M. Spatial confidentiality and GIS: re-engineering mortality locations from published maps about Hurricane Katrina. Int J Health Geogr. 2006;5: 44. pmid:17032448
  26. 26. Kounadi O, Lampoltshammer TJ, Leitner M, Heistracher T. Accuracy and privacy aspects in free online reverse geocoding services. Cartogr Geogr Inf Sci. 2013;40: 140–153.
  27. 27. Präger M, Kurz C, Böhm J, Laxy M, Maier W. Using data from online geocoding services for the assessment of environmental obesogenic factors: a feasibility study. Int J Health Geogr. 2019;18: 13. pmid:31174531
  28. 28. Lemke D, Mattauch V, Heidinger O, Hense HW. Who Hits the Mark? A Comparative Study of the Free Geocoding Services of Google and OpenStreetMap. Gesundheitswesen. 2015; pmid:26154258
  29. 29. Cockings S, Dunn CE, Bhopal RS, Walker DR. Users’ perspectives on epidemiological, GIS and point pattern approaches to analysing environment and health data. Health Place. 2004;10: 169–182. pmid:15019911
  30. 30. Dulin AJ, Earnshaw VA, Dale SK, Carey MP, Fava JL, Wilson-Barthes M, et al. A Concept Mapping Study to Understand Multilevel Resilience Resources Among African American/Black Adults Living with HIV in the Southern United States. AIDS Behav. 2021;25: 773–786. pmid:32940827
  31. 31. Mugavero MJ, Lin H-Y, Allison JJ, Willig JH, Chang P-W, Marler M, et al. Failure to Establish HIV Care: Characterizing the “No Show” Phenomenon. Clin Infect Dis. 2007;45: 127–130. pmid:17554713
  32. 32. Napravnik S, Eron JJ, McKaig RG, Heine AD, Menezes P, Quinlivan E. Factors associated with fewer visits for HIV primary care at a tertiary care center in the SoutheAstern U.S. AIDS Care. 2006;18: 45–50. pmid:16938674
  33. 33. Howe CJ, Cole SR, Napravnik S, Eron JJ. Enrollment, Retention, and Visit Attendance in the University of North Carolina Center for AIDS Research HIV Clinical Cohort, 2001–2007. AIDS Res Hum Retroviruses. 2010;26: 875–881. pmid:20672995
  34. 34. Kruse CS, Smith B, Vanderlinden H, Nealand A. Security Techniques for the Electronic Health Records. J Med Syst. 2017;41: 127. pmid:28733949
  35. 35. Esri. Geocoding options properties. [Internet]. Esri ArcMap 10.5. 2017 [cited 17 Dec 2020]. Available: https://desktop.arcgis.com/en/arcmap/10.5/manage-data/geocoding/geocoding-options-properties.htm
  36. 36. Ratcliffe JH. Geocoding crime and a first estimate of a minimum acceptable hit rate. Int J Geogr Inf Sci. 2004;18: 61–72.
  37. 37. Pellowski JA, Kalichman SC, Matthews KA, Adler N. A pandemic of the poor: Social disadvantage and the U.S. HIV epidemic. Am Psychol. 2013;68: 197–209. pmid:23688088
  38. 38. Arnold M, Hsu L, Pipkin S, McFarland W, Rutherford GW. Race, place and AIDS: The role of socioeconomic context on racial disparities in treatment and survival in San Francisco. Soc Sci Med. 2009;69: 121–128. pmid:19443092
  39. 39. Latkin CA, German D, Vlahov D, Galea S. Neighborhoods and HIV: A social ecological approach to prevention and care. Am Psychol. 2013;68: 210–224. pmid:23688089
  40. 40. Latkin CA, Curry AD, Hua W, Davey MA. Direct and Indirect Associations of Neighborhood Disorder With Drug Use and High-Risk Sexual Partners. Am J Prev Med. 2007;32: S234–S241. pmid:17543716
  41. 41. Esri. Esri Crime Indexes Data Variable List [Internet]. 2020. [cited 17 Dec 2020]. Available: https://downloads.esri.com/esri_content_doc/dbl/us/Var_List_Crime-Indexes_Summer2020.pdf
  42. 42. Nau C, Sidell M, Clift K, Koebnick C, Desai J, Rohm-Young D. A commercially available crime index may be a reliable alternative to actual census-tract crime in an urban area. Prev Med Reports. 2020;17: 100996. pmid:31871880
  43. 43. United States Census Bureau. American Community Survey 5-Year Data (2009–2019) [Internet]. 2020. [cited 17 Dec 2020]. Available: https://www.census.gov/data/developers/data-sets/acs-5year.html
  44. 44. US Census Bureau. Understanding and Using American Community Survey Data: What All Data Users Need to Know [Internet]. 2020. [cited 17 Dec 2020] Available: https://www.census.gov/content/dam/Census/library/publications/2020/acs/acs_general_handbook_2020.pdf
  45. 45. Lamb KE, White SR. Categorisation of built environment characteristics: the trouble with tertiles. Int J Behav Nutr Phys Act. 2015;12: 19. pmid:25889014
  46. 46. Ojikutu BO, Bogart LM, Klein DJ, Galvan FH, Wagner GJ. Neighborhood Crime and Sexual Transmission Risk Behavior among Black Men Living with HIV. J Health Care Poor Underserved. 2018;29: 383–399. pmid:29503307
  47. 47. Lowe SR, Quinn JW, Richards CA, Pothen J, Rundle A, Galea S, et al. Childhood trauma and neighborhood-level crime interact in predicting adult posttraumatic stress and major depression symptoms. Child Abuse Negl. 2016;51: 212–222. pmid:26499372
  48. 48. Khanijahani A. Racial, ethnic, and socioeconomic disparities in confirmed COVID-19 cases and deaths in the United States: a county-level analysis as of November 2020. Ethn Heal. 2020; pmid:33334160
  49. 49. Tang X, Ohri-Vachaspati P, Abbott JK, Aggarwal R, Tulloch DL, Lloyd K, et al. Associations between Food Environment around Schools and Professionally Measured Weight Status for Middle and High School Students. Child Obes. 2014;10: 511–517. pmid:25343730
  50. 50. Liang Z. The Application of Community-Based Information from the American Community Survey in a Large Integrated Health Care Organization. Perm J. 2020;24. pmid:33635758
  51. 51. Brokamp C, Wolfe C, Lingren T, Harley J, Ryan P. Decentralized and reproducible geocoding and characterization of community and environmental exposures for multisite studies. J Am Med Informatics Assoc. 2018;25: 309–314. pmid:29126118
  52. 52. Adimora AA, Ramirez C, Benning L, Greenblatt RM, Kempf M-C, Tien PC, et al. Cohort Profile: The Women’s Interagency HIV Study (WIHS). Int J Epidemiol. 2018;47: 393–394i. pmid:29688497
  53. 53. Women’s Interagency HIV Study. Section 40: WIHS/MACS Geocoding Protocol [Internet]. 2018. [cited 17 Dec 2020]. Avialable: https://statepi.jhsph.edu/wihs/wordpress/wp-content/uploads/2019/03/Section_40-Geocoding-1018.pdf
  54. 54. Women’s Interagency HIV Study. Section 40: WIHS/MACS Geocoding Protocol [Internet]. 2014. [cited 17 Dec 2020]. Available: https://statepi.jhsph.edu/wihs/wordpress/wp-content/uploads/2016/09/Section_40-Geocoding-1114.pdfs
  55. 55. Cope AB, Edmonds A, Ludema C, Cole SR, Eron JJ, Anastos K, et al. Neighborhood Poverty and Control of HIV, Hypertension, and Diabetes in the Women’s Interagency HIV Study. AIDS Behav. 2020; pmid:31907676
  56. 56. United States Census Bureau. Understanding Geographic Identifiers (GEOIDs) [Internet]. 2020. [cited 31 Mar 2021]. Available: https://www.census.gov/programs-surveys/geography/guidance/geo-identifiers.html
  57. 57. Robinson JC, Wyatt SB, Hickson D, Gwinn D, Faruque F, Sims M, et al. Methods for Retrospective Geocoding in Population Studies: The Jackson Heart Study. J Urban Heal. 2010;87: 136–150. pmid:20187277
  58. 58. US Census Bureau. Glossary [Internet]. 2019. [cited 5 Apr 2021]. Available: https://www.census.gov/programs-surveys/geography/about/glossary.html#par_textimage_0
  59. 59. Esri. ArcGIS Business Analyst: Access Business Analyst [Internet]. 2020. [cited 22 Sep 2022]. Available: https://doc.arcgis.com/en/business-analyst/web/about-business-analyst.htm#ESRI_SECTION1_E79EF4AB549E4E9E9A27B03BF1FE65DB
  60. 60. Echeverría S, Diez-Roux A V., Shea S, Borrell LN, Jackson S. Associations of neighborhood problems and neighborhood social cohesion with mental health and health behaviors: The Multi-Ethnic Study of Atherosclerosis. Health Place. 2008;14: 853–865. pmid:18328772
  61. 61. Howe CJ, Siegel H, Dulin-Keita A. Neighborhood Environments and Sexual Risk Behaviors for HIV Infection Among U.S. Women: A Systematic Review. AIDS Behav. 2017;21: 3353–3365. pmid:28424970
  62. 62. Kolak M, Abraham G, Talen MR. Mapping census tract clusters of type 2 diabetes in a primary care population. Prev Chronic Dis. 2019;16:180502. pmid:31095922
  63. 63. Dijkstra A, Janssen F, De Bakker M, Bos J, Lub R, Van Wissen LJG, et al. Using Spatial Analysis to Predict Health Care Use at the Local Level: A Case Study of Type 2 Diabetes Medication Use and Its Association with Demographic Change and Socioeconomic Status. Timmer A, editor. PLoS One. 2013;8: e72730. pmid:24023636
  64. 64. Chrest DP, Wheaton WD. Using Geographic Information Systems to Define and Map Commuting Patterns as Inputs to Agent-Based Models. Methods Rep RTI Press. 2009;2009(12):906. pmid:20505785
  65. 65. Dulin AJ, Fava JL, Earnshaw VA, Dale SK, Carey MP, Wilson-Barthes M, et al. Development of Long and Short Forms of the Multilevel Resilience Resource Measure for African American/Black Adults Living with HIV. AIDS Behav. 2022;26: 2469–2484. pmid:35092536