Skip to main content
Log in

Implementation of a workflow for publishing citeable environmental data: successes, challenges and opportunities from a data centre perspective

  • Published:
International Journal on Digital Libraries Aims and scope Submit manuscript

Abstract

In recent years, the development and implementation of a robust way to cite data have encouraged many previously sceptical environmental researchers to publish the data they create, thus ensuring that more data than ever are now open and available for re-use within and between research communities. Here, we describe a workflow for publishing citeable data in the context of the environmental sciences—an area spanning many domains and generating a vast array of heterogeneous data products. The processes and tools we have developed have enabled rapid publication of quality data products including datasets, models and model outputs which can be accessed, re-used and subsequently cited. However, there are still many challenges that need to be addressed before researchers in the environmental sciences fully accept the notion that datasets are valued outputs and time should be spent in properly describing, storing and citing them. Here, we identify current challenges such as citation of dynamic datasets and issues of recording and presenting citation metrics. In conclusion, whilst data centres may have the infrastructure, tools, resources and processes available to publish citeable datasets, further work is required before large-scale uptake of the services offered is achieved. We believe that once current challenges are met, data resources will be viewed similarly to journal publications as valued outputs in a researcher’s portfolio, and therefore both the quality and quantity of data published will increase.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Notes

  1. http://inspire.ec.europa.eu/.

  2. http://www.rcuk.ac.uk/research/datapolicy/.

  3. https://www.datacite.org/.

  4. http://www.doi.org/.

  5. http://eidc.ceh.ac.uk/.

  6. http://www.agi.org.uk/join-us/agi-groups/standards-committee/uk-g emini.

  7. http://www.git-scm.com/.

  8. http://www.json.org/.

  9. http://schema.datacite.org/.

  10. https://catalogue.ceh.ac.uk/.

  11. https://mds.datacite.org/static/apidoc.

  12. http://shortdoi.org/.

  13. http://www.crossref.org/.

  14. https://rd-alliance.org/.

  15. http://thomsonreuters.com/en.html.

    Fig. 3
    figure 3

    Metrics provided by the data journal Earth System Science Data including views and citations

References

  1. Lawrence, B., Jones, C., Matthews, B., Pepler, S., Callaghan, S.: Citation and peer review of data: moving towards formal data publication. Int. J. Digit. Curation 6(2), 4–37 (2011)

    Article  Google Scholar 

  2. Klump, J., Bertelmann, R., Brase, J., Diepenbroek, M., Grobe, H., Hock, H., Lautenschalger, M., Schindler, U., Sens, I., Wachter, J.: Data publication in the open access initiative. Data Sci. J. 5, 79–83 (2006)

    Article  Google Scholar 

  3. CODATA-ICSTI, Socha, Y. (ed.): Out of cite, out of mind: the current state of practice, policy, and technology for the citation of data. Data Sci. J. 12(0), CIDCR1–CIDCR7 (2013). doi:10.2481/dsj.OSOM13-043

  4. Arzberger, P., Schroeder, P., Beaulieu, A., Bowker, G., Casey, K., Laaksonen, L., Moorman, D., Uhlir, P., Wouters, P.: Promoting access to public research data for scientific, economic, and social development. Data Sci. J. 3, 135–152 (2004)

    Article  Google Scholar 

  5. Mayernik, M.: Bridging data lifecycles: Tracking data use via data citations workshop report. NCAR Technical Note NCAR/TN-494+PROC (2013). doi:10.5065/D6PZ56TX

  6. Assante, M., Candela, L., Castelli, D., Manghi, P., Pagano, P.: Science 2.0 repositories: time for a change in scholarly communication. D-Lib Mag. 21(1/2) (2015). doi:10.1045/january2015-assante

  7. Kratz, J.E., Strasser, C.: Making data count. Sci. Data 2, 150039 (2015). doi:10.1038/sdata.2015.39

    Article  Google Scholar 

  8. Duerr, R., Downs, R., Tilmes, C., Barkstrom, B., Lenhardt, W., Glassy, J., Bermudez, L., Slaughter, P.: On the utility of identification schemes for digital earth science data: an assessment and recommendations. Earth Sci. Inform. 4, 139–60 (2011). doi:10.1007/s12145-011-0083-6

    Article  Google Scholar 

  9. Callaghan, S., Donegan, S., Pepler, S., et al.: Making data a first class scientific output: data citation and publication by NERC’s environmental data centres. Int. J. Digit. Curation 7(1), 107–113 (2012)

    Article  Google Scholar 

  10. ESIP (Federation of Earth Science Information Partners): Data citation guidelines for data providers and archives. In: Parsons, M.A., Barkstrom, B., Downs, R.R., Duerr, R., Tilmes, C., ESIP Preservation and Stewardship Committee (eds.) ESIP Commons (2012). doi:10.7269/P34F1NNJ

  11. Klump, J., Huber, R., Diepenbroek, M.: DOI for geoscience data-how early practices shape present perceptions. Earth Sci. Inform. 9, 123–136 (2016). doi:10.1007/s12145-015-0231-5

  12. Ball, A., Duke, M.: How to Cite Datasets and Link to Publications. DCC How-to Guides. Digital Curation Centre, Edinburgh (2015). http://www.dcc.ac.uk/resources/how-guides

  13. Beresford, N.A., Barnett, C.L., Howard, B.J., Howard, D.C., Tyler, A.N., Bradley, S., Copplestone, D.: Observations of Fukushima Fallout in Great Britain. NERC Environmental Information Data Centre (2011). doi:10.5285/1a91c7d1-ec44-4858-9af2-98d80f169bbd

  14. Beresford, N.A., Barnett, C.L., Howard, B.J., Howard, D.C., Wells, C., Tyler, A.N., Bradley, S., Copplestone, D.: Observations of Fukushima fallout in Great Britain. J. Environ. Radioact. 114, 48–53 (2012). doi:10.1016/j.jenvrad.2011.12.008

    Article  Google Scholar 

  15. Haxton, T., Crooks, S., Jackson, C.R., Barkwith, A.K.A.P., Kelvin, J., Williamson, J., Mackay, J.D., Wang, L., Davies, H., Young, A., Prudhomme, C.: Future flows hydrology data. NERC Environ. Inf. Data Centre (2012). doi:10.5285/f3723162-4fed-4d9d-92c6-dd17412fa37b

    Google Scholar 

  16. Prudhomme, C., Haxton, T., Crooks, S., Jackson, C., Barkwith, A., Williamson, J., Kelvin, J., Mackay, J., Wang, L., Young, A., Watts, G.: Future flows hydrology: and ensemble of a daily river flow and monthly groundwater levels for use for climate change impact assessment across Great Britain. Eath Syst. Sci. Data 5, 101–107 (2013). doi:10.5194/essd-5-101-2013

    Article  Google Scholar 

  17. Royan, A., Prudhomme, C., Hannah, D.M., Reynolds, S.J., Noble, D.G., Sadler, J.P.: Climate-induced changes in river flow regimes will alter future bird distributions. Ecosphere 6(4), 50 (2015). doi:10.1890/ES14-00245.1

    Article  Google Scholar 

  18. British Library, DataCite: Working with the British Library and DataCite—a guide for Higher Education Institutions in the UK. British Library (2013). http://www.bl.uk/aboutus/stratpolprog/digi/datasets/WorkingWithDataCite_2013.pdf

  19. Botham, M., Roy, D., Brereton, T., Middlebrook, I., Randle, Z.: United Kingdom Butterfly Monitoring Scheme: collated indices 2011. NERC Environmental Information Data Centre (2012). doi:10.5285/ff55462e-38a4-4f30-b562-f82ff263d9c3

  20. Botham, M., Roy, D., Brereton, T., Middlebrook, I., Randle, Z.: United Kingdom Butterfly Monitoring Scheme: Species Trends 2011. NERC Environmental Information Data Centre (2013). doi:10.5285/cad2af6c-0c97-414c-8d5f-992741b283cf

  21. Botham, M., Roy, D., Brereton, T., Middlebrook, I., Randle, Z.: United Kingdom Butterfly Monitoring Scheme: Collated Indices 2012. NERC Environmental Information Data Centre (2013). doi:10.5285/7949cc99-76c4-4a3e-8c33-41a35b8b7777

  22. Botham, M., Roy, D., Brereton, T., Middlebrook, I., Randle, Z.: United Kingdom Butterfly Monitoring Scheme: Species Trends 2012. NERC Environmental Information Data Centre (2013). doi:10.5285/5afbbd36-2c63-4aa1-8177-695bed98d7a9

  23. Rauber, A., Pröll, S.: Scalable dynamic data citation—RDA-WG-DC position paper (2015). https://rd-alliance.org/groups/data-citation-wg/wiki/scalable-dynamic-data-citation-rda-wg-dc-position-paper.html. Accessed 25 June 2015

Download references

Acknowledgments

The authors would like to thank Rick Stuart and Pete Vodden for their work in helping to create the workflows and processes, and Rod Scott, Chris Johnson, Jon Cooper, Evgeniya Vetchinkina and Mike Wilson for developing technical solutions.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kathryn A. Harrison.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Harrison, K.A., Wright, D.G. & Trembath, P. Implementation of a workflow for publishing citeable environmental data: successes, challenges and opportunities from a data centre perspective. Int J Digit Libr 18, 133–143 (2017). https://doi.org/10.1007/s00799-016-0175-5

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00799-016-0175-5

Keywords

Navigation