Selection, retention and destruction

Nobody knows exactly what data is going to be wanted or needed in the future, but you need to consider what should/has to be kept, and what can or should be deleted or disposed of.

Why select and appraise

It is not possible for all digital data to be kept forever but outside the archive and library communities there is no widespread recognition of the need to select data for curation. Instead there is a view that "storage is cheap so why don’t we just decide to keep everything". While that may in theory be technologically possible in practice there are four main objections to this view:

  1. Digital content expands. And "…if the growth of content (per byte or per object) keeps pace with the declining cost [of storage], then the real cost of keeping everything may actually be the same as it is now, or higher".
  2. Backup and mirroring increases costs. No digital preservation approach can survive without appropriate mirroring and backup systems. This instantly increases the storage cost by at least a factor of two.
  3. Discovery gets harder. Keeping everything means that the noise to signal ratio of searches will be high, requiring additional individual effort to ascertain which data is the intended target of a search.
  4. Managing and preserving is expensive. We must consider the cost of creating and managing preservation metadata, and the cost of preservation actions on data that does need to be retained.

From Appraise and Select Research Data for Curation – DCC and Australian National Data Service

Checklist

  • What data does my funder require me to keep?
  • What data does the university require me to keep?
  • What does legislation require me to keep or destroy (see Data Protection Act principles - “Personal data processed for any purpose or purposes shall not be kept for longer than is necessary for that purpose or those purposes”).
  • Is this data 'vital' to the project or organisation?
  • Do I have the legal and intellectual property rights to keep and re-use this data? If not, can these be negotiated?
  • Is there sufficient documentation to explain the data, and allow the data or record to be found wherever it ends up being stored?
  • If I need to pay to keep the data, can I afford it?
  • Is the data transient/transitory?
  • Ensure that what you keep is well documented – will anyone know what it is?
  • Store, name and organise appropriately – consider for example what will happen when you leave and the data is left for others to manage and use.
Further Information

Research Data material:

RDM Principles

Data Planning Documents

DMP Documents - Small file

Poster (pdf pdf)

University of Leicester Research Data Management poster

Brochure (pdf pdf)

Copies: researchdata@le.ac.uk 

Blog Data For Research Blog

Glossary

A-Z

books

Input and feedback

2017 - This website will be relaunched in spring/summer 2017. Feedback and input is welcome, so please let the RDM project team know what type of content you would like to be included on the website by contacting us at researchdata@le.ac.uk.

RDM news and events
  1. The University has adopted a simple data classification model which must be applied by the PI and/or data owner
  2. We share our RDM survey: overview findings presentation as well as the results of our 2015 survey on Your research data management needs
  3. "Introduction to Information Governance" training is now available via "Embedding Informatics in Clinical Education" (EICE). The University is registered as an institution so you can register against this or ask for an account via IGT@le.ac.uk.
  4. There is now a University Information Governance web presence
  5. Our Data Management Planning documents have been updated - hard copies can be obtained if you contact researchdata@le.ac.uk
  6. Jisc has published a new RDM strategy document - "Directions for RDM Management in UK Universities"
  7. The Partridge Review - HSCIC Data Release Review - Full report (3.58 Mb) and Summary
  8. The Economic and Social Research Council (ESRC) have published a new Research data policy underpinned by nine core principles and Data management plan guidance (26/03/2015)
  9. RDM Principles approved by the University.
  10. DMPonline version 4.0 - DCC "DMPonline" (Data Management Planning) v4 has gone live. You can use this web based form to create your Data Management Plan, and share it with your collaborators.
  11. Research funder specific web pages (accessible via 'Data Planning & Management - What you need to do') for data planning and management guidance (AHRC, BBSRC, EPSRC, ESRC, MRC, NERC, and STFC)
  12. 'Data Planning & Management - what you need to' - a web page to take you through what you need to do about data planning and management.
  13. 'RDM - The Movie' is released! The 4 minute digital story of research data work here at Leicester.

... more News and Events

Search RDM