How do you ensure the quality of research data?

Description

Quality is a key attribute of research data. It affects the reliability of research results and is a key factor in the reusability of data for secondary research. Data quality control can take place at any stage of the research data life cycle. Even so, you should define the necessary procedures during data management planning.

Considerations

  • What is your data collection mechanism? Quality control is most typically performed during data collection. The elements of data collection in your research will determine the quality measures you can take. Examples of such measures are:
    • set up a data management working group (DMWG) that includes the people who generate the data, the people who analyse it, and data managers,
    • for data collection: have the DMWG plan and define a data dictionary (including validation rules) before data are collected,
    • for metadata collection: have the DMWG plan and define metadata templates,
    • use electronic data capture systems,
    • automated quality monitoring through tools, pipelines and dashboards,
    • training of study participants and researchers, surveyors or other staff involved,
    • adopting standards,
    • instrument calibrations,
    • repeated samples,
    • post-collection data curation,
    • data peer review.
  • Are there standards or established working practices for quality in your field of study? Certain areas, such as clinical studies or studies involving next-generation sequencing, have commonly used methods for ensuring data quality.
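
As an illustration of the data dictionary with validation rules mentioned above, the following is a minimal Python sketch of how collected records could be checked automatically. The field names, types, ranges and allowed values are hypothetical and would in practice be agreed by the DMWG before data collection starts. The function name validate_record is likewise an assumption made for this example, not part of any particular tool.

```python
# Hypothetical data dictionary: every field name, type, range and
# allowed value below is invented for illustration only.
DATA_DICTIONARY = {
    "sample_id": {"type": str, "required": True},
    "age_years": {"type": int, "required": True, "min": 0, "max": 120},
    "sex": {"type": str, "required": True,
            "allowed": {"female", "male", "unknown"}},
    # Accept both int and float so that a weight of 70 is not rejected.
    "weight_kg": {"type": (int, float), "required": False,
                  "min": 0.5, "max": 500.0},
}


def validate_record(record, dictionary=DATA_DICTIONARY):
    """Return a list of human-readable problems found in one record."""
    problems = []
    for field, rule in dictionary.items():
        value = record.get(field)
        if value is None:
            # A missing value is only a problem for required fields.
            if rule.get("required", False):
                problems.append(f"{field}: missing required value")
            continue
        if not isinstance(value, rule["type"]):
            problems.append(
                f"{field}: unexpected type {type(value).__name__}")
            continue  # range checks make no sense on a wrong type
        if "allowed" in rule and value not in rule["allowed"]:
            problems.append(f"{field}: value {value!r} is not allowed")
        if "min" in rule and value < rule["min"]:
            problems.append(f"{field}: {value} is below {rule['min']}")
        if "max" in rule and value > rule["max"]:
            problems.append(f"{field}: {value} is above {rule['max']}")
    # Flag fields that the data dictionary does not define.
    for field in record:
        if field not in dictionary:
            problems.append(f"{field}: not defined in the data dictionary")
    return problems


# Two made-up records: the first is valid, the second has an
# out-of-range age and a sex code that is not in the allowed set.
records = [
    {"sample_id": "S001", "age_years": 34, "sex": "female", "weight_kg": 61.5},
    {"sample_id": "S002", "age_years": 340, "sex": "F"},
]

for rec in records:
    print(rec["sample_id"], validate_record(rec))
```

Running such a check at the point of data entry, or as a step in a collection pipeline, reports problems while they can still be corrected at the source. Electronic data capture systems typically offer comparable rule-based validation, so a hand-written check like this one is mainly useful where no such system is in place.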

Solutions

Relevant tools and resources

Tool or resource | Description | Tags | Registry
OpenRefine | Data curation tool for working with messy data | data quality |