Skip to content Skip to footer

Data quality Edit me

How do you ensure the quality of research data?


Quality is a key attribute of research data. Data quality affects the reliability of research results and it is a key factor increasing the reusability of data for secondary research. Data quality control can take place at any stage during the research data lifecycle. That said, you should ensure that the necessary procedures are defined during data management planning.


  • What is your data collection mechanism? Quality control is most typically performed during data collection. The elements of data collection in your research will determine the quality measures you can take. Examples of such measures are:
    • setup data management working group (DMWG) that includes people who generate data, analyse data and data managers,
    • for data collection: DMWG to plan and define data dictionary (including validation rules) before collecting data,
    • for metadata collection: DMWG to plan and define metadata data templates,
    • use electronic data capture systems,
    • automated quality monitoring through tools, pipelines, dashboards,
    • training of study participants and researchers, surveyors or other staff involved,
    • adopting standards,
    • instrument calibrations,
    • repeated samples,
    • post collection data curation,
    • data peer-review.
  • Are there standards or established working practices for quality in your field of study? Certain areas such as clinical studies, or those involving Next Generation Sequencing have commonly working methods to ensure data quality.


Relevant tools and resources

Skip tool table
Tool or resource Description Related pages Registry
OpenRefine Data curation tool for working with messy data TeSS
REDCap REDCap is a secure web application for building and managing online surveys and databases. While REDCap can be used to collect virtually any type of data in any environment, it is specifically geared to support online and offline data capture for research studies and operations. Identifiers Data steward infrastructure Data steward research