Skip to content Skip to footer

Your role: Researcher

Applies to

PhD candidates, research grant applicants, project managers, group leaders, PIs

Scenario

The funding organisation I am applying to requires a data management plan (DMP). I have little experience in writing a DMP, and I am not sure of the level of detail I am required to provide. I have limited access to data management experts within my institution. I am considering using the RDMkit for my data management needs. I also hope to find useful references to local training about data management requirements, data archives and DMP tools.

I know the types and the approximate amount of data I will generate, but I have not thought about how to share data with my collaborators and how to store data securely. Initially, my plan was to buy a powerful computer and portable hard drive, but I am now thinking that I need to use a national computing infrastructure. The field I work in has well defined data and curation standards, for example, capturing information (metadata) about how to collect and sample my data. However, I am not yet familiar with the importance of storing provenance data, such as tool and database versions used in analysis.

Focus

  • Write data management plans, also in the context of grant applications
  • Ensure compliance with institution policy, including legal and ethical aspects
  • Ensure proper data organisation and storage
  • Ensure secure sharing, reproducibility and preservation of data
  • Transmits the good practices in RDM to his group

Getting started

  • Check out the various steps of the RDM life cycle, in particular the planning stage
  • Identify and contact the data steward in your local organisation or your national contact in the ELIXIR network
  • Start planning your project taking the DMP into account

Related pages

More information

Relevant tools and resources

Skip tool table
Tool or resource Description Related pages Registry
Argos Plan and follow your data. Bring your Data Management Plans closer to where data are generated, analysed and stored. Data management plan Data Steward: research
Arvados With Arvados, bioinformaticians run and scale compute-intensive workflows, developers create biomedical applications, and IT administrators manage large compute and storage resources. Data Steward: infrastructure Data Steward: policy Data analysis
Atlas Free, publicly available web-based, open-source software application developed by the OHDSI community to support the design and execution of observational analyses to generate real world evidence from patient level observational data. Data Steward: research TransMed Tool info Training
Beacon The Beacon protocol defines an open standard for genomics data discovery. Data Steward: research Data Steward: infrastructure Human data Tool info Standards/Databases Training
BIONDA BIONDA is a free and open-access biomarker database, which employs various text mining methods to extract structured information on biomarkers from abstracts of scientific publications Data storage Human data Proteomics Tool info
BMRB Biological Magnetic Resonance Data Bank Intrinsically disordered proteins Tool info
Bulk Rename Utility File renaming software for Windows Data organisation Data Steward: research
CEDAR CEDAR is making data submission smarter and faster, so that scientific researchers and analysts can create and use better metadata. Documentation and metadata Machine actionability Data Steward: research Tool info Standards/Databases
ChEMBL Database of bioactive drug-like small molecules, it contains 2-D structures, calculated properties and abstracted bioactivities. Data analysis Toxicology data Tool info Standards/Databases Training
Choose a license Choose an open source license Licensing Data Steward: research Data Steward: policy
Common Workflow Language (CWL) An open standard for describing workflows that are build from command line tools Data Steward: infrastructure Data analysis Standards/Databases Training
COPO Portal for scientists to broker more easily rich metadata alongside data to public repos. Documentation and metadata Plant sciences Machine actionability Plant Phenomics Plant Genomics Tool info Standards/Databases
Create a Codebook Examples and tools to create a codebook by the Data Documentation Initiative (DDI) Documentation and metadata Data Steward: research
Creative Commons License Chooser It helps you choose the right Creative Commons license for your needs. Licensing Data Steward: research Data Steward: policy
Crop Ontology The Crop Ontology compiles concepts to curate phenotyping assays on crop plants, including anatomy, structure and phenotype. Data Steward: research Data Steward: infrastructure Plant sciences Plant Phenomics Standards/Databases Training
DAMAP It guides you step by step through a DMP and lets you export a pre-filled DMP as a Word document that you can customize and use for submission to funders. Also, DAMAP is compatible with the RDA recommendation for machine-actionable DMPs and offers an export of JSON DMPs. DAMAP is open source and to be self deployed. Data management plan Data Steward: research
Data Curation Centre Metadata list List of metadata standards Documentation and metadata Data Steward: research
Data INRAE Dataverse for life sciences and agronomic related data Plant sciences Plant Genomics Data Steward: research Plant Phenomics Standards/Databases
Data Stewardship Wizard Publicly available online tool for composing smart data management plans Data management plan Data Steward: research Data Steward: infrastructure NeLS TSD Plant Phenomics Plant Genomics Tool info Training
Data Use Ontology DUO allows to semantically tag datasets with restriction about their usage. Data Steward: research Human data Standards/Databases Training
DATAVERSE Open source research data respository software. Data storage Data Steward: research Data Steward: infrastructure IFB Training
dbGAP The database of Genotypes and Phenotypes (dbGaP) archives and distributes data from studies investigating the interaction of genotype and phenotype in Humans Data publication Data Steward: infrastructure Human data Tool info Standards/Databases Training
DisGeNET A discovery platform containing collections of genes and variants associated to human diseases. Data analysis Human data Toxicology data Tool info Standards/Databases Training
DisProt A database of intrinsically disordered proteins Intrinsically disordered proteins Tool info Standards/Databases Training
DMP Canvas Generator Questionnaire, which generates a pre-filled a DMP Data management plan Data Steward: research
DMPlanner Semi-automatically generated, searchable catalogue of resources that are relevant to data management plans. Data management plan Data Steward: research
DMPRoadmap DMP Roadmap is a Data Management Planning tool Data management plan Data Steward: research
DMPTool Build your Data Management Plan Data management plan Data Steward: research
e!DAL-PGP Plant Genomics and Phenomics Research Data Repository Plant sciences Plant Genomics Data Steward: research Data Steward: infrastructure Data publication Documentation and metadata Plant Phenomics Standards/Databases
ECPGR Hub for the identification of plant genetic resources in Europe Plant sciences Data Steward: research
ELIXIR Deposition Databases for Biomolecular Data List of discipline-specific deposition databases recommended by ELIXIR. Data publication Data Steward: research Data Steward: infrastructure COVID-19 Data Portal NeLS IFB CSC Standards/Databases
EMBL-EBI Ontology Lookup Service EMBL-EBI’s web portal for finding ontologies Documentation and metadata Data Steward: research
EMBL-EBI's data submission wizard EMBL-EBI's wizard for finding the right EMBL-EBI repository for your data. Data publication Data Steward: research
ENA upload tool The program submits experimental data and respective metadata to the European Nucleotide Archive (ENA). Data Steward: infrastructure Data Steward: research Data brokering
EUDAT licence selector wizard EUDAT's wizard for finding the right licence for your data or code. Licensing Data Steward: research Data Steward: policy
EURISCO European Search Catalogue for Plant Genetic Resources Plant sciences Data Steward: research Plant Phenomics Tool info
Europe PMC Europe PMC is a repository, providing access to worldwide life sciences articles, books, patents and clinical guidelines. Tool info Standards/Databases Training
FAIDARE FAIDARE is a tool allowing to search data across dinstinct databases that implemented BrAPI. Data Steward: research Plant sciences IFB Plant Phenomics Plant Genomics Tool info
FAIR Implementation Profile The FIP is a collection of FAIR implementation choices made by a community of practice for each of the FAIR Principles. Project data management coordination Data management plan Data Steward: research Standards/Databases
FAIRDOMHub Data, model and SOPs management for projects, from preliminary data to publication, support for running SBML models, etc. (public SEEK instance) Data storage NeLS Documentation and metadata Microbial biotechnology Machine actionability Data Steward: research Standards/Databases
FAIRsharing A curated, informative and educational resource on data and metadata standards, inter-related to databases and data policies. Documentation and metadata Data publication Data Steward: policy Data Steward: research Microbial biotechnology Existing data Standards/Databases Training
FIP Wizard FIP Wizard is a toolset to facilitate the capture of data in FAIR Convergence Matrix questionnaire prompting communities to explicitly declare their FAIR Implementation Profiles. These profiles can be then stored and published as nanopublications. Project data management coordination Data management plan Data Steward: research
Galaxy Open, web-based platform for data intensive biomedical research. Whether on the free public server or your own instance, you can perform, reproduce, and share complete analyses. NeLS Marine Metagenomics Data analysis Data Steward: infrastructure IFB Galaxy Tool info Training
GENEID Geneid is an ab initio gene finding program used to predict genes along DNA sequences in a large set of organisms. Data analysis Tool info
Harvard Medical School - Electronic Lab Notebooks ELN Comparison Grid by Hardvard Medical School Documentation and metadata Identifiers Data Steward: research
How to License Research Data - DCC Guidelines about how to license research data from Digital Curation Centre Licensing Data Steward: research Data Steward: policy
HumanMine HumanMine integrates many types of human data and provides a powerful query engine, export for results, analysis for lists of data and FAIR access via web services. Data organisation Data Steward: research Human data Data analysis Tool info Standards/Databases Training
Linked Open Vocabularies (LOV) Web portal for finding ontologies Documentation and metadata Data Steward: research
LUMI EuroHPC world-class supercomputer Data analysis Data Steward: infrastructure CSC Tool info
MIADE Minimum Information About Disorder Experiments (MIADE) standard Documentation and metadata Data Steward: research Intrinsically disordered proteins
MIAPPE Minimum Information About a Plant Phenotyping Experiment Documentation and metadata Data Steward: research Plant sciences Plant Genomics Plant Phenomics Standards/Databases Training
MIGS/MIMS Minimum Information about a (Meta)Genome Sequence Documentation and metadata Data Steward: research Marine metagenomics Microbial biotechnology Standards/Databases
MIxS Minimum Information about any (x) Sequence Documentation and metadata Data Steward: research Marine metagenomics Plant Genomics Standards/Databases Training
MobiDB A database of protein disorder and mobility annotations Intrinsically disordered proteins Tool info Standards/Databases Training
MRI2DICOM a Magnetic Resonance Imaging (MRI) converter from ParaVision® (Bruker, Inc. Billerica, MA) file format to DICOM standard Data Steward: research XNAT-PIC
Multi-Crop Passport Descriptor (MCPD) The Multi-Crop Passport Descriptor is the metadata standard for plant genetic resources maintained ex situ by genbanks. Documentation and metadata Data Steward: infrastructure Data Steward: policy Plant sciences Plant Phenomics Plant Genomics Standards/Databases Training
OHDSI Multi-stakeholder, interdisciplinary collaborative to bring out the value of health data through large-scale analytics. All our solutions are open-source. Data Steward: research Data analysis Data storage TransMed Toxicology data Tool info
OnotoMaton OntoMaton facilitates ontology search and tagging functionalities within Google Spreadsheets. Data Steward: research Data Steward: infrastructure Documentation and metadata Identifiers
Ontobee A web portal to search and visualise ontologies Documentation and metadata Data Steward: research Standards/Databases
Open Definition Conformant Licenses Licenses that are conformant with the principles laid out in the Open Definition. Licensing Data Steward: research Data Steward: policy
OSF OSF (Open Science Framework) is a free, open platform to support your research and enable collaboration. Data storage Data Steward: research Training
PAA PAA is an R/Bioconductor tool for protein microarray data analysis aimed at biomarker discovery. Data analysis Human data Proteomics Tool info
PANGAEA Data Publisher for Earth and Environmental Science Data publication Documentation and metadata Data Steward: research Tool info Standards/Databases
PCDDB The Protein Circular Dichroism Data Bank Intrinsically disordered proteins Tool info
PDB The Protein Data Bank (PDB) Intrinsically disordered proteins Structural Bioinformatics Tool info Training
PIA - Protein Inference Algorithms PIA is a toolbox for mass spectrometrey based protein inference and identification analysis. Data analysis Proteomics Tool info
pISA-tree A data management solution for intra-institutional organization and structured storage of life science project-associated research data, with emphasis on the generation of adequate metadata. Microbial biotechnology Data Steward: research Data organisation Documentation and metadata Plant Phenomics Plant Genomics Tool info
PLAZA Access point for plant comparative genomics, centralizing genomic data produced by different genome sequencing initiatives. Plant sciences Plant Genomics Standards/Databases Training
R Markdown R Markdown documents are fully reproducible. Use a productive notebook interface to weave together narrative text and code to produce elegantly formatted output. Use multiple languages including R, Python, and SQL. Data analysis Training
RD-Connect Genome Phenome Analysis Platform The RD-Connect GPAP is an online tool for diagnosis and gene discovery in rare disease research. Human data Rare disease data Training
RDA Standards Directory of standard metadata, divided into different research areas Documentation and metadata Data Steward: research
Renamer4Mac File renaming software for Mac Data organisation Data Steward: research
Repository Finder Repository Finder can help you find an appropriate repository to deposit your research data. The tool is hosted by DataCite and queries the re3data registry of research data repositories. Data publication Data Steward: research
Research Data Management Organiser Supports the systematic planning, organisation and implementation of research data management throughout the course of a project Data management plan Data Steward: research Data Steward: infrastructure
Research Management Plan Machine actionable DMPs. Data management plan Data Steward: research
Research Object Crate (RO-Crate) RO-Crate is a lightweight approach to packaging research data with their metadata, using schema.org. An RO-Crate is a structured archive of all the items that contributed to the research outcome, including their identifiers, provenance, relations and annotations. Documentation and metadata Data storage Data organisation Data Steward: research Microbial biotechnology Machine actionability Data provenance Standards/Databases
Rightfield RightField is an open-source tool for adding ontology term selection to Excel spreadsheets Documentation and metadata Data Steward: research Microbial biotechnology Identifiers Machine actionability Tool info
Rstudio Rstudio notebooks allow to share code, documentation Data analysis Data Steward: infrastructure Tool info Training
SASBDB Small Angle Scattering Biological Data Bank Intrinsically disordered proteins
Scientific Data's Recommended Repositories List of respositories recommended by Scientific Data, contains both discipline-specific and general repositories. Data publication Data Steward: research Data Steward: infrastructure
Semares All-in-one platform for life science data management, semantic data integration, data analysis and visualization Data Steward: research Documentation and metadata Data analysis Data Steward: infrastructure Data storage
SIFTS Structure integration with function, taxonomy and sequence Intrinsically disordered proteins
Talend Talend is an open source data integration platform. Data Steward: research TransMed
The Genomic Standards Consortium (GSC) The Genomic Standards Consortium (GSC) is an open-membership working body enabling genomic data integration, discovery and comparison through international community-driven standards. Documentation and metadata Data Steward: infrastructure Data Steward: policy Human data Standards/Databases
The Open Biological and Biomedical Ontology (OBO) Foundry Collaborative effort to develob interoperable ontologies for the biological sciences Documentation and metadata Data Steward: research Standards/Databases
tranSMART Knowledge management and high-content analysis platform enabling analysis of integrated data for the purposes of hypothesis generation, hypothesis validation, and cohort discovery in translational research. Data Steward: research Data analysis Data storage TransMed Tool info
TXG-MAPr A tool that contains weighted gene co-expression networks obtained from the Primary Human Hepatocytes, rat kidney, and liver TG-GATEs dataset. Data analysis Toxicology data Tool info
UniProt Comprehensive resource for protein sequence and annotation data Documentation and metadata Intrinsically disordered proteins Microbial biotechnology Proteomics Structural Bioinformatics Tool info Standards/Databases Training
University of Cambridge - Electronic Research Notebook Products List of Electronic Research Notebook Products by University of Cambridge Documentation and metadata Identifiers Data Steward: research
Wellcome Open Research - Data Guidelines Wellcome Open Research requires that the source data underlying the results are made available as soon as an article is published. This page provides information about data you need to include, where your data can be stored, and how your data should be presented. Data publication Data Steward: research
WorkflowHub WorkflowHub is a registry for describing, sharing and publishing scientific computational workflows. Data publication Data Steward: research Tool info Standards/Databases Training
XNAT Open source imaging informatics platform. It facilitates common management, productivity, and quality assurance tasks for imaging and associated data. Data analysis TransMed XNAT-PIC Bioimaging data
XNAT-PIC Pipelines Analysing of single or multiple subjects within the same project in XNAT Data Steward: research Data analysis XNAT-PIC
XNAT-PIC Uploader Import tool for multimodal DICOM image datasets to XNAT Data Steward: research XNAT-PIC
Zooma Find possible ontology mappings for free text terms in the ZOOMA repository. Documentation and metadata Data Steward: research Tool info Training
National resources
RDM Guide

RDM Guide describes Belgian data management guidelines, resources, tools and services available for researchers in Life Sciences.

Data Steward: research
Galaxy Belgium

Galaxy Belgium is a Galaxy instance managed by the Belgian ELIXIR node, funded by the Flemish government, which utilizing infrastructure provided by the Flemish Supercomputer Center (VSC).

Galaxy
Data analysis
DMPonline.be

This instance of DMPonline is provided by the DMPbelgium Consortium. We can help you write and maintain data management plans for your research.

DMPRoadmap
Data Steward: research Data management plan
PIPPA

PIPPA, the PSB Interface for Plant Phenotype Analysis, is the central web interface and database that provides the tools for the management of the plant imaging robots on the one hand, and the analysis of images and data on the other hand.

Plant Phenomics Plant sciences Data Steward: research Data Steward: infrastructure Tool info
Belnet

Belnet is the privileged partner of higher education, research and administration for connectivity. We provide high-bandwidth internet access and related services for our specific target groups.

Data Steward: research Data Steward: infrastructure Data transfer
Galaxy MetaCentrum

Galaxy MetaCentrum is a Galaxy instance managed by the Czech ELIXIR node and e-INFRA. It provides extra support for RepeatExplorer tool for plant genomic analysis.

Galaxy
Data analysis Tool info
ownCloud@CESNET

CESNET-hosted ownCloud is a 100 GB cloud storage freely available for Czech scientists to manage their data from any research projects.

ownCloud
Data Steward: infrastructure Data storage Data organisation
Czech National Repository

National Repository (NR) is a service provided to the scientific and research communities in the Czech Republic to store their generated research data together with persistent DOI identifier. NR service is currently under the pilot program.

Data Steward: research Data Steward: infrastructure Data storage Existing data Identifiers Data management plan
GHGA

The German Human Genome-Phenome Archive.

Data storage Documentation and metadata Data Steward: research
PUBLISSO

Open access publishing platform for life sciences.

Data publication Data Steward: research
Galaxy Estonia

This is the Estonian instance of Galaxy, which is an open source, web-based platform for data intensive biomedical research.

Galaxy
Data analysis
Red Española de Supercomputación

The Spanish Supercomputing Network’s mission is to offer the resources and services of supercomputing and data management necessary for the development of innovative and high-quality scientific and technological projects, through competitive calls based on the scientific excellence of the projects to be developed.

Data Steward: research Data Steward: infrastructure
RedIRIS

Spanish academic and research network that provides advanced communication services to the scientific community and national universities.

Data Steward: research Data Steward: infrastructure
Recolecta

The national aggregator of open access repositories. This platform brings together all the Spanish digital infrastructures in which open access research results are published and / or deposited.

Data Steward: research Data Steward: infrastructure
Datos.gob.es

Open data portal of the spanish government. A meeting point for the various actors that make up the open data ecosystem.

Data Steward: research Data Steward: infrastructure
Chipster

Chipster is a user-friendly analysis software for high-throughput data such as RNA-seq and single cell RNA-seq. It contains analysis tools and a large reference genome collection.

CSC Data Steward: infrastructure Data analysis
DMPTuuli

Data management planning tool (Finland).

DMPRoadmap
CSC Data Steward: research Data management plan
Fairdata.fi

With the Fairdata Services you can store, share and publish your research data with easy-to-use web tools.

CSC Data Steward: research Data storage Data publication Existing data
Federated EGA Finland

FEGA allows you to store and shaare sensitive data in Finland in a way that fulfils all the requirements of the General Data Protection Regulation (GDPR).

CSC Data Steward: research Data sensitivity Data publication Existing data Human data
Findata

The Health and Social Data Permit Authority. Findata offers services and enables secure and efficient utilisation of data materials containing health and social data.

CSC Data Steward: research Data sensitivity Existing data Human data
Fingenious

Finnish Biobank Cooperative (FINBB) connects researchers to Finnish biomedical research. Via Fingenious® services the researcher can connect to all Finnish public bio banks.

CSC Data Steward: research Data sensitivity Human data
Sensitive Data Services for Research

CSC Sensitive Data Services for Research are designed to support secure sensitive data management through web-user interfaces accessible from the user’s own computer.

CSC Data Steward: research Data sensitivity Data analysis Data storage Data publication Human data
High performance computing

CSC Supercomputers Puhti, Mahti and LUMI performance ranges from medium scale simulations to one of the most competitive supercomputers in the world.

CSC Data Steward: research Data analysis
Cloud computing

CSC offers a variety of cloud computing services: the Pouta IaaS services and the Rahti container cloud service.

CSC Data Steward: research Data analysis
IceBear

A browser-based Research Data Management tool for protein cyrstallization that offers flexible crystal fishing workbench, no-typing submission for crystal shipment, and linking crystals and datasets including PDB depositions.

Data Steward: research Data analysis
DMP OPIDoR

Online questionnaire for the development of data management plans - repository of DMPs.

DMPRoadmap
IFB Data Steward: research Data management plan
Open-science.it

Italian portal dedicated to the field of open science.

Data Steward: research Data management plan
Health-RI Service Catalogue

Health-RI provides a set of tools and services available to the biomedical research community.

Human data Data analysis Existing data Data storage
BBMRI catalogue

Biobanking Netherlands makes biosamples, images and data findable, accessible and usable for health research.

Human data Data analysis Existing data Data storage
CBS, Statistics Netherlands

The national statistical office, Statistics Netherlands (CBS), provides reliable statistical information and data in the life sciences and health domain.

Human data Existing data
Technology Hotels

More than 130 Technology Hotels offer access to high-end technology and expertise in the field of bioimaging, bioinformatics, genomics, medical imaging, metabolomics, phenotyping, proteomics, structural biology, and/or systems biology.

Human data Bioimaging data Proteomics Compliance monitoring & measurement
Dutch COVID-19 Data Support Programme

To support investigators and health care professionals with tools and services in their search for ways to overcome the pandemic and its health consequences.

Human data Existing data
RIVM Health and Healthcare Data

The Dutch National Institute for Public Health and the Environment (RIVM), together with other organisations, provides numbers and explanation on relevant topics, to prevent duplication of data collection.

Human data Existing data
Handbook for Adequate Natural Data Stewardship

Guidelines on data stewardship and practical toolbox for researchers at Dutch University Medical Centres (UMCs).

Human data Data management plan Compliance monitoring & measurement
FAIR-Aware

Online tool which helps researchers and data managers assess how much they know about the requirements for making datasets findable, accessible, interoperable, and reusable (FAIR) before uploading them into a data repository.

Data management plan Compliance monitoring & measurement Data publication
BioData.pt Service Hub

BioData.pt Service Hub includes several data management resources, tools and services available for researchers in Life Sciences.

Data Steward: research Data analysis Data storage
BioData.pt Data Management Portal (DMPortal)

This instance of DataVerse is provided by the BioData.pt. We can help you write and maintain data management plans for your research.

DATAVERSE
Data Steward: research Data storage
BioData.pt Data Stewardship Wizard

Local instance of Data Stewardship Wizard. You can use this tool to create your own Data Management Plans.

Data Stewardship Wizard
Data Steward: research Data management plan
Ready for BioData Management

Capacity building program in data management for the life sciences to empower researchers and institutions in managing their data more effectively and efficiently.

Data Stewardship Wizard
Data management plan
DMPonline

DMPonline is a web-based tool that supports researchers to develop data management and sharing plans. It contains the latest funder templates and best practice guidelines to support users to create good quality DMPs.

DMPRoadmap
Data Steward: research Data management plan
CyVerse UK

The CyVerse Data Store is a cloud-based storage space, accessible via the CyVerse Discovery Environment (DE), a virtual bioinformatics lab workbench, and developer APIs such as the AGAVE API. In the DE, users can share datasets and tools to analyse data with as many or as few people as they wish.

Data Steward: research Documentation and metadata
Jisc Research data management toolkit

Guidance on the research data lifecycle that signposts resources from a wide range of organisations and websites.

Data Steward: research Documentation and metadata
Agrischema

Linked data schemas for the fields of agriculture, food, agri-business, plant biology.

Data Steward: research Documentation and metadata
InterMine

InterMine integrates heterogenous data sources, making it easy to query and analyse data.

Data Steward: research Documentation and metadata
Contributors