Data life cycle: Sharing
What is data sharing?
Sharing data means making your data known to other people.
You can share your data with collaboration partners in the context of a collaborative research project, or you can publish your data to share it with the global research community and society at large.
It’s important to know that data sharing doesn’t mean open data or public data. You can choose to share your data with restricted access or even closed access. Moreover, sharing or publishing data is different from publishing a paper or a manuscript in a journal. Here we focused on data (i.e. raw observations and measurements, analysis workflows, code, etc), not on papers or articles.
Data sharing can be done at any time during the research data life cycle but, at the latest, data should be made available at the time of publication of articles that use the data to make scientific conclusions.
Why is data sharing important?
In a collaborative project, being able to easily share data makes research more efficient.
Sharing of data is a cornerstone of good science. It is a good research practice to ensure that data underlying research is preserved and made available to the research community and society at large. Sharing data is a prerequisite for making your research reproducible. To be useful for others, you should strive to make the shared data adhere to the FAIR principles.
In the EU, the ‘Open Data Directive’ (Directive (EU) 2019/1024) states that “Member States shall support the availability of research data by adopting national policies and relevant actions aiming at making publicly funded research data openly available (‘open access policies’), following the principle of ‘open by default’ and compatible with the FAIR principles.”
Many research funders, institutions and reputable journals/publishers now have data sharing mandates, from which you normally cannot opt out of unless there are legitimate reasons (ethical or legal reasons). Additional reasons to share your datasets:
- Ten reasons to share your data.
- Ask not what you can do for open data; ask what open data can do for you.
Even though it may not be possible to openly share all data because of ethical, legal, contractual, or intellectual property reasons, do strive to make data “as open as possible, as closed as necessary”.
Making the data as FAIR as possible will ensure that maximum value can be obtained out of it in future.
What should be considered for data sharing?
- If you are part of a collaborative research project, it is recommended to plan and establish the following in advance:
- The use of repositories and sharing services which allow controlled access to share your preliminary data with project partners.
- The use of storage solutions that guarantee shared, controlled and secure access to the data and appropriate data transfer.
- The deposition of your data to a public repository as early as possible. This saves a lot of trouble later on. Data can be put under embargo until you want to release it, e.g. at the time of article publication.
- The use of common data organisation, data formats, standards, data documentation and metadata.
- If you want to share or publish your data, you should:
- Make sure you have the rights to do so (i.e., are you the creator of the data?).
- Consider all possible ethical, legal, contractual, or intellectual property restrictions related to your data (GDPR, consent, patent, etc).
- Check funders and institutional requirements about data sharing policy and data availability.
- Establish if you need to limit reusability of your data for legitimate reasons (consider applying a specific licence).
- Make the data citable so that you can receive credit (use identifiers).
- Based on the considerations listed above, you should be able to determine the right type of access for you data. Even if the access to the data is restricted, it is good practice to openly and publicly share the metadata of your data.
- Open access: data is shared publicly. Anyone can access the data freely.
- Registered access or authentication procedure: potential users must register before they are able to access the data. The “researcher” status is guaranteed by the institution and the user agrees to abide by data usage policies of repositories that serve the shared data. Datasets that are shared via registered-access would typically have no restrictions besides the condition that data is to be used for research. Registered access allows the data archive to monitor who can access data, enabling reminders about conditions of use to be issued.
- Controlled access or Data Access Committees (DACs): data can only be shared with researchers, whose research is reviewed and approved by a Data Access Committee (DAC). DAC is an organization of one or more named individuals responsible for data release to external requestors based on specific criteria (research topics, allowed geographical regions, allowed recipients etc). Criteria established by DAC for data access are usually described on the website of the organization.
- Access upon request (not recommended): in order to manage this type of access a named contact is required for the dataset who would be responsible for making decisions about whether access is granted. The owner of the data must provide his/her contact in the documentation associated with the datasets (metadata). Metadata about the datasets must be open.
- Share and publish your data in professional deposition databases that provide the appropriate access type and licence:
- If there are discipline-specific repositories available for you data, this should be your primary choice. They will work towards a high level of FAIRness by recommending appropriate community standards for describing the data.
- If there are no suitable discipline-specific repositories for your data:
- Deposit the data in an Institutional repository, if there is one. These often provide stewardship and curation, helping to ensure that your dataset is preserved and accessible. Contact the Research Data Office function at your institution, if there is one.
- Deposit the data in a General purpose repository.
- If there isn’t any suitable repository that can harbour your controlled access data, it is recommended that you at least create a metadata record for the data in an Institutional or General purpose repository.
How to make research data compliant to gdpr.
Information on brokering data to data repositories on behalf of data producers
Prepare data and find repositories for publication.
How to transfer data files.
How to use identifiers for research data.
How to license research data.
Documentation and metadata
How to document and describe your data.
How to identify different research data types.
How to find appropriate storage solutions.