Best Practices: Choosing a Repository

Sharing your data in a repository is at the foundation of the FAIR Principles. This is what makes your data findable and accessible – so that it can be reused.

Repository Types

The type of repository you use will affect the findability of your data. Use of a generalist repository can facilitate cross disciplinary discovery of your data. Using a discipline specific repository will make your data discoverable to other researchers in that field.

Choosing a Repository

Not all repositories are made equal. Review this list of desirable characteristics before choosing your repository: Desirable Characteristics for All Data Repositories

Tools

Things to Consider

  • Will your data require controlled access?
    • Controlled access means that anyone trying to access your data must obtain your permission first.
      • Some of the reasons to use controlled access:
        • Your data is sensitive. This is data that may contain personally identifiable data about human subjects or protected populations.
        • Your data must comply with laws and regulations such as HIPAA.
        • Your data contains intellectual property.
  • Review the submission requirements prior to choosing a repository – make sure that your data is a good fit.
  • Does the repository require a certain data standard? If so, make sure you structure your data with that standard.
  • Does your chosen repository have fees for sharing or curation?
    • These fees can usually be written into your grant budget.
  • Does the repository provide a DOI or other persistent identifier? This is important for discoverability and citation.
  • If you haven’t already, register for an ORCiD. You will need an ORCiD to put your data in a repository.
  • How long does the repository promise to save the data?
  • What licensing does the repository require? For example, Dryad requires that you share your data with a CC0 license. See our page on best practices for copyright and data licensing.