14

As the title says, I wish to publicly share a dataset used in one of my articles which has already been published (by MDPI, if that helps). I do not think the data is worthy of an article by itself, so I just want to upload them online and be able to link them to my already published article.

After some searching on the internet, I found that Mendeley Data does exactly what I need, but it is quite unclear if linking articles works only for Elsevier publications or for others too. Another possible solution is OSF, but it is also unclear whether it's possible to link a published article to the data.

If you can confirm that any of these services works in my situation, or propose other solutions, you would be helping greatly. Thanks in advance!

3
  • 1
    This site describes several (open science) data repositories, which is also the term you're looking for: guides.library.csupueblo.edu/data/repositories Commented Mar 30, 2022 at 11:05
  • 1
    Could you specify in which field of science you are working? There are domain specific data repositories existing. You might have at re3data.org to find an appropriate domain specific repository. The publishing workflow at Zenodo is very easy and fast to go through. Domain specific repositories might have a more complicated workflow. However, at the latter you have metadata schemata for your domain which allows -- when filled in properly -- other scientists to find your data more easily. Moreover, these repositories sometimes have additional services like web-processing of data. Commented Mar 31, 2022 at 6:28
  • 1
    paperswithcode.com/datasets if the domain is related to ML? Commented Mar 31, 2022 at 20:38

5 Answers 5

22

Have a look at Zenodo (https://zenodo.org/). It is free, gives you a DOI, and you can link your article with the data. It lets you upload 50 GB by default, but you can ask for more space if needed.

8

I've used Figshare; it's free, gives you a DOI, and you can make a link from the data back to the article yourself.

What is presumably more useful is the ability to make a link from the article to your newly uploaded data. In general I do that by somehow citing the data's DOI within the article when submitting, but since your article has already been published you will need to contact the publisher, as they are the ones that must create that link.

Edit: just to expand on the latter: I expect your article has a DOI - used in other papers when they cite it - which redirects to a webpage run by the publisher; that webpage is or has a link to the "Version of Record" of your article. It would be nice to also have a link to your source data. This second link has to be inserted by the publisher, since they control their webpage, but they first need to verify a couple of things:

  • is this the right data? Nobody ever accidentally clicks on the wrong file in the upload dialogue, of course... what if someone faked a screen name and then uploaded a version of the dataset maliciously modified so that the existing paper's analysis section now appears to be wrong or some form of misconduct?

  • is the data allowed to be shared? Have all the data's creators given permission? Study participants? Suitable licence?

Checking these at submission is straightforward: the Corresponding Author (submitter) provides the data (or link to it), and the Corresponding Author can sign off on any required permissions, ethical approval, etc. just as they do for the paper itself.

Making these checks after publication will be more problematic, and (personal opinion) I would be very wary of any process that allows this without direct communication with the original Corresponding Author.

(My concerns about validating data integrity are, of course, independent of which data-hosting platform is used!)

3

https://dataverse.org/ if your academic organization got an account. It is easy to use.

1

I think the OSF will work for your use case. Once you upload all of your files to a project, you can create a registration (a frozen copy of that project) which comes with a DOI. Like Lou mentioned earlier, you would need to contact the publisher to include that DOI in the article.

1

Springer Nature has this list of data repositories covering: mandated data types; biological sciences; chemistry and chemical biology; earth, environmental and space sciences; health sciences; materials science; physics; social science; and generalist repositories.

You must log in to answer this question.

Not the answer you're looking for? Browse other questions tagged .