Reviewer has chosen not to be Anonymous
Overall Impression: Excellent
Suggested Decision: Accept
Technical Quality of the paper: Excellent
Presentation: Good
Reviewer`s confidence: High
Significance: High significance
Background: Comprehensive
Novelty: Clear novelty
Data availability: All used and produced data (if any) are FAIR and openly available in established data repositories
Length of the manuscript: The length of this manuscript is about right
Summary of paper in a few sentences:
The paper presents RO-Crate as an approach to package all resources and their metadata into one object, enabling a high level of FAIRNess and supporting research reproducibility.
Reasons to accept:
Authors address a long lasting challenge in the scholarly communication domain: the representation of "compound object" as a mean to exchange all resources (used and produced) in the context of a research activity that are required to support the main objectives of Open Science: transparency, omni-comprehensive assessment and reward, and reproducibility of science.
The presented approach, RO-Crate, coherently builds upon past experiences (research object, wf4ever), state of the art technologies (json, RDF) with the support of an active community of researchers and developers of different domains.
The paper includes a conceptual description of RO-Crate, clearly explains the route that led to specific design decisions, and provides a more formal definition.
Possible applications and the list of tooling are very useful to understand the potential impact of RO-Crate and highlight how it can be adapted to address different challenges.
Reasons to reject:
I do not see reasons to reject the paper.
Nanopublication comments:
Further comments:
I have some specific comments/remarks that I suggest authors to address before publication.
The most important remark is that I had not a good experience with the RO-Crate of this publication.
I clicked on supplementary files 1128.html and went to https://datasciencehub.net/system/files/ds-supplementary-706-1128.html#r...
- conformsTo is missing
- the link to download returns 404
- the RO-Crate check says it is not a valid RO-Crate
The same happens for the version deposited on Zenodo (https://doi.org/10.5281/zenodo.5146227) linked in the abstract.
Section "Conceptual definition":
- Change ';' with ':' in "Figure 1 shows the main conceptual elements involved in an RO-Crate;"
- Figure 1: check the caption, there are missing words at "The RO is described within a , "
- Conceptual definition does not clarify what's the difference between a generic Research Object (as introduced in the previous section) and a RO-Crate
Section "Linked data as a foundation"
- Move the footnote about IRIs the first time you use the acronym, if you think the footnote is needed
Section "Data Entities are described using Contextual Entities"
The example of orcid is not very clear to me. Why would I need to include a contextual entity that describes the person? For example, as shown in listing 1, we have
"author": {"@id": "https://orcid.org/0000-0002-1825-0097"}
I can understand the suggestion to add
{ "@id": "https://orcid.org/0000-0002-1825-0097",
"@type": "Person",
"name": "Josiah Carberry"
},
for discoverability (and convenience) purposes, but following the linked data practices I could get the jsonld of the person with
`curl --location --request GET 'https://orcid.org/0000-0002-1825-0097' --header 'Accept: application/ld+json'`
- Figure 2 does not clearly show how the data entities can be linked to each other. Neither does listing 1.
Section "People"
- add links to all mentioned projects.
- Change ';' with ':' in "A key set of stakeholders are developers;"
- Table 1: check missing link for SCHeMa
Section "Profiles of RO-Crate in use"
- Cultural Heritages: remove the final 's'
Section "Machine-actionable Data Management Plans"
Pending brace at " RO-Crate {RDA maDMP Mapper"
Section 'Institutional data repositories – Harvard Data Commons"
- missing figure number at "in the metadata to allow their future reuse (Figure~)"
Section "FAIR Digital Objects"
- the link behind "As an “abstract protocol”" returns 404
Section "packaging workflows"
- Missing citation at "authors might be penalised for doing so [cite?] "
Section "Minimal RO-Crate"
- Align terms in the definition of minimal RO-crate (published vs datePublished)
- Check footnote 8
Section "Mapping to RDF with schema.org"
- The mapping for ContextualEntity(e) is missing
A couple of final questions, that can be addressed in authors' future work, is about the profiles. Could they limit the re-usability of RO-Crate and the data entities in it in different domain? Would the proliferation of profiles hinder the simplicity and set barriers to the adoption of the community tools that have been developed so far?
1 Comment
Meta-Review by Editor
Submitted by Tobias Kuhn on
I have to confess it has been a pleasure reading this paper since it presents, with a lot of details, an important resource for the scholarly community at large, in particular in the context of current practices (such as those concerning FAIR and EOSC). All the reviewers agreed on this aspect, even if they provided some insights and questions that should be appropriately addressed in the camera-ready and that I will carefully check before publication.
Silvio Peroni (https://orcid.org/0000-0003-0530-4305)