Reviewer has chosen not to be Anonymous
Overall Impression: Good
Suggested Decision: Accept
Technical Quality of the paper: Good
Presentation: Good
Reviewer`s confidence: High
Significance: High significance
Background: Comprehensive
Novelty: Limited novelty
Data availability: All used and produced data (if any) are FAIR and openly available in established data repositories
Length of the manuscript: The length of this manuscript is about right
Summary of paper in a few sentences:
The paper describes a Knowledge Graph Metadata Specification, a set of 33 metadata elements that can be used to describe linked datasets. The paper describes the methodology used to obtain those properties and have also created a github repository to maintain that vocabulary. The paper also presents how those terms can be used to describe one Knowledge Graph.
Reasons to accept:
The paper is very well written and is easy to follow. It describes the process to identify those metadata elements and provides a comparison with previous approaches and a well structured discussion about the status and the future of the work.
The paper also has some supplementary contents in 2 github repositories, one for the Knowledge Graph Metadata Specification and another for the Food Claims Ontology.
About the metadata itself, I think the set of properties is good enough as it has been proposed by a community of experts. I wonder if there should be some mechanism established to maintain the set of properties in a more collaborative way…maybe having those shapes maintained in a spreadsheet like format from which they could be exported to SHACL (or ShEx). I may suggest the use of DCTAP for such templates as it has been done recently by PubChem: https://pubchem.ncbi.nlm.nih.gov/docs/rdf-schema where they keep the source of truth in a DCTAP spreadsheet and generate ShEx, SHACL, etc. shapes from it.
In fact the authors chose to publish and maintain the shapes in SHACL, which can be OK, but looking at the shapes, I think the shapes could have also been published in ShEx and could also be more readable and accessible to other processors implemented in ShEx.
In the introduction, when the authors describe related works, I would suggest them to mention rdf_config: https://github.com/dbcls/rdf-config which is used by DBCLS and also provides several metadata possibilities with the inclusion of example SPARQL queries.
In the case of the KG Schema field with the property dct:conformsTo, I wonder if it could contain more structured values than just an IRI. The reason is that there are different technologies for validation like ShEx/SHACL/… and it may be necessary to indicate that it has to conform to some schema using some technology or even some format (Compact syntax vs RDF syntax), so an automatic process could validate the contents.
In the case of the version, the value is a literal. Would it be possible to extend it to use IRIs to identify versions?
Listing 1 has two values for example, sh:example and :example, but I think the SHACL vocabulary doesn’t have the property sh:example
I wonder if the authors have taken into account other properties to describe SPARQL endpoints like the propertyPartition, classPartition, etc. which could be useful to both human users and clients to optimize query plans based on those numbers.
Reasons to reject:
I found no reason to reject the paper. Maybe the paper is not a full research paper providing something new but it can be a useful resource.
Nanopublication comments:
Further comments:
Maybe, the authos should try to promote their work so it can be adopted by the community.
1 Comment
meta-review by editor
Submitted by Tobias Kuhn on
The reviews clearly identify the strenghts of the metadata model that is proposed, but also discuss about the need to contextualise better the decisions that have been made, especially in line with existing efforts in other catalogues (e.g., open data portals), as well as technical means that are provided to analyse the metadata. This should be reviewed in the revised submission.
Oscar Corcho (https://orcid.org/0000-0002-9260-0753)