Current Limitation: Prose-Described Derivation Chain
CHAD-KG
Morph-KGChad
CHAD-ASK
Provenance
The relationships between TAB-SEB (the abstract blueprint), its concrete implementations (Morph-KGChad, CHAD-ASK), and the produced artefacts (CHAD-KG, CSV datasets) are currently described only in natural language prose, dispersed across documentation articles, repository READMEs, and the blueprint itself. This limits the traversability of the blueprint → implementation → artefact derivation chain and prevents systematic cross-implementation comparability at the level of formal metadata. In the absence of typed provenance relations, an automated agent cannot determine which artefact was generated by which process, which implementation instantiates which blueprint phase, or which input datasets were consumed to produce a given output.
This gap is directly relevant to TAB-SEB's FAIR evaluation: the absence of formally typed provenance is among the factors that constrain the score to 62/79 (78.5%), as typed derivation relationships are a prerequisite for several higher-level FAIR indicators concerning reusability and machine-readable provenance.
Proposed Development: PROV-O Vocabulary Adoption
PROV-O
W3C
Machine-traversable
Research object
The proposed development is to formalise these relations using PROV-O, the W3C ontology for provenance information. Specific properties identified as applicable to the TAB-SEB derivation chain include:
prov:wasDerivedFrom
prov:wasGeneratedBy
prov:used
prov:wasAssociatedWith
prov:wasInformedBy
Adopting these vocabularies would enable machine-readable traversal of the full derivation chain — from the abstract blueprint specification, through its executable implementations, to the concrete artefacts those implementations produce. It would further support structured cross-implementation comparability: for any two implementations of TAB-SEB, a PROV-O graph would allow automated identification of which phases they share, which artefacts they consume or derive, and where their derivation paths diverge.
A complete PROV-O annotation would also make TAB-SEB and its instantiations registrable as interlinked, citable research objects, making the epistemic dependencies between the blueprint's methodological claims and the empirical artefacts that instantiate them explicit and machine-verifiable.