You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
While the KPIs define specific XPaths in order to perform quality assessment, we need to consider ISO XML Schema cardinality of various complexTypes.
For example, KPI 2 defines an XPath of /gmd:MD_Metadata/gmd:identificationInfo//gmd:citation/gmd:CI_Citation/gmd:title. In 19115/19139 proper, gmd:identification can occur 1..n times. For a WMCP document that defines, say, 3 gmd:identificationInfo elements, how should pywcmp evaluate?
test for all titles (3), thus bumping up the total by 8 foreach)
test for one title (the first?)
In reality I'm not sure how many GISCs are putting more than one gmd:identificationInfo per WCMP document, so should pywcmp check for all occurrences, or fix to the first? My gut would say the former for completeness, to which the total points will scale.
I would say, that such ambiguities shall be cleared in the "core profile", i.e. for WIS metadata we should add a rule telling that there should be just one title, one abstract etc. This does not prevent localization (optional translation of elements) as that is handled on different level (as Tom explained me). The reason why I think so, is that as a data consumer I would not know what to think about a product that has two descriptions or how to handle a product that claims to have two different data formats.
Therefore, I would prefer to have pywcmp implemented in such a way, that it will flag all such ambiguities. Then we could use it to scan the whole WIS catalogue. My gut feeling is that there will be very few ambiguities in the existing metadata records, but without a thorough check we will never know.
@josusky this is related to #125. Was the repeatability of titles and abstracts addressed in pywcmp? If not, may I recommend that you change the test to only evaluate the first instances?
While the KPIs define specific XPaths in order to perform quality assessment, we need to consider ISO XML Schema cardinality of various complexTypes.
For example, KPI 2 defines an XPath of
/gmd:MD_Metadata/gmd:identificationInfo//gmd:citation/gmd:CI_Citation/gmd:title
. In 19115/19139 proper,gmd:identification
can occur 1..n times. For a WMCP document that defines, say, 3gmd:identificationInfo
elements, how should pywcmp evaluate?In reality I'm not sure how many GISCs are putting more than one
gmd:identificationInfo
per WCMP document, so should pywcmp check for all occurrences, or fix to the first? My gut would say the former for completeness, to which the total points will scale.Thoughts?
cc @josusky
The text was updated successfully, but these errors were encountered: