This would run after `kg.tsv` has been generated and produce a report that lets us confirm the file was generated correctly. At its simplest, it could check that the number of rows is approximately 11,336,863 (the row count from the last generation).
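As a starting point, the row-count check could look something like the sketch below. It uses only the Scala standard library rather than ZStream, and the 5% tolerance, the `RowCountCheck` name, and the default `kg.tsv` path are assumptions for illustration. Whether a header line should be excluded from the count is also left open.

```scala
import scala.io.Source

object RowCountCheck {
  // Row count from the last generation, as quoted in this issue.
  val ExpectedRows: Long = 11336863L

  /** Counts lines one at a time, so the whole file is never held in memory. */
  def countRows(lines: Iterator[String]): Long =
    lines.foldLeft(0L)((n, _) => n + 1L)

  /** True when `actual` is within `tolerance` (a fraction, e.g. 0.05 for 5%) of `expected`. */
  def withinTolerance(actual: Long, expected: Long, tolerance: Double): Boolean =
    math.abs(actual - expected).toDouble <= expected.toDouble * tolerance

  def main(args: Array[String]): Unit = {
    // Hypothetical default path; pass the real location as the first argument.
    val path = args.headOption.getOrElse("kg.tsv")
    val source = Source.fromFile(path)
    val rows = try countRows(source.getLines()) finally source.close()
    // The 5% tolerance is an assumed threshold, not a project decision.
    val ok = withinTolerance(rows, ExpectedRows, tolerance = 0.05)
    println(s"$path: $rows rows, expected about $ExpectedRows: ${if (ok) "OK" else "CHECK"}")
    if (!ok) sys.exit(1)
  }
}
```

Exiting with a non-zero status on failure would let a build step or CI job stop when the count drifts too far.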
Some other stats that might be useful to track:
- Predicates by count
- Number of nodes with direct types
- Number of nodes by Biolink type
- Number of edges by type (e.g. how many `biolink:Gene --[GO:1234]--> biolink:Protein` edges we have)
- Some example nodes and edges
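Two of these stats (predicates by count and edges by type) could be gathered in a single pass over the file, roughly as sketched below. The column layout of `kg.tsv` is not described in this issue, so the `Columns` indices are hypothetical and would need to match the real file. The sketch uses plain standard-library folds; the same accumulation should translate to a fold over a ZStream.

```scala
object KgReport {
  // Hypothetical column positions (0-based); the real kg.tsv layout may differ.
  final case class Columns(subjectType: Int, predicate: Int, objectType: Int)

  // An edge type is (subject Biolink type, predicate, object Biolink type).
  type EdgeType = (String, String, String)

  final case class Report(
    predicates: Map[String, Long],
    edgeTypes: Map[EdgeType, Long]
  )

  // Increments the counter stored under `k`, starting from zero.
  private def bump[K](m: Map[K, Long], k: K): Map[K, Long] =
    m.updated(k, m.getOrElse(k, 0L) + 1L)

  /** Single pass over tab-separated lines, updating both counters per row.
    * Rows with fewer columns than the indices require will throw.
    */
  def summarize(lines: Iterator[String], cols: Columns): Report =
    lines.foldLeft(Report(Map.empty, Map.empty)) { (report, line) =>
      val fields = line.split("\t", -1) // -1 keeps trailing empty fields
      val predicate = fields(cols.predicate)
      val edgeType = (fields(cols.subjectType), predicate, fields(cols.objectType))
      Report(bump(report.predicates, predicate), bump(report.edgeTypes, edgeType))
    }

  /** Predicates sorted by descending count, ties broken alphabetically. */
  def predicatesByCount(report: Report): Seq[(String, Long)] =
    report.predicates.toSeq.sortBy { case (predicate, n) => (-n, predicate) }
}
```

Keeping the counts in a plain `Map` should be fine as long as the number of distinct predicates and edge types stays small relative to the row count.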
The main use of this report would be to make sure that we don't make a change that gets rid of a particular type of edge. Once we add qualifiers (#145), we could add a qualifier report as well to see how much detail we're adding.
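To catch a change that drops an edge type, the report from one generation could be compared against the previous one. A minimal sketch, assuming edge-type counts are kept as a map keyed by (subject type, predicate, object type); the `EdgeTypeDiff` name and that representation are illustrative, not an existing part of the pipeline:

```scala
object EdgeTypeDiff {
  // (subject Biolink type, predicate, object Biolink type)
  type EdgeType = (String, String, String)

  /** Edge types present in the previous report but absent (or zero) in the current one. */
  def missing(previous: Map[EdgeType, Long], current: Map[EdgeType, Long]): Set[EdgeType] =
    previous.keySet.filter(edgeType => current.getOrElse(edgeType, 0L) == 0L)
}
```

A non-empty result would be the signal to look at what changed before publishing the new file.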
We could implement this as a Scala script; it should be straightforward to implement with ZStream.