You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
sklearn.metrics is in a way much simpler, using plain fuctions. Can we do something analogous or even depend on scikit-learn for stuff like ROC, recall-precision etc.?
The text was updated successfully, but these errors were encountered:
Looking at this again. The main issue is that linkpred uses its own data structure (Scoresheet) to track prediction scores. This has at least two advantages:
Order of nodes is never a problem: (a,b) == (b,a) and ranking of pairs with the same scores is deterministic
Only node pairs for which there is a prediction need to be tracked, which is less memory-intensive. This is especially a concern for larger networks. E.g. 5000 nodes yield 12497500 node pairs.
Especially 2 is fundamentally different from scikit-learn.
The way forward is probably to replace Scoresheet with a Pandas Series, whose keys are all node pairs and whose values are scores. The index could be built prior to evaluation:
sklearn.metrics
is in a way much simpler, using plain fuctions. Can we do something analogous or even depend on scikit-learn for stuff like ROC, recall-precision etc.?The text was updated successfully, but these errors were encountered: