-
Notifications
You must be signed in to change notification settings - Fork 314
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
K-truss benchmarking #4593
Comments
In 24.06, we updated our K-Truss implementation by moving from cuHornet to an implementation matching our primitive and the performance improvement was several order of magnitude faster. As a summary, our earlier implementation run several iteration of our primitive ![]() ![]() ![]() ![]() ![]() ![]() ![]() On the memory side we are currently limited by the number of edge we can process in |
This is a great start. But I think we should also show comparison to other implementations (I saw nx, hornet, arkuda in a quick internet search, but any we can find). |
These are all single-gpu so far? Which GPU is being used? |
Correct. This issue will be updated incrementally. In the first part of this k-truss benchmarking issue, I am portraying how the new optimization improved our runtime from our previous one which was already faster than the cuHornet based implementation. The two fastest implementation in the literature on single GPU are KTrussExplorer and a linear algebraic based implementation which we are targeting to outperform. |
Yes I do also have single node multi GPU numbers which will be added shortly.
NVIDIA H100 80GB HBM3 |
Benchmark k-truss and compare to other existing implementations.
The text was updated successfully, but these errors were encountered: