-
Notifications
You must be signed in to change notification settings - Fork 18
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CPU-GPU sync contrary to comment #46
Comments
You are right that this is inconsistent with the comment. If I remember correctly, this might come down to the problem being solved. Timing this is also non-trivial. If you come to a conclusion about which way is faster, we can either fix the comment or make the code behave as described again. Thanks for looking into this! |
I have added you as a collaborator to the repository :) |
I think I know what's going on here:
Theoretically, this could be an issue for sparse evaluation points, but in my experiments I couldn't achieve a speed-up by deferring this sync, so I'm just going to sum all this up in the comment, referencing this issue, and leave things as-is. |
Good catch that there is a sync point happening anyway! |
Hi, sorry to bother you again. Seems that this line shouldn't be there, at least according to the comment above. I was able to achieve a significant speed-up by removing this
if
.The text was updated successfully, but these errors were encountered: