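Hello Prof. Zhang, can the move_elision implementation be used together with FA2? Also, I ran a simple speed test and found that inference did not get any faster: it was actually somewhat slower than both the run without MLA and the run without move_elision.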
GPU memory usage is indeed noticeably lower than in experiment 4.