-
Notifications
You must be signed in to change notification settings - Fork 10.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
server: Add "tokens per second" information in the backend #10548
Conversation
@ngxson Thank you for your suggestion. I’m also not very confident about the UI/UX part.
|
I haven't had time to look deeper into this, but seems like what you're doing is already handled by |
It doesn't get the correct value because What I'm thinking is:
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This code can be simplified further.
To pass the CI, you need to merge with latest upstream master branch
@ngxson Thanks~ |
…#10548) * add cmake rvv support * add timings * remove space * update readme * fix * fix code * remove empty line * add test --------- Co-authored-by: Xuan Son Nguyen <son@huggingface.co>
…#10548) * add cmake rvv support * add timings * remove space * update readme * fix * fix code * remove empty line * add test --------- Co-authored-by: Xuan Son Nguyen <son@huggingface.co>
Implement #10502