Skip to content
This repository has been archived by the owner on Oct 25, 2024. It is now read-only.

[LLM Runtime] enable MHA fusion for gptneox&dolly&starcoder&llama2-70b #567

Merged
merged 17 commits into from
Nov 1, 2023

Conversation

intellinjun
Copy link
Contributor

Type of Change

feature or bug fix or documentation or others
API changed or not:not

enable MHA fusion for gptneox&dolly&starcoder&llama2-70b

Description

detail description
JIRA ticket: xxx

Expected Behavior & Potential Risk

the expected behavior that triggered by this PR

How has this PR been tested?

how to reproduce the test (including hardware information)

Dependency Change?

any library dependency introduced or removed

Signed-off-by: intellinjun <jun.lin@intel.com>
intellinjun and others added 2 commits October 27, 2023 15:35
Signed-off-by: intellinjun <jun.lin@intel.com>
Signed-off-by: intellinjun <105184542+intellinjun@users.noreply.github.com>
@intellinjun
Copy link
Contributor Author

@airMeng airMeng marked this pull request as draft October 27, 2023 07:54
@DDEle DDEle added ITREX.cpp and removed draft labels Oct 30, 2023
@intellinjun
Copy link
Contributor Author

image

@intellinjun
Copy link
Contributor Author

image

@intellinjun
Copy link
Contributor Author

image

@intellinjun
Copy link
Contributor Author

image

Copy link
Contributor

@DDEle DDEle left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@airMeng
Copy link
Contributor

airMeng commented Oct 30, 2023

can you summary the comparison, before and after?

Signed-off-by: intellinjun <105184542+intellinjun@users.noreply.github.com>
@airMeng airMeng marked this pull request as ready for review October 31, 2023 06:32
@airMeng airMeng requested a review from ClarkChin08 October 31, 2023 06:34
Copy link
Contributor

@a32543254 a32543254 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Signed-off-by: intellinjun <jun.lin@intel.com>
Signed-off-by: intellinjun <jun.lin@intel.com>
Signed-off-by: intellinjun <jun.lin@intel.com>
@intellinjun
Copy link
Contributor Author

can you summary the comparison, before and after?

I used ci test to get the summary,there will be results tomorrow.

Signed-off-by: intellinjun <jun.lin@intel.com>
@intellinjun
Copy link
Contributor Author

starcoder still has accuracy issues, please don't merge

@intellinjun
Copy link
Contributor Author

intellinjun commented Nov 1, 2023

@VincyZhang VincyZhang merged commit 81dde20 into main Nov 1, 2023
@VincyZhang VincyZhang deleted the mha_fusion branch November 1, 2023 03:45
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants