-
Notifications
You must be signed in to change notification settings - Fork 256
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
SageMaker LLM Falcon 40b fails to start #635
Comments
Hey pdifranc - thank you for the note. We have an open ticket with the SageMaker team to resolve. Will keep you updated on progress. |
Thanks!
Meanwhile, @pdifranc, now that Amazon Bedrock is generally available, you might want to try using QnABot with the Bedrock plugin. See https://github.com/aws-samples/qnabot-on-aws-plugin-samples
Sent from Workspace ONE Boxer
On Oct 1, 2023 11:16 AM, kroeter ***@***.***> wrote:
Hey pdifranc - we noticed this issue on Friday and have an open ticket with the SageMaker team to resolve. Will keep you updated on progress.
—
Reply to this email directly, view it on GitHub<#635 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/ACTSFHW6ZY35YXSRELPM5NTX5GCM5ANCNFSM6AAAAAA5OMA4B4>.
You are receiving this because you are subscribed to this thread.Message ID: ***@***.***>
|
## [5.4.2] - 2023-09-30 ### Added - Self-hosting web fonts. Font files used by QnABot UI are now served from QnABot server instead of using third party font provider. ### Updated - Security patches for npm and pip packages - Lambda runtimes updated to NodeJS 18 for CFN Bootstrap Lambda - SonarQube Quality Gates fix - Bluebird Promise Migration (Partial) - Utilize native promises supported in JavaScript - Full migration planned for v5.5.0. Done as a prerequisite for JavaScript SDK v3 migration planned for v6.0.0 - Remaining changes to be implemented in v5.5.0 include ./website, ./cfn, & ./templates ### Fixed - Fixed request signing issue when using Custom domain ([issue #605](#605)) - Fixed Sagemaker LLM deployment ([issue #635](#635)) - Fixed voice integration with LLM response - Fixed unsupported SSML tags - Fixed Kendra API retrieval bug
Thanks @pdifranc for reporting the issue. This problem affects v5.4.0, v5.4.1, and the version of v5.4.2 that you are using. To summarize the issue, a recent commit on the model repo (https://huggingface.co/tiiuae/falcon-40b-instruct/commit/ecb78d97ac356d098e79f0db222c9ce7c5d9ee5f) has broken compatibility with the model image used in QnABot. This problem has been resolved in the v5.4.2 release by specifying an earlier commit from the model repo (see https://github.com/aws-solutions/qnabot-on-aws/blob/main/templates/sagemaker-qa-summarize-llm/index.js#L53). If you are deploying from this repo, please pull and deploy the v5.4.2 tag/branch: https://github.com/aws-solutions/qnabot-on-aws/releases/tag/v5.4.2. If you deployed from the Implementation Guide, please pull and deploy the latest template. |
Describe the bug
When enabling the SageMaker endpoint with Falcon 40b, the endpoint fails to start
To Reproduce
just deploy the stack and enable LLM on a ml.g5.12xlarge instance
Expected behavior
The deployment is successful
Please complete the following information about the solution:
To get the version of the solution, you can look at the description of the created CloudFormation stack. For example, "(SO0189) QnABot [...] [v5.4.2]".
Screenshots
logs specified above
Additional context
Add any other context about the problem here.
The text was updated successfully, but these errors were encountered: