[Docs] Update Endpoint Deployment guide to specify advanced config options #1740

Merged 1 commit on Apr 4, 2024

Conversation

nikhil-sk
Contributor

Description

Adds the following details to the SageMaker endpoint deployment guide:

  1. Advanced endpoint configuration options (VolumeSizeInGB, ModelDataDownloadTimeoutInSeconds, ContainerStartupHealthCheckTimeoutInSeconds): https://docs.aws.amazon.com/sagemaker/latest/dg/large-model-inference-hosting.html

     We could either point to the AWS docs (ideal) or describe these options in our own docs.

  2. Deploying an uncompressed model: https://docs.aws.amazon.com/sagemaker/latest/dg/large-model-inference-uncompressed.html
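For readers unfamiliar with where these options go, here is a minimal sketch of the request fragments involved. It assumes the three options are set on a `ProductionVariants` entry of a SageMaker `CreateEndpointConfig` request, and that an uncompressed model is referenced through a `ModelDataSource` block in `CreateModel`, as the linked AWS pages describe. The helper names, the default values, and the example bucket and instance type are hypothetical and only for illustration; this is not the text added to the guide by this PR.

```python
from typing import Any, Dict


def build_production_variant(
    model_name: str,
    instance_type: str,
    volume_size_gb: int = 256,          # illustrative default, not a recommendation
    download_timeout_s: int = 1800,     # illustrative default
    health_check_timeout_s: int = 1800, # illustrative default
) -> Dict[str, Any]:
    """Build one ProductionVariants entry carrying the large-model options."""
    return {
        "VariantName": "AllTraffic",
        "ModelName": model_name,
        "InstanceType": instance_type,
        "InitialInstanceCount": 1,
        # Size of the storage volume attached to the hosting instance.
        "VolumeSizeInGB": volume_size_gb,
        # How long SageMaker may spend downloading model data.
        "ModelDataDownloadTimeoutInSeconds": download_timeout_s,
        # How long the container may take to pass its startup health check.
        "ContainerStartupHealthCheckTimeoutInSeconds": health_check_timeout_s,
    }


def build_uncompressed_model_source(s3_prefix: str) -> Dict[str, Any]:
    """Build a ModelDataSource block pointing at an uncompressed S3 prefix."""
    # An S3 prefix should end with "/" so that it names a "directory" of objects.
    if not s3_prefix.endswith("/"):
        s3_prefix += "/"
    return {
        "S3DataSource": {
            "S3Uri": s3_prefix,
            "S3DataType": "S3Prefix",
            "CompressionType": "None",
        }
    }


# Hypothetical names, for illustration only.
variant = build_production_variant("my-lmi-model", "ml.g5.12xlarge")
source = build_uncompressed_model_source("s3://my-bucket/my-model")
```

With boto3, the variant would typically be passed as `ProductionVariants=[variant]` to `create_endpoint_config`, and the source as the `ModelDataSource` field of the container definition passed to `create_model`. Check the linked AWS pages for the allowed value ranges and instance-type restrictions before relying on any of the defaults above.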

@nikhil-sk requested review from zachgk, frankfliu and a team as code owners April 4, 2024 18:45
@nikhil-sk changed the title from "Update Endpoint Deployment guide to specify advanced config options" to "[Docs] Update Endpoint Deployment guide to specify advanced config options" Apr 4, 2024
@nikhil-sk merged commit eeefc2e into deepjavalibrary:master Apr 4, 2024
2 checks passed
2 participants