You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The serverless architecture (API Gateway – Lambda - Sagemaker endpoint) is a common pattern
to deploy real-time ML inference microservices in production. Most ML models require data pre-processing and feature engineering, that cannot be easily implemented using the API gateway's request mapping templates. In some cases, raw data also needs to be enriched using reference datasets. This is where the proposed lambda-sagemakerendpoint construct is a better choice than the existing aws-apigateway-sagemakerendpoint construct. The Lambda provides the compute power to perform any required data pre-processing, feature engineering, and/or data enrichment. The proposed lambda-sagemakerendpoint can be combined with the existing apigateway-lambda construct to create apigateway-lambda-sagemakerendpoint.
Use cases
Real-time synchronous inference: the proposed lambda-sagemakerendpoint, combined with aws-apigateway, can be used for real-time synchronous inference, where requests from a client applications are forwarded to SageMaker endpoint, and predictions, produced by the endpoint’s model, are returned in the responses to the client application.
Real-time asynchronous inference: The proposed lambda-sagemakerendpoint pattern can be used in real-time asynchronous inference by integrating it with a messaging service, such as AWS SQS. This pattern is suitable for high-throughput applications, where API gateway limits, such as number of requests/second, payload size, etc., or heavy data pre-processing, create a bottleneck in the ML inference pipeline.
Proposed solution
The proposed patterns provide the following:
Implementation to use an existing SageMaker inference endpoint, or create a new one.
Build required roles permissions (e.g., Lambda to invoke SageMaker endpoint, logs, X-ray, SageMaker service role to create SageMaker resources, etc.).
VPC configuration (i.e., creating a new VPC, or using existing VPC) to be used by the SageMaker endpoint and Lambda.
The text was updated successfully, but these errors were encountered:
hnishar
changed the title
New Pattern: lambda-sagemakerendpoint
New Pattern: aws-lambda-sagemakerendpoint
Dec 16, 2020
The serverless architecture (API Gateway – Lambda - Sagemaker endpoint) is a common pattern
to deploy real-time ML inference microservices in production. Most ML models require data pre-processing and feature engineering, that cannot be easily implemented using the API gateway's request mapping templates. In some cases, raw data also needs to be enriched using reference datasets. This is where the proposed lambda-sagemakerendpoint construct is a better choice than the existing aws-apigateway-sagemakerendpoint construct. The Lambda provides the compute power to perform any required data pre-processing, feature engineering, and/or data enrichment. The proposed lambda-sagemakerendpoint can be combined with the existing apigateway-lambda construct to create apigateway-lambda-sagemakerendpoint.
Use cases
Proposed solution
The proposed patterns provide the following:
The text was updated successfully, but these errors were encountered: