Tuesday 6 March 2018

Amazon SageMaker now supports Auto Scaling

You can now configure Auto Scaling for your endpoints from the Amazon SageMaker console, the AWS SDKs, and the AWS Auto Scaling API, which makes capacity management easier. With Amazon SageMaker, you define the type and number of instances behind each endpoint to deliver the scale your inferences require. If inference volume changes, you can change the type and number of instances that back each endpoint to accommodate the change. Auto Scaling makes that adjustment to inference capacity automatically, maintaining predictable performance at low cost.
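As a rough illustration of the SDK route, the sketch below builds the two requests that Python's boto3 sends through its `application-autoscaling` client: one registering an endpoint variant as a scalable target, and one attaching a target-tracking policy. The endpoint name, variant name, policy name, capacity limits, and target value are all hypothetical placeholders, and the helper functions are illustrative names, not part of any AWS library.

```python
from typing import Any, Dict

# Fixed identifiers that Application Auto Scaling uses for SageMaker variants.
SERVICE_NAMESPACE = "sagemaker"
SCALABLE_DIMENSION = "sagemaker:variant:DesiredInstanceCount"


def resource_id(endpoint_name: str, variant_name: str) -> str:
    """Build the resource ID format used for a SageMaker endpoint variant."""
    return f"endpoint/{endpoint_name}/variant/{variant_name}"


def scalable_target_request(endpoint_name: str, variant_name: str,
                            min_instances: int, max_instances: int) -> Dict[str, Any]:
    """Arguments for register_scalable_target: the instance-count bounds."""
    if min_instances > max_instances:
        raise ValueError("min_instances must not exceed max_instances")
    return {
        "ServiceNamespace": SERVICE_NAMESPACE,
        "ResourceId": resource_id(endpoint_name, variant_name),
        "ScalableDimension": SCALABLE_DIMENSION,
        "MinCapacity": min_instances,
        "MaxCapacity": max_instances,
    }


def target_tracking_policy_request(endpoint_name: str, variant_name: str,
                                   invocations_per_instance: float,
                                   policy_name: str = "example-invocations-policy") -> Dict[str, Any]:
    """Arguments for put_scaling_policy: track invocations per instance."""
    return {
        "PolicyName": policy_name,  # hypothetical name, chosen for this sketch
        "ServiceNamespace": SERVICE_NAMESPACE,
        "ResourceId": resource_id(endpoint_name, variant_name),
        "ScalableDimension": SCALABLE_DIMENSION,
        "PolicyType": "TargetTrackingScaling",
        "TargetTrackingScalingPolicyConfiguration": {
            "TargetValue": invocations_per_instance,
            "PredefinedMetricSpecification": {
                "PredefinedMetricType": "SageMakerVariantInvocationsPerInstance",
            },
        },
    }


def enable_auto_scaling(client: Any, endpoint_name: str, variant_name: str,
                        min_instances: int, max_instances: int,
                        invocations_per_instance: float) -> None:
    """Send both requests through an Application Auto Scaling client."""
    client.register_scalable_target(
        **scalable_target_request(endpoint_name, variant_name,
                                  min_instances, max_instances))
    client.put_scaling_policy(
        **target_tracking_policy_request(endpoint_name, variant_name,
                                         invocations_per_instance))
```

With AWS credentials configured, one possible call would be `enable_auto_scaling(boto3.client("application-autoscaling"), "my-endpoint", "AllTraffic", 1, 4, 100.0)`, where every argument after the client is a placeholder. Keeping the request builders separate from the call that sends them lets you inspect the parameters before anything reaches AWS.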

