
14 Jan 2023

Google Associate Cloud Engineer Lecture 29 (GCP Autoscaling Theory)

GCP autoscaling is a feature that automatically adjusts the number of virtual machines (VMs) in a managed instance group (MIG) as demand changes. It saves cost by removing underutilized VMs when load is low, and it keeps enough capacity available by adding VMs when load rises. An autoscaling policy can be driven by several signals, including average CPU utilization, load balancing serving capacity, and Cloud Monitoring metrics (including custom metrics).

For example, you can set a target CPU utilization for the group, such as 60%. The autoscaler adds VMs when average utilization rises above the target and removes VMs when it stays below the target, always within the minimum and maximum group sizes you define. Scaling on load balancer traffic works the same way, with the target expressed as a fraction of the backend's serving capacity. To account for predictable changes in demand, you can also add scaling schedules that raise the minimum number of VMs at particular times of the day or week.
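Utilization-based scaling can be pictured as a small calculation: work out how many VMs would bring average utilization back to the desired level, then clamp the result to the group's limits. The sketch below is a simplified model for intuition only, not the autoscaler's actual algorithm (which also applies stabilization and initialization periods). The function name `recommended_size` and its parameters are hypothetical and are not part of any Google Cloud API.

```python
import math


def recommended_size(current_vms, avg_cpu_pct, target_cpu_pct, min_vms, max_vms):
    """Simplified model (an assumption, not GCP's real algorithm).

    Returns the group size that would bring average CPU utilization
    down (or up) to roughly the target, clamped to [min_vms, max_vms].
    Utilization values are percentages, e.g. 90 for 90%.
    """
    if current_vms <= 0 or target_cpu_pct <= 0:
        raise ValueError("current_vms and target_cpu_pct must be positive")
    if min_vms > max_vms:
        raise ValueError("min_vms must not exceed max_vms")

    # Total load, measured in "percent of one VM" units.
    total_load = current_vms * avg_cpu_pct
    # Number of VMs needed so that each one runs at about the target.
    needed = math.ceil(total_load / target_cpu_pct)
    # Never go below the minimum or above the maximum group size.
    return max(min_vms, min(max_vms, needed))


if __name__ == "__main__":
    # 4 VMs at 90% average CPU with a 60% target: scale out.
    print(recommended_size(4, 90, 60, min_vms=2, max_vms=10))  # 6
    # 4 VMs at 30% average CPU with a 60% target: scale in.
    print(recommended_size(4, 30, 60, min_vms=2, max_vms=10))  # 2
```

The clamp in the last step is why the minimum and maximum group sizes matter: they bound both cost and capacity regardless of what the metric suggests.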

In addition to these signals, autoscaling can be driven by the amount of pending work rather than by the load on the VMs themselves. For example, a Cloud Monitoring metric such as the number of undelivered messages in a Pub/Sub subscription can tell the autoscaler how many VMs the backlog requires. This is useful for applications that experience sudden spikes in traffic, because the group grows as soon as work starts to queue up.
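Scaling on pending work can be pictured as dividing the backlog by the amount of work one VM is expected to handle. The sketch below illustrates that idea under stated assumptions: `size_for_backlog`, `pending_items`, and `items_per_vm` are hypothetical names chosen for this example, and the real autoscaler may behave differently in detail.

```python
import math


def size_for_backlog(pending_items, items_per_vm, min_vms, max_vms):
    """Illustrative sketch (assumed model, hypothetical names).

    Returns how many VMs are needed so that each VM is assigned at most
    `items_per_vm` of the pending items, clamped to [min_vms, max_vms].
    """
    if pending_items < 0:
        raise ValueError("pending_items must not be negative")
    if items_per_vm <= 0:
        raise ValueError("items_per_vm must be positive")
    if min_vms > max_vms:
        raise ValueError("min_vms must not exceed max_vms")

    # Round up: a partial share of work still needs a whole VM.
    needed = math.ceil(pending_items / items_per_vm)
    return max(min_vms, min(max_vms, needed))


if __name__ == "__main__":
    # 250 queued items, each VM handles 100: three VMs are needed.
    print(size_for_backlog(250, 100, min_vms=1, max_vms=10))   # 3
    # An empty queue still keeps the minimum number of VMs running.
    print(size_for_backlog(0, 100, min_vms=1, max_vms=10))     # 1
    # A very large spike is capped at the maximum group size.
    print(size_for_backlog(5000, 100, min_vms=1, max_vms=10))  # 10
```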

Overall, GCP autoscaling lets you optimize resource use and reduce costs by automatically resizing a managed instance group to match demand.

 

Watch the detailed series on Google Cloud: https://www.urduitacademy.com/courses/detail/Cloud_and_Devops#Google_Cloud_Platform