AWS Classroom Series – 05/Dec/2020 – Direct DevOps from Quality Thought

AWS Autoscaling

Autoscaling ensures application is able to handle load by increasing the number of ec2 instances when needed and decreasing the number of ec2 instances in the case of decreased load
Autoscaling is used to ensure the desired number of ec2 instances are always running.
Autoscaling Terms
- Minimum Size
- Desired capacity
- Maximum Size
Desired Capacity can be set to static number in such cases aws will ensure you will always have fixed set of ec2 instance running your application
To make Desired Capacity Dynamic, We can specify any cloud watch metric such as cpu, network io, memory load etc to determine the desired capacity
In the case of web servers & app servers as users increase or as load increases the stress will be more on CPU, so we will be using CPU metrics to determine the desired capacity
Auto Scaling Components
- Auto Scaling Groups:
  - We specify minimum, maximum & desired capacity
- Configuration Templates:
  - We can use Launch templates or Launch configurations to determine the ami and other ec2 instance details
- Scaling Options:
  - Dynamic
  - Fixed number
  - Manual Scaling

This is similar to Launch configuration. Launch template allows us to have multiple versions of the template.
Launch template also gives option for dynamic values while creating
Using Launch Template you can specify the same configuration as launch template, some of the values can be made dynamic and versioing is enabled.

ASG contains collection of EC2 instances for the purpose of automatic scaling & management.
ASG Group has features such as
- Health check replacements
- Scaling policies
Size of ASG depends on number of instances in Desired Capacity which can be adjusted on demand, manually or by using automatic scaling