AWS EC2 Scaling

Scaling involves utilizing only the resources you need while maintaining the flexibility to grow as required.

It’s essential to have an architecture capable of adapting to fluctuations in demand.

By designing a scalable architecture, you ensure that you only pay for the resources you need at any moment.

AWS EC2 Auto Scaling

Servers can receive more requests than they can manage, leading to timeouts and outages when demand exceeds capacity.

AWS EC2 Auto Scaling enables the automatic addition or removal of EC2 instances, adjusting capacity in line with demand.

There are two approaches to scaling:

Dynamic Scaling: Adjusts in real-time to changing demand.
Predictive Scaling: Schedules the number of instances based on anticipated demand.

Combining Dynamic and Predictive Scaling allows for faster scaling responses.

demand

Image created by Amazon Web Services

The illustration shows that demand can fluctuate throughout the week.