Scaling means using only the resources you need while keeping the flexibility to grow when required.
It’s essential to have an architecture capable of adapting to fluctuations in demand.
By designing a scalable architecture, you ensure that you pay only for the resources you are actually using at any given moment.
Without that flexibility, servers can receive more requests than they can handle, and when demand exceeds capacity the result is timeouts and outages.
Amazon EC2 Auto Scaling automatically adds or removes EC2 instances, adjusting capacity in line with demand.
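To make the idea of adjusting capacity in line with demand more concrete, here is a minimal sketch of a proportional scaling rule. It is an illustration only, not the algorithm the service itself runs: the function name `desired_capacity`, its parameters, and the sample numbers are all assumptions made for this example.

```python
import math


def desired_capacity(current_instances, metric_value, target_value, min_size, max_size):
    """Estimate how many instances keep the average per-instance metric near the target.

    Hypothetical helper for illustration; a simplified proportional rule.
    """
    if current_instances <= 0 or target_value <= 0:
        raise ValueError("current_instances and target_value must be positive")
    # Total load is roughly instances * average metric; dividing by the target
    # gives the instance count that would bring the average back to the target.
    needed = math.ceil(current_instances * metric_value / target_value)
    # Never go below the minimum or above the maximum group size.
    return max(min_size, min(max_size, needed))


# 4 instances averaging 80% CPU against a 50% target: scale out.
print(desired_capacity(4, 80.0, 50.0, min_size=2, max_size=10))  # 7
# 4 instances averaging 20% CPU against a 50% target: scale in.
print(desired_capacity(4, 20.0, 50.0, min_size=2, max_size=10))  # 2
```

The minimum and maximum bounds matter as much as the rule itself: the minimum keeps a baseline running during quiet periods, and the maximum caps cost if the metric spikes.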
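There are two approaches to scaling: Dynamic Scaling, which reacts to changes in demand as they happen, and Predictive Scaling, which forecasts demand from historical patterns and adds capacity ahead of time.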
Combining Dynamic and Predictive Scaling allows for faster scaling responses.
Image created by Amazon Web Services
The illustration shows that demand can fluctuate throughout the week.
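One way to picture how dynamic and predictive scaling can work together is sketched below. It assumes that, when both are active, the group follows whichever of the two asks for more capacity; treat that as an assumption of this example and check the service documentation for the exact behavior. The function name `combined_capacity` and the hourly numbers are hypothetical.

```python
def combined_capacity(forecast_capacity, dynamic_capacity, min_size, max_size):
    """Pick the larger of the forecast-based and reactive capacity, within group limits.

    Hypothetical helper for illustration only.
    """
    chosen = max(forecast_capacity, dynamic_capacity)
    return max(min_size, min(max_size, chosen))


# Made-up hourly values: what a forecast would schedule in advance,
# and what a reactive policy would ask for based on live metrics.
forecast = [2, 2, 6, 6, 3]
dynamic = [2, 4, 5, 8, 2]

plan = [combined_capacity(f, d, min_size=2, max_size=10) for f, d in zip(forecast, dynamic)]
print(plan)  # [2, 4, 6, 8, 3]
```

In this toy data the forecast has capacity in place before the expected peak in the third hour, while the reactive value covers the unexpected extra load in the fourth hour.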