Cloud scalability is a cloud-based infrastructure's capability to seamlessly increase or decrease its resources without causing performance drops or downtime. The scaling process in the cloud typically occurs in minutes once an admin updates the system.
Scaling down takes resources (computing power, storage, networking, etc.) out of the cloud infrastructure. Scaling up expands the system, which either happens by adding resources to existing instances (vertical scaling) or adding more copies of existing instances (horizontal scaling).