The system's ability to cope with increased load without performance degradation.
Breaking a large dataset into smaller subsets (partitions) assigned to different nodes to enable unlimited horizontal scaling.
Scaling a system by adding more machines to the resource pool, rather than increasing the capacity of existing machines.
The metrics used to describe the pressure on a system, such as requests per second or the ratio of reads to writes.
Scalability just means 'adding more servers'.