Kubernetes amount of Pods vs amount of CPU requests

The point of requests is to provide the scheduler with information so it can figure out where to throw the pod in the cluster. Requests should be reasonable enough for the application to viably run. Limits should be set to prevent an application from starving the other applications on the same node of resources.

In addition to this, you want to scale your application with horizontal pod autoscalers. Out of the box, kubernetes favors horizontally scalable apps.