Scaling deployments
Scaling is accomplished by changing the number of replicas in a deployment. A replica is a copy of a pod that already contains a running service. By having multiple replicas of a pod, you can ensure that your deployment has the available resources to handle increasing load.
About this task
All of the ODM services can be scaled up or down in a cluster, except for the Rule Execution Server console. The console is used to deploy new versions of a decision service dynamically and notify all connected rule engines to pick up the newest version. Although you do not need a console for rule execution, it is useful to deploy new versions, notify decision services to execution components, and gather statistics for rule execution.
If a deployment is exposed publicly when you change the number of replicas, the service distributes the traffic to the available pods during the update. An available pod is an instance that can be accessed by users.
Procedure
Results
- DESIRED - The desired number of replicas of a pod, which you define when you create the deployment.
- CURRENT - The number of replicas currently running.
- READY - The number of replicas that are available to the users compared to the desired state.
- AVAILABLE - The number of replicas that are available to the users.