Advice on scaling strategy: 1 pod per transaction, each lasting 10–30s (with headroom) #6857
-
Hi everyone, I'm working on a use case where I want to autoscale a service with KEDA based on transactions per second (TPS), but with a twist: My questions:
If my goal is: 1 TPS → 5 pods what’s the cleanest way to implement that logic? Thanks for any guidance or best practices you can share! |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments
-
Hi, About the "headroom“. If you want something more than you get back you can use scaling modifiers. With these you can make sure that you add something to the value you get. See also: https://keda.sh/docs/2.17/concepts/scaling-deployments/#scaling-modifiers About flapping, you can use scaleUp and/or scaleDown behaviors and include a stabilizationWindowSeconds of for example 300 seconds to prevent rapid fluctuations (flapping) in the number of replicas. so;
see also: |
Beta Was this translation helpful? Give feedback.
-
Thanks, I'll have a closer look to Scaling modifiers. |
Beta Was this translation helpful? Give feedback.
Hi,
About the "headroom“. If you want something more than you get back you can use scaling modifiers. With these you can make sure that you add something to the value you get. See also:
https://keda.sh/docs/2.17/concepts/scaling-deployments/#scaling-modifiers
About flapping, you can use scaleUp and/or scaleDown behaviors and include a stabilizationWindowSeconds of for example 300 seconds to prevent rapid fluctuations (flapping) in the number of replicas.
so;
see also:
https://keda.sh/docs/2.17/reference/scaledobject-spec/