topology-aware: try picking resources by hints first #545

klihub · 2025-07-01T07:49:13Z

This PR updates how topology hints are taken into account during resource allocation, especially when there are multiple devices with different HW locality allocated to a single container. In particular the PR

gives more priority for topology hints, bringing them just below explicitly annotated affinity
updates CPU allocation to try pick CPUs by pod resource API hints first
updates memory allocation to pick memory nodes by pod resource API hints first
updates the helm charts values to include agent setting defaults for enabling NRT and pod resource API

Notes: This behavioral change is currently not put behind a config option or an annotation yet. I intend to do that however, by pushing an additional commit.

marquiz

With a very quick pass on this, looks sane to me 😅 With a potentially "might not be a good idea" I'd suggest to start with an off-by-default configuration option.

Spotted one typo in one commit message With multiple devices allocated to a since container

klihub · 2025-07-01T08:32:51Z

With a very quick pass on this, looks sane to me 😅 With a potentially "might not be a good idea" I'd suggest to start with an off-by-default configuration option.

Yes, definitely. I'll put this behind an annotation so that it will be in effect only for containers annotated for it. And only take of the draft status once that commit is in place.

Spotted one typo in one commit message With multiple devices allocated to a since container

Thanks for spotting that! Fixed.

Set defaults for agent config in values. This makes it easy to enable pod resource API by passing this to helm install --set config.agent.podResourceAPI=true Signed-off-by: Krisztian Litkey <[email protected]>

Allow checking if a topology hint is based on pod resource API. Signed-off-by: Krisztian Litkey <[email protected]>

Give more priority for topology hints than earlier, putting them right below annotated affinities. This will give hints priority over memory pinning tightness, which is preferable when a container allocates multiple devices with different memory locality. Hints now have precedence over annotated memory type. This might be a bit questionable, since hints are implied while memory type annotations are explicit. We probably can live with this for the time being. Hints can be selectively dis- abled per pod or container to restore the earlier behavior. Signed-off-by: Krisztian Litkey <[email protected]>

If a container asks for at least as many exsclusive CPUs as it has pod resource API hints, try allocating CPUs by hints first. With multiple devices allocated to a single container, this can help in cases where the collective locality of devices forces allocation high in the pool tree, where we should prefer CPUs with locality to one of the devices and avoid other CPUs. For instance, devices with locality to NUMA node #0 and #3, or 'half of' sockets #0 and #1, and a request for 2 CPUs, we end up in the root pool. But we should only prefer allocating CPUs with locality to NUMA nodes #0 or #3 and avoiding any CPU with locality to node #1 or #3. Signed-off-by: Krisztian Litkey <[email protected]>

If a container has pod resource API hints, try allocating from and pinning to memory from nodes which hints indicate locality to. Signed-off-by: Krisztian Litkey <[email protected]>

Only try to pick resources by hints, if a container is annotated for it using 'pick-resources-by-hints.resource-policy.nri.io'. Signed-off-by: Krisztian Litkey <[email protected]>

fmuyassarov

If NRT is now enabled by default on the agent, I wonder if we still need the Helm based gate for toggling it. Since exposing resources via NRT seems to be the expected default (at least in Helm deployments), maybe we can simplify things by removing that extra switch, unless we're still concerned about generating large CRs? Just a thought.

fmuyassarov · 2025-07-02T07:12:14Z

but otherwise LGTM, as you already mentioned the annotation name is something that can be improved but I can't really suggest anything better since naming is always hard.

Signed-off-by: Krisztian Litkey <[email protected]>

klihub · 2025-07-02T08:01:02Z

If NRT is now enabled by default on the agent, I wonder if we still need the Helm based gate for toggling it. Since exposing resources via NRT seems to be the expected default (at least in Helm deployments), maybe we can simplify things by removing that extra switch, unless we're still concerned about generating large CRs? Just a thought.

I think NRT has already been on by default in the configuration. This was just implicit by leaving it out of the default configuration in values.yaml but having it default to enabled in the configuration CRD.

fmuyassarov

Thank you.

klihub requested review from kad, marquiz and fmuyassarov July 1, 2025 07:49

klihub force-pushed the devel/pick-resources-by-hints branch from a7edac8 to 00a510a Compare July 1, 2025 07:58

marquiz reviewed Jul 1, 2025

View reviewed changes

klihub force-pushed the devel/pick-resources-by-hints branch from 00a510a to 6b88f9e Compare July 1, 2025 08:31

klihub added 6 commits July 2, 2025 09:56

helm: set defaults for agent config in values.

061808f

Set defaults for agent config in values. This makes it easy to enable pod resource API by passing this to helm install --set config.agent.podResourceAPI=true Signed-off-by: Krisztian Litkey <[email protected]>

podresapi: allow checking hint type.

72fcbcf

Allow checking if a topology hint is based on pod resource API. Signed-off-by: Krisztian Litkey <[email protected]>

topology-aware: pick memory by podresapi hint(s) first.

10be158

If a container has pod resource API hints, try allocating from and pinning to memory from nodes which hints indicate locality to. Signed-off-by: Krisztian Litkey <[email protected]>

topology-aware: add pick-resources-by-hints annotation.

e86db46

Only try to pick resources by hints, if a container is annotated for it using 'pick-resources-by-hints.resource-policy.nri.io'. Signed-off-by: Krisztian Litkey <[email protected]>

fmuyassarov reviewed Jul 2, 2025

View reviewed changes

docs: document hint-based picking of resources.

7ea0bb5

Signed-off-by: Krisztian Litkey <[email protected]>

klihub force-pushed the devel/pick-resources-by-hints branch from 5b0e265 to 7ea0bb5 Compare July 2, 2025 07:54

klihub marked this pull request as ready for review July 2, 2025 07:54

fmuyassarov approved these changes Jul 2, 2025

View reviewed changes

klihub mentioned this pull request Jul 2, 2025

helm: set defaults for agent config in values. #546

Closed

fmuyassarov merged commit 7094685 into containers:main Jul 2, 2025
9 checks passed

klihub deleted the devel/pick-resources-by-hints branch July 2, 2025 09:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

topology-aware: try picking resources by hints first #545

topology-aware: try picking resources by hints first #545

Uh oh!

klihub commented Jul 1, 2025

Uh oh!

marquiz left a comment

Uh oh!

klihub commented Jul 1, 2025

Uh oh!

fmuyassarov left a comment

Uh oh!

fmuyassarov commented Jul 2, 2025

Uh oh!

klihub commented Jul 2, 2025

Uh oh!

fmuyassarov left a comment

Uh oh!

Uh oh!

Uh oh!

topology-aware: try picking resources by hints first #545

topology-aware: try picking resources by hints first #545

Uh oh!

Conversation

klihub commented Jul 1, 2025

Uh oh!

marquiz left a comment

Choose a reason for hiding this comment

Uh oh!

klihub commented Jul 1, 2025

Uh oh!

fmuyassarov left a comment

Choose a reason for hiding this comment

Uh oh!

fmuyassarov commented Jul 2, 2025

Uh oh!

klihub commented Jul 2, 2025

Uh oh!

fmuyassarov left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!