fix: handle corner cases in the async preemption #133213

sanposhiho · 2025-07-25T10:04:04Z

What type of PR is this?

/kind bug

What this PR does / why we need it:

See #133167, which was reverted due to a failing test. This PR contains additional fix to a test as well.

Which issue(s) this PR is related to:

Special notes for your reviewer:

Does this PR introduce a user-facing change?

NONE

Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.:

This reverts commit 006d762.

k8s-ci-robot · 2025-07-25T10:04:13Z

This issue is currently awaiting triage.

If a SIG or subproject determines this is a relevant issue, they will accept it by applying the triage/accepted label and provide further guidance.

The triage/accepted label can be added by org members by writing /triage accepted in a comment.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

k8s-ci-robot · 2025-07-25T10:04:23Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: sanposhiho

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~pkg/scheduler/OWNERS~~ [sanposhiho]
~~test/integration/scheduler/OWNERS~~ [sanposhiho]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

sanposhiho · 2025-07-25T10:04:30Z

/hold

Need approver's approval + I'd like to make sure if the test pass several times.

sanposhiho · 2025-07-25T10:20:59Z

pkg/scheduler/framework/preemption/preemption.go

-			errCh.SendErrorWithCancel(err, cancel)
+			errCh.SendError(err)


We shouldn't cancel context if the error is not found. So, I changed here to SendError and then cancel() at the receiver side

Canceling the context has another purpose as well - it terminates the Parallelizer().Until, i.e. if there is an error, we stop preempting the rest of the pods.

So, I think we need to check apierrors.IsNotFound(err) here and handle it appropriately.

Yes, so calling cancel() at the receiver side is equivalent, right?

Checking the not found error here would be a bit tricky and I prefer the current impl, if you don't have the objection there

Yes, so calling cancel() at the receiver side is equivalent, right?

No, Parallelizer().Until() blocks until all goroutines are finished. Then, the error is checked. So, in the current implementation, you would cancel the context after all pods were tried to be preempted.

Also, the error channel should be called only once (as the length of the buffer is 1). Currently, if one preemptPod would return Not Found error and second will return another error, we would only handle the Not Found one.

I think, the simplest would be just to set an allPodsAlreadyDeleted boolean atomic in preemptPod

No, Parallelizer().Until() blocks until all goroutines are finished.

It's a good point, fixed.

sanposhiho · 2025-07-25T10:21:34Z

test/integration/scheduler/preemption/preemption_test.go

+					st.MakePod().Name("medium").Priority(mediumPriority).Req(map[v1.ResourceName]string{v1.ResourceCPU: "3"}).Obj(),
 				},
 				{
-					st.MakePod().Name("high").Priority(highPriority).Req(map[v1.ResourceName]string{v1.ResourceCPU: "3"}).Obj(),
+					st.MakePod().Name("high").Priority(highPriority).Req(map[v1.ResourceName]string{v1.ResourceCPU: "4"}).Obj(),


this doesn't matter. it's just that the test name didn't match what it tests

dom4ha · 2025-07-25T12:38:43Z

pkg/scheduler/framework/preemption/preemption.go

+				cancel()
+				result = metrics.GoroutineResultError
+			default:
+				allPodsAlreadyDeleted = false


I think we shouldn't "disable activate on finish" here, because we don't rely on notifications sent from pods that were removed asynchronously. We explicitly call removing the last pod synchronously to have a gurantee we don't miss the notification.

For that reason I'd rename allPodsAlreadyDeleted to something like noGuaranteedNotification, which would be false at the beginning and will turn to true after we send the last removal synchronously.

We explicitly call removing the last pod synchronously to have a gurantee we don't miss the notification.

Deleting the last pod synchronously is for completely another scenario: if all pods are deleted async and all pod/delete events arrive at the queue immediately, before the preemptor pod is un-gated, the preemptor pod could miss all pod/delete events.

What we need to do is "when all pods are already deleted after all, activate the preemptor pod". Because, if all pods are already deleted, the preemptor pod might miss pod/delete while it's being gated.
So, I don't think your proposed change would cover the corner case that we're solving here.

Revert "Revert "fix: handle corner cases in the async preemption""

a03c7d1

This reverts commit 006d762.

k8s-ci-robot added the needs-priority Indicates a PR lacks a `priority/foo` label and requires one. label Jul 25, 2025

k8s-ci-robot requested review from dom4ha and kerthcet July 25, 2025 10:04

k8s-ci-robot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Jul 25, 2025

sanposhiho force-pushed the second-trial-conor branch from 1f29ab1 to f6cc41c Compare July 25, 2025 10:19

sanposhiho commented Jul 25, 2025

View reviewed changes

dom4ha reviewed Jul 25, 2025

View reviewed changes

sanposhiho force-pushed the second-trial-conor branch from f6cc41c to 1d3ee06 Compare July 26, 2025 08:51

sanposhiho mentioned this pull request Jul 26, 2025

feat: trigger PreBindPreFlight in the binding cycle #133021

Open

fix: flake integration test

6d11c36

sanposhiho force-pushed the second-trial-conor branch from 1d3ee06 to 6d11c36 Compare July 28, 2025 09:59

sanposhiho requested a review from macsko July 28, 2025 10:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: handle corner cases in the async preemption #133213

fix: handle corner cases in the async preemption #133213

sanposhiho commented Jul 25, 2025

Uh oh!

k8s-ci-robot commented Jul 25, 2025

Uh oh!

k8s-ci-robot commented Jul 25, 2025

Uh oh!

sanposhiho commented Jul 25, 2025

Uh oh!

sanposhiho Jul 25, 2025

Uh oh!

macsko Jul 28, 2025

Uh oh!

macsko Jul 28, 2025

Uh oh!

sanposhiho Jul 28, 2025

Uh oh!

sanposhiho Jul 28, 2025

Uh oh!

macsko Jul 28, 2025

Uh oh!

macsko Jul 28, 2025

Uh oh!

macsko Jul 28, 2025

Uh oh!

sanposhiho Jul 28, 2025

Uh oh!

sanposhiho Jul 25, 2025

Uh oh!

dom4ha Jul 25, 2025

Uh oh!

sanposhiho Jul 26, 2025

Uh oh!

Uh oh!

fix: handle corner cases in the async preemption #133213

Are you sure you want to change the base?

fix: handle corner cases in the async preemption #133213

Conversation

sanposhiho commented Jul 25, 2025

What type of PR is this?

What this PR does / why we need it:

Which issue(s) this PR is related to:

Special notes for your reviewer:

Does this PR introduce a user-facing change?

Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.:

Uh oh!

k8s-ci-robot commented Jul 25, 2025

Uh oh!

k8s-ci-robot commented Jul 25, 2025

Uh oh!

sanposhiho commented Jul 25, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!