Commit 447629b
fix: don't treat in-progress binding states as terminal failures (#670)
HasBindingFailed previously returned true for any condition with
Status=False, regardless of the Reason. Several in-progress reasons on
the per-cluster binding conditions (Overridden, WorkSynchronized,
Applied, Available) are expressed as Status=False rather than
Status=Unknown — most notably WorkNotSynchronizedYetReason and
NotAvailableYetReason. The over-broad classification caused upstream
controllers (rollout, updaterun) to bail on bindings that were still
making forward progress.
Switch HasBindingFailed to consult an explicit allowlist of
non-terminal reasons (nonTerminalBindingFailureReasons) drawn from the
condition package: OverriddenPendingReason, WorkNotSynchronizedYetReason,
ApplyPendingReason, NotAvailableYetReason. Anything outside that set is
treated as terminal — the safer default, since wrongly classifying an
unknown reason as "transient" would stall rollouts forever.
Per code review feedback (the original draft used a `strings.Contains`
on the literal "failed", which is fragile against typos and convention
drift), the allowlist uses typed constants. New "Pending" / "NotYet"
reasons added to pkg/utils/condition must also be added to the
allowlist; this is documented inline.
Tests:
- TestHasBindingFailed_NonTerminalReasons covers each known transient
reason returning false, each known terminal reason returning true,
an unknown-reason fall-through to terminal, and a mixed-condition
case where a transient reason on one condition does not mask a
terminal failure on another.
- Existing TestHasBindingFailed (which uses camelCase reason strings
not in the allowlist) still passes — those legacy strings are
correctly classified as terminal.
- TestCheckClusterUpdateResult in the updaterun package was encoding
the previous bug: it asserted that WorkNotSynchronizedYet should
produce an error from checkClusterUpdateResult. Updated the case to
match corrected semantics (in-progress, no error) and added a new
case for the terminal SyncWorkFailedReason path. This aligns with
the existing TODO at execution.go:626 that called for distinguishing
recoverable from non-recoverable failures.
Fixes #648
Signed-off-by: Yetkin Timocin <ytimocin@microsoft.com>
Co-authored-by: Claude Code <noreply@anthropic.com>1 parent abfa3ef commit 447629b
3 files changed
Lines changed: 161 additions & 6 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
242 | 242 | | |
243 | 243 | | |
244 | 244 | | |
245 | | - | |
| 245 | + | |
| 246 | + | |
| 247 | + | |
246 | 248 | | |
247 | 249 | | |
248 | 250 | | |
| |||
258 | 260 | | |
259 | 261 | | |
260 | 262 | | |
| 263 | + | |
| 264 | + | |
| 265 | + | |
| 266 | + | |
| 267 | + | |
| 268 | + | |
| 269 | + | |
| 270 | + | |
| 271 | + | |
| 272 | + | |
| 273 | + | |
| 274 | + | |
| 275 | + | |
| 276 | + | |
| 277 | + | |
| 278 | + | |
| 279 | + | |
| 280 | + | |
| 281 | + | |
261 | 282 | | |
262 | 283 | | |
263 | 284 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
23 | 23 | | |
24 | 24 | | |
25 | 25 | | |
26 | | - | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
| 52 | + | |
| 53 | + | |
| 54 | + | |
| 55 | + | |
| 56 | + | |
27 | 57 | | |
28 | 58 | | |
29 | | - | |
30 | | - | |
31 | | - | |
32 | | - | |
| 59 | + | |
| 60 | + | |
| 61 | + | |
| 62 | + | |
| 63 | + | |
| 64 | + | |
| 65 | + | |
| 66 | + | |
33 | 67 | | |
| 68 | + | |
| 69 | + | |
| 70 | + | |
34 | 71 | | |
35 | 72 | | |
36 | 73 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
22 | 22 | | |
23 | 23 | | |
24 | 24 | | |
| 25 | + | |
25 | 26 | | |
26 | 27 | | |
27 | 28 | | |
| |||
326 | 327 | | |
327 | 328 | | |
328 | 329 | | |
| 330 | + | |
| 331 | + | |
| 332 | + | |
| 333 | + | |
| 334 | + | |
| 335 | + | |
| 336 | + | |
| 337 | + | |
| 338 | + | |
| 339 | + | |
| 340 | + | |
| 341 | + | |
| 342 | + | |
| 343 | + | |
| 344 | + | |
| 345 | + | |
| 346 | + | |
| 347 | + | |
| 348 | + | |
| 349 | + | |
| 350 | + | |
| 351 | + | |
| 352 | + | |
| 353 | + | |
| 354 | + | |
| 355 | + | |
| 356 | + | |
| 357 | + | |
| 358 | + | |
| 359 | + | |
| 360 | + | |
| 361 | + | |
| 362 | + | |
| 363 | + | |
| 364 | + | |
| 365 | + | |
| 366 | + | |
| 367 | + | |
| 368 | + | |
| 369 | + | |
| 370 | + | |
| 371 | + | |
| 372 | + | |
| 373 | + | |
| 374 | + | |
| 375 | + | |
| 376 | + | |
| 377 | + | |
| 378 | + | |
| 379 | + | |
| 380 | + | |
| 381 | + | |
| 382 | + | |
| 383 | + | |
| 384 | + | |
| 385 | + | |
| 386 | + | |
| 387 | + | |
| 388 | + | |
| 389 | + | |
| 390 | + | |
| 391 | + | |
| 392 | + | |
| 393 | + | |
| 394 | + | |
| 395 | + | |
| 396 | + | |
| 397 | + | |
| 398 | + | |
| 399 | + | |
| 400 | + | |
| 401 | + | |
| 402 | + | |
| 403 | + | |
| 404 | + | |
| 405 | + | |
| 406 | + | |
| 407 | + | |
| 408 | + | |
| 409 | + | |
| 410 | + | |
| 411 | + | |
| 412 | + | |
| 413 | + | |
| 414 | + | |
| 415 | + | |
| 416 | + | |
| 417 | + | |
| 418 | + | |
| 419 | + | |
| 420 | + | |
| 421 | + | |
| 422 | + | |
| 423 | + | |
| 424 | + | |
| 425 | + | |
329 | 426 | | |
330 | 427 | | |
331 | 428 | | |
| |||
0 commit comments