Skip to content

adding orion consolidated analysis for 24, 120, 252 nodes#78671

Draft
ArthurChenCoding wants to merge 6 commits intoopenshift:mainfrom
ArthurChenCoding:main
Draft

adding orion consolidated analysis for 24, 120, 252 nodes#78671
ArthurChenCoding wants to merge 6 commits intoopenshift:mainfrom
ArthurChenCoding:main

Conversation

@ArthurChenCoding
Copy link
Copy Markdown
Contributor

@ArthurChenCoding ArthurChenCoding commented Apr 30, 2026

Summary

Enable Orion consolidated run for control-plane tests at 24, 120, and 252 node scales (we only have it for 6 nodes now)

Changes

Standard tests (10 total):

  • Add RUN_ORION: "true" environment variable (defer for 4.21)
  • Add openshift-qe-orion-consolidated chain after openshift-qe-control-plane

Tests Updated (10 total)

Tests Updated

Version Standard Tests (RUN_ORION: "true" + chain) Deferred Tests (RUN_ORION: "deferred", + chain)
4.21 - control-plane-120nodes, control-plane-24nodes
4.22 control-plane-252nodes, control-plane-120nodes, control-plane-24nodes -
4.23 control-plane-252nodes, control-plane-120nodes, control-plane-24nodes -
5.0 control-plane-120nodes, control-plane-24nodes -

Summary by CodeRabbit

  • Chores
    • Enabled Orion execution for multiple AWS perfscale control-plane test variants across OpenShift 4.21, 4.22, 4.23, and 5.0 nightly flows.
    • Added the Orion consolidated test chain to control-plane scaling test sequences for variants with Orion enabled.

@coderabbitai
Copy link
Copy Markdown
Contributor

coderabbitai Bot commented Apr 30, 2026

Walkthrough

Adds RUN_ORION environment settings and inserts the openshift-qe-orion-consolidated test chain into control-plane perfscale CI job sequences across AWS nightly configs for OpenShift 4.21, 4.22, 4.23, and 5.0; some variants set RUN_ORION: "true", others deferred.

Changes

Control-plane Orion enablement (AWS nightly CI)

Layer / File(s) Summary
Env / Flag additions
ci-operator/config/.../*__aws-4.21-nightly-x86.yaml, ...__aws-4.22-nightly-x86.yaml, ...__aws-4.23-nightly-x86.yaml, ...__aws-5.0-nightly-x86.yaml
Adds RUN_ORION to control-plane job steps.env blocks. Values vary by variant: some set "true", others set deferred (4.21 variants use deferred in the changed jobs).
Test chain wiring
ci-operator/config/.../*__aws-4.21-nightly-x86.yaml, ...__aws-4.22-nightly-x86.yaml, ...__aws-4.23-nightly-x86.yaml, ...__aws-5.0-nightly-x86.yaml
Appends - chain: openshift-qe-orion-consolidated into the steps.test / tests chain immediately after openshift-qe-control-plane for the control-plane job variants where RUN_ORION was added as "true" or deferred per the diffs.
Files touched (summary)
ci-operator/config/openshift-eng/ocp-qe-perfscale-ci/openshift-eng-ocp-qe-perfscale-ci-main__aws-4.21-nightly-x86.yaml, ...__aws-4.22-nightly-x86.yaml, ...__aws-4.23-nightly-x86.yaml, ...__aws-5.0-nightly-x86.yaml
YAML-only CI job configuration updates; no code, manifests, or docs changed.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

🚥 Pre-merge checks | ✅ 12
✅ Passed checks (12 passed)
Check name Status Explanation
Title check ✅ Passed The title clearly and specifically summarizes the main change: enabling Orion consolidated analysis for control-plane tests at 24, 120, and 252 node scales across multiple OpenShift versions.
Docstring Coverage ✅ Passed No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.
Linked Issues check ✅ Passed Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check ✅ Passed Check skipped because no linked issues were found for this pull request.
Stable And Deterministic Test Names ✅ Passed The custom check validates Ginkgo test names for stability. This PR modifies CI orchestration YAML files only, not Ginkgo test code, so the check is not applicable.
Test Structure And Quality ✅ Passed The PR modifies only CI/CD configuration YAML files setting environment variables and test chains, not Ginkgo test code, making the custom check inapplicable.
Microshift Test Compatibility ✅ Passed This PR contains only CI configuration changes to YAML files in ci-operator/config/ directory with no new Ginkgo e2e test code.
Single Node Openshift (Sno) Test Compatibility ✅ Passed PR modifies only CI configuration YAML files to enable Orion analysis for existing tests, adding no new Ginkgo e2e test code.
Topology-Aware Scheduling Compatibility ✅ Passed PR modifies only CI configuration files for test execution, not deployment manifests, operator code, or Kubernetes controllers that would impose scheduling constraints.
Ote Binary Stdout Contract ✅ Passed The custom check for OTE Binary Stdout Contract is not applicable to this pull request. The PR only modifies YAML CI configuration files with no changes to source code or binaries.
Ipv6 And Disconnected Network Test Compatibility ✅ Passed PR modifies only CI configuration YAML files in ci-operator/config/, not Ginkgo e2e test files. No new tests with potential IPv4 assumptions or external connectivity requirements were added.
Description Check ✅ Passed Check skipped - CodeRabbit’s high-level summary is enabled.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.

✨ Finishing Touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

@openshift-ci
Copy link
Copy Markdown
Contributor

openshift-ci Bot commented Apr 30, 2026

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: ArthurChenCoding
Once this PR has been reviewed and has the lgtm label, please assign jtaleric for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci openshift-ci Bot requested review from memodi and vishnuchalla April 30, 2026 20:47
@coderabbitai
Copy link
Copy Markdown
Contributor

coderabbitai Bot commented Apr 30, 2026

Caution

Failed to replace (edit) comment. This is likely due to insufficient permissions or the comment being deleted.

Error details
{}

@ArthurChenCoding ArthurChenCoding marked this pull request as draft April 30, 2026 21:31
@openshift-ci openshift-ci Bot added the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Apr 30, 2026
Signed-off-by: ArthurChenCoding <artchen@redhat.com>
Copy link
Copy Markdown
Contributor

@coderabbitai coderabbitai Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.

Inline comments:
In
`@ci-operator/config/openshift-eng/ocp-qe-perfscale-ci/openshift-eng-ocp-qe-perfscale-ci-main__aws-4.23-nightly-x86.yaml`:
- Around line 137-143: For the three non-standard variant job configs
(control-plane-etcdencrypt-24nodes, control-plane-ipsec-120nodes,
control-plane-ipsec-252nodes) change their Orion gating so it’s deferred: set
RUN_ORION to "deferred" instead of "true" and remove or comment out the
'openshift-qe-orion-consolidated' entry from the test chain (the list under test
containing ref/chain entries) until Orion baselines are validated; update the
blocks referencing RUN_ORION and the test array for those specific job
definitions to reflect this gating.
🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

  • Push a commit to this branch (recommended)
  • Create a new PR with the fixes

ℹ️ Review info
⚙️ Run configuration

Configuration used: Repository YAML (base), Central YAML (inherited)

Review profile: CHILL

Plan: Enterprise

Run ID: 7d701534-b795-4a1e-8ca1-39698d5335ba

📥 Commits

Reviewing files that changed from the base of the PR and between 0384648 and fa85f7d.

📒 Files selected for processing (4)
  • ci-operator/config/openshift-eng/ocp-qe-perfscale-ci/openshift-eng-ocp-qe-perfscale-ci-main__aws-4.21-nightly-x86.yaml
  • ci-operator/config/openshift-eng/ocp-qe-perfscale-ci/openshift-eng-ocp-qe-perfscale-ci-main__aws-4.22-nightly-x86.yaml
  • ci-operator/config/openshift-eng/ocp-qe-perfscale-ci/openshift-eng-ocp-qe-perfscale-ci-main__aws-4.23-nightly-x86.yaml
  • ci-operator/config/openshift-eng/ocp-qe-perfscale-ci/openshift-eng-ocp-qe-perfscale-ci-main__aws-5.0-nightly-x86.yaml

Signed-off-by: ArthurChenCoding <artchen@redhat.com>
@openshift-merge-bot
Copy link
Copy Markdown
Contributor

@ArthurChenCoding, pj-rehearse: unable to determine affected jobs. This could be due to a branch that needs to be rebased. ERROR:

couldn't prepare candidate: couldn't checkout base SHA f284674372c3d8e1f62609209004213bfc4a180d: error checking out "f284674372c3d8e1f62609209004213bfc4a180d": exit status 128 fatal: unable to read tree (f284674372c3d8e1f62609209004213bfc4a180d)

Interacting with pj-rehearse

Comment: /pj-rehearse to run up to 5 rehearsals
Comment: /pj-rehearse skip to opt-out of rehearsals
Comment: /pj-rehearse {test-name}, with each test separated by a space, to run one or more specific rehearsals
Comment: /pj-rehearse more to run up to 10 rehearsals
Comment: /pj-rehearse max to run up to 25 rehearsals
Comment: /pj-rehearse auto-ack to run up to 5 rehearsals, and add the rehearsals-ack label on success
Comment: /pj-rehearse list to get an up-to-date list of affected jobs
Comment: /pj-rehearse abort to abort all active rehearsals
Comment: /pj-rehearse network-access-allowed to allow rehearsals of tests that have the restrict_network_access field set to false. This must be executed by an openshift org member who is not the PR author

Once you are satisfied with the results of the rehearsals, comment: /pj-rehearse ack to unblock merge. When the rehearsals-ack label is present on your PR, merge will no longer be blocked by rehearsals.
If you would like the rehearsals-ack label removed, comment: /pj-rehearse reject to re-block merging.

@ArthurChenCoding ArthurChenCoding marked this pull request as ready for review May 1, 2026 06:10
@openshift-ci openshift-ci Bot removed the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label May 1, 2026
@openshift-ci openshift-ci Bot requested a review from rsevilla87 May 1, 2026 06:10
--local-indexing
NODE_DENSITY_GC: "false"
OPENSHIFT_INFRA_NODE_INSTANCE_TYPE: r5.4xlarge
RUN_ORION: "true"
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

why true and not deferred?

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I can see 4.21 6 node is deferred. But I'm not sure if 24-node and 120-node should also be deferred. for next step, Im checking if 4.21 have Orion baselines established for control-plane tests.

@ArthurChenCoding
Copy link
Copy Markdown
Contributor Author

I will revert all the etcd and ispec ones

@ArthurChenCoding
Copy link
Copy Markdown
Contributor Author

/pj-rehearse

@openshift-merge-bot
Copy link
Copy Markdown
Contributor

@ArthurChenCoding: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

Signed-off-by: ArthurChenCoding <artchen@redhat.com>
Copy link
Copy Markdown
Contributor

@coderabbitai coderabbitai Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

♻️ Duplicate comments (1)
ci-operator/config/openshift-eng/ocp-qe-perfscale-ci/openshift-eng-ocp-qe-perfscale-ci-main__aws-4.23-nightly-x86.yaml (1)

130-144: ⚠️ Potential issue | 🟠 Major | ⚡ Quick win

RUN_ORION: "true" + openshift-qe-orion-consolidated still present for non-standard variants — contradicts PR description.

control-plane-etcdencrypt-24nodes (line 137), control-plane-ipsec-120nodes (line 239), and control-plane-ipsec-252nodes (line 265) still carry RUN_ORION: "true" and the openshift-qe-orion-consolidated chain. The PR description explicitly states these variants should use RUN_ORION: deferred without the consolidated chain, and the PR author stated an intent to revert these. Despite the prior review being marked "✅ Addressed", the current code still reflects the original (incorrect) state.

These jobs use non-standard cluster configurations (OVN-IPsec / etcd-encryption) for which Orion baselines have not been confirmed. Running Orion with "true" against missing baselines will produce systematic failures in the consolidated chain steps.

Either apply the fix per the PR description, or revert the three jobs entirely:

🛠️ Option A: Apply deferred gating per PR description
# control-plane-etcdencrypt-24nodes (line 137)
-      RUN_ORION: "true"
+      RUN_ORION: deferred
   test:
   - ref: openshift-qe-workers-scale
   - chain: openshift-qe-control-plane
-   - chain: openshift-qe-orion-consolidated

# control-plane-ipsec-120nodes (line 239)
-      RUN_ORION: "true"
+      RUN_ORION: deferred
   test:
   - ref: openshift-qe-workers-scale
   - chain: openshift-qe-control-plane
-   - chain: openshift-qe-orion-consolidated

# control-plane-ipsec-252nodes (line 265)
-      RUN_ORION: "true"
+      RUN_ORION: deferred
   test:
   - ref: openshift-qe-workers-scale
   - chain: openshift-qe-control-plane
-   - chain: openshift-qe-orion-consolidated

Also applies to: 226-250, 252-276

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In
`@ci-operator/config/openshift-eng/ocp-qe-perfscale-ci/openshift-eng-ocp-qe-perfscale-ci-main__aws-4.23-nightly-x86.yaml`
around lines 130 - 144, The three job variants
control-plane-etcdencrypt-24nodes, control-plane-ipsec-120nodes, and
control-plane-ipsec-252nodes still enable Orion; update each job to match the PR
intent by setting the env key RUN_ORION to "deferred" (replace RUN_ORION:
"true") and remove the openshift-qe-orion-consolidated entry from the test chain
(i.e., remove the ref/chain "openshift-qe-orion-consolidated" from the test
list); alternatively, if you prefer the revert approach, restore the entire job
definitions to their previous committed state instead of editing them. Ensure
you modify the job blocks that include the env and test keys so the change
applies to all listed occurrences.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.

Duplicate comments:
In
`@ci-operator/config/openshift-eng/ocp-qe-perfscale-ci/openshift-eng-ocp-qe-perfscale-ci-main__aws-4.23-nightly-x86.yaml`:
- Around line 130-144: The three job variants control-plane-etcdencrypt-24nodes,
control-plane-ipsec-120nodes, and control-plane-ipsec-252nodes still enable
Orion; update each job to match the PR intent by setting the env key RUN_ORION
to "deferred" (replace RUN_ORION: "true") and remove the
openshift-qe-orion-consolidated entry from the test chain (i.e., remove the
ref/chain "openshift-qe-orion-consolidated" from the test list); alternatively,
if you prefer the revert approach, restore the entire job definitions to their
previous committed state instead of editing them. Ensure you modify the job
blocks that include the env and test keys so the change applies to all listed
occurrences.

ℹ️ Review info
⚙️ Run configuration

Configuration used: Repository YAML (base), Central YAML (inherited)

Review profile: CHILL

Plan: Enterprise

Run ID: 3b4842f4-89a1-4cf6-b357-e29554e28135

📥 Commits

Reviewing files that changed from the base of the PR and between f446e90 and bfe8c36.

📒 Files selected for processing (2)
  • ci-operator/config/openshift-eng/ocp-qe-perfscale-ci/openshift-eng-ocp-qe-perfscale-ci-main__aws-4.21-nightly-x86.yaml
  • ci-operator/config/openshift-eng/ocp-qe-perfscale-ci/openshift-eng-ocp-qe-perfscale-ci-main__aws-4.23-nightly-x86.yaml

Signed-off-by: ArthurChenCoding <artchen@redhat.com>
Signed-off-by: ArthurChenCoding <artchen@redhat.com>
Signed-off-by: ArthurChenCoding <artchen@redhat.com>
@ArthurChenCoding
Copy link
Copy Markdown
Contributor Author

/pj-rehearse

@openshift-merge-bot
Copy link
Copy Markdown
Contributor

@ArthurChenCoding: now processing your pj-rehearse request. Please allow up to 10 minutes for jobs to trigger or cancel.

@openshift-merge-bot
Copy link
Copy Markdown
Contributor

[REHEARSALNOTIFIER]
@ArthurChenCoding: the pj-rehearse plugin accommodates running rehearsal tests for the changes in this PR. Expand 'Interacting with pj-rehearse' for usage details. The following rehearsable tests have been affected by this change:

Test name Repo Type Reason
pull-ci-openshift-eng-ocp-qe-perfscale-ci-main-aws-4.23-nightly-x86-control-plane-120nodes openshift-eng/ocp-qe-perfscale-ci presubmit Ci-operator config changed
pull-ci-openshift-eng-ocp-qe-perfscale-ci-main-aws-4.23-nightly-x86-control-plane-24nodes openshift-eng/ocp-qe-perfscale-ci presubmit Ci-operator config changed
pull-ci-openshift-eng-ocp-qe-perfscale-ci-main-aws-4.23-nightly-x86-control-plane-252nodes openshift-eng/ocp-qe-perfscale-ci presubmit Ci-operator config changed
periodic-ci-openshift-eng-ocp-qe-perfscale-ci-main-aws-4.21-nightly-x86-control-plane-120nodes N/A periodic Ci-operator config changed
periodic-ci-openshift-eng-ocp-qe-perfscale-ci-main-aws-4.21-nightly-x86-control-plane-24nodes N/A periodic Ci-operator config changed
periodic-ci-openshift-eng-ocp-qe-perfscale-ci-main-aws-4.22-nightly-x86-control-plane-120nodes N/A periodic Ci-operator config changed
periodic-ci-openshift-eng-ocp-qe-perfscale-ci-main-aws-4.22-nightly-x86-control-plane-24nodes N/A periodic Ci-operator config changed
periodic-ci-openshift-eng-ocp-qe-perfscale-ci-main-aws-4.22-nightly-x86-control-plane-252nodes N/A periodic Ci-operator config changed
periodic-ci-openshift-eng-ocp-qe-perfscale-ci-main-aws-5.0-nightly-x86-control-plane-120nodes N/A periodic Ci-operator config changed
periodic-ci-openshift-eng-ocp-qe-perfscale-ci-main-aws-5.0-nightly-x86-control-plane-24nodes N/A periodic Ci-operator config changed
Interacting with pj-rehearse

Comment: /pj-rehearse to run up to 5 rehearsals
Comment: /pj-rehearse skip to opt-out of rehearsals
Comment: /pj-rehearse {test-name}, with each test separated by a space, to run one or more specific rehearsals
Comment: /pj-rehearse more to run up to 10 rehearsals
Comment: /pj-rehearse max to run up to 25 rehearsals
Comment: /pj-rehearse auto-ack to run up to 5 rehearsals, and add the rehearsals-ack label on success
Comment: /pj-rehearse list to get an up-to-date list of affected jobs
Comment: /pj-rehearse abort to abort all active rehearsals
Comment: /pj-rehearse network-access-allowed to allow rehearsals of tests that have the restrict_network_access field set to false. This must be executed by an openshift org member who is not the PR author

Once you are satisfied with the results of the rehearsals, comment: /pj-rehearse ack to unblock merge. When the rehearsals-ack label is present on your PR, merge will no longer be blocked by rehearsals.
If you would like the rehearsals-ack label removed, comment: /pj-rehearse reject to re-block merging.

@ArthurChenCoding ArthurChenCoding marked this pull request as draft May 2, 2026 05:53
@openshift-ci openshift-ci Bot added the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label May 2, 2026
@openshift-ci
Copy link
Copy Markdown
Contributor

openshift-ci Bot commented May 2, 2026

@ArthurChenCoding: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name Commit Details Required Rerun command
ci/rehearse/openshift-eng/ocp-qe-perfscale-ci/main/aws-4.23-nightly-x86-control-plane-120nodes 553b1fc link unknown /pj-rehearse pull-ci-openshift-eng-ocp-qe-perfscale-ci-main-aws-4.23-nightly-x86-control-plane-120nodes
ci/rehearse/openshift-eng/ocp-qe-perfscale-ci/main/aws-4.23-nightly-x86-control-plane-252nodes 553b1fc link unknown /pj-rehearse pull-ci-openshift-eng-ocp-qe-perfscale-ci-main-aws-4.23-nightly-x86-control-plane-252nodes
ci/rehearse/openshift-eng/ocp-qe-perfscale-ci/main/aws-4.23-nightly-x86-control-plane-ipsec-120nodes f446e90 link unknown /pj-rehearse pull-ci-openshift-eng-ocp-qe-perfscale-ci-main-aws-4.23-nightly-x86-control-plane-ipsec-120nodes
ci/rehearse/openshift-eng/ocp-qe-perfscale-ci/main/aws-4.23-nightly-x86-control-plane-etcdencrypt-24nodes f446e90 link unknown /pj-rehearse pull-ci-openshift-eng-ocp-qe-perfscale-ci-main-aws-4.23-nightly-x86-control-plane-etcdencrypt-24nodes
ci/rehearse/openshift-eng/ocp-qe-perfscale-ci/main/aws-4.23-nightly-x86-control-plane-24nodes 553b1fc link unknown /pj-rehearse pull-ci-openshift-eng-ocp-qe-perfscale-ci-main-aws-4.23-nightly-x86-control-plane-24nodes

Full PR test history. Your PR dashboard.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants