(Test) Advanced adaptive filter selectivity evaluation #20363

adriangb wants to merge 4 commits into apache:main
Conversation
run benchmark tpcds

run benchmark clickbench_partitioned

run benchmark tpch

🤖: Benchmark completed.

show benchmark queue

🤖 Hi @Dandandan, you asked to view the benchmark queue (#20363 (comment)).

show benchmark queue

🤖 Hi @Dandandan, you asked to view the benchmark queue (#20363 (comment)).

Hm, it seems stuck again.
FYI @alamb

@Dandandan this is mostly vibe coded, I'm only 50% confident it even makes sense without reviewing the code, fwiw.
Force-pushed from e0240af to 09cdb0b.
show benchmark queue

🤖 Hi @adriangb, you asked to view the benchmark queue (#20363 (comment)).

Wonder if I'm infinite looping it or something :(

Yes, I think previously it got stuck during infinite loops / extremely long-running tasks.

My bad, I'll open a PR to add timeouts and a cancel command.

show benchmark queue

🤖 Hi @adriangb, you asked to view the benchmark queue (#20363 (comment)).
run benchmark tpch

run benchmark clickbench_partitioned

run benchmark clickbench_partitioned env:
DATAFUSION_EXECUTION_PARQUET_PUSHDOWN_FILTERS: true
DATAFUSION_EXECUTION_PARQUET_REORDER_FILTERS: true

🤖 Benchmark running (GKE): comparing filter-pushdown-dynamic-bytes (f8ee955) to ec00112 (merge-base) using clickbench_partitioned.

🤖 Benchmark running (GKE): comparing filter-pushdown-dynamic-bytes (f8ee955) to ec00112 (merge-base) using clickbench_partitioned.

🤖 Benchmark completed (GKE): clickbench_partitioned, base (merge-base) vs. branch.

🤖 Benchmark completed (GKE): clickbench_partitioned, base (merge-base) vs. branch.
Force-pushed from f8ee955 to 290784a.
run benchmark clickbench_partitioned

run benchmark clickbench_partitioned env:
DATAFUSION_EXECUTION_PARQUET_PUSHDOWN_FILTERS: true
DATAFUSION_EXECUTION_PARQUET_REORDER_FILTERS: true

🤖 Benchmark running (GKE): comparing filter-pushdown-dynamic-bytes (290784a) to ec00112 (merge-base) using clickbench_partitioned.

🤖 Benchmark running (GKE): comparing filter-pushdown-dynamic-bytes (290784a) to ec00112 (merge-base) using clickbench_partitioned.
Force-pushed from 290784a to 18ab75f.
🤖 Benchmark completed (GKE): clickbench_partitioned, base (merge-base) vs. branch.

🤖 Benchmark completed (GKE): clickbench_partitioned, base (merge-base) vs. branch.
Force-pushed from 65f15e1 to 72f296f.
run benchmark clickbench_partitioned env:
DATAFUSION_EXECUTION_PARQUET_PUSHDOWN_FILTERS: true
DATAFUSION_EXECUTION_PARQUET_REORDER_FILTERS: true

run benchmark clickbench_partitioned

🤖 Benchmark running (GKE): comparing filter-pushdown-dynamic-bytes (72f296f) to ec00112 (merge-base) using clickbench_partitioned.

🤖 Benchmark running (GKE): comparing filter-pushdown-dynamic-bytes (72f296f) to ec00112 (merge-base) using clickbench_partitioned.

🤖 Benchmark completed (GKE): clickbench_partitioned, base (merge-base) vs. branch.

🤖 Benchmark completed (GKE): clickbench_partitioned, base (merge-base) vs. branch.
Applies apache#20363 on top of the io-dynamic branch, resolving conflicts in opener.rs and source.rs by adapting the new predicate_conjuncts/selectivity_tracker fields to the direct ParquetMorselizer construction pattern used on this branch.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Introduces a runtime adaptive filter selectivity tracking system for Parquet pushdown. Each filter is monitored with Welford online stats and moves through a state machine: New -> RowFilter|PostScan -> (promoted / demoted / dropped).

Key changes:
- New selectivity.rs module (SelectivityTracker, TrackerConfig, SelectivityStats, FilterState, PartitionedFilters, FilterId).
- New OptionalFilterPhysicalExpr wrapper in physical_expr_common. HashJoinExec wraps dynamic join filters in it.
- Removes reorder_filters config + supporting code.
- Adds filter_pushdown_min_bytes_per_sec, filter_collecting_byte_ratio_threshold, filter_confidence_z config.
- Predicate storage: Option<Arc<PhysicalExpr>> -> Option<Vec<(FilterId, Arc<PhysicalExpr>)>> on ParquetSource/ParquetOpener.
- build_row_filter takes Vec<(FilterId,...)> + SelectivityTracker, returns RowFilterWithMetrics. DatafusionArrowPredicate reports per-batch stats back to the tracker.
- ParquetOpener calls tracker.partition_filters() and apply_post_scan_filters_with_stats; records filter_apply_time.
- Proto reserves tag 6 (was reorder_filters); adds 3 new optional fields.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
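The "Welford online stats" the commit relies on fit in a few lines. The sketch below is an illustrative stand-in, not the actual DataFusion code: the names `SelectivityStats`, `update`, and `ci_bounds` mirror the description, and the z-score confidence interval is the standard `mean ± z * s / sqrt(n)` form.

```rust
/// Hypothetical sketch of per-filter Welford statistics with a
/// z-score confidence interval on the mean.
#[derive(Default)]
struct SelectivityStats {
    n: u64,
    mean: f64,
    m2: f64, // running sum of squared deviations (Welford)
}

impl SelectivityStats {
    /// Welford's online update: numerically stable, single pass.
    fn update(&mut self, sample: f64) {
        self.n += 1;
        let delta = sample - self.mean;
        self.mean += delta / self.n as f64;
        let delta2 = sample - self.mean;
        self.m2 += delta * delta2;
    }

    /// (lower, upper) bounds on the mean: mean ± z * s / sqrt(n).
    /// Returns None until at least two samples exist.
    fn ci_bounds(&self, z: f64) -> Option<(f64, f64)> {
        if self.n < 2 {
            return None; // not enough samples for a variance estimate
        }
        let var = self.m2 / (self.n - 1) as f64; // sample variance
        let half_width = z * (var / self.n as f64).sqrt();
        Some((self.mean - half_width, self.mean + half_width))
    }
}

fn main() {
    let mut stats = SelectivityStats::default();
    for s in [10.0, 12.0, 11.0, 13.0, 9.0] {
        stats.update(s);
    }
    let (lo, hi) = stats.ci_bounds(2.0).unwrap();
    println!("mean={}, ci=({}, {})", stats.mean, lo, hi);
}
```

As samples accumulate, the interval tightens, which is what lets the tracker promote or demote a filter with confidence rather than reacting to a single noisy batch.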
Force-pushed from 72f296f to 17e8370.
Optional/dynamic filters from hash-join build sides were unconditionally placed as PostScan on first encounter, losing late-materialization benefits even when filter columns were small relative to the projection. With few file splits opened in parallel, the tracker rarely accumulated enough samples to promote them mid-query.

Apply the same byte_ratio_threshold heuristic used for static filters. The CI lower-bound promotion and CI upper-bound demotion paths still apply, including Drop for ineffective optional filters.

Local TPC-DS sf1 (M-series, pushdown_filters=true):

| Query | Main  | Branch | Branch+Fix |
|-------|-------|--------|------------|
| Q24   | 72    | 452    | 70         |
| Q17   | 124   | 212    | 121        |
| Q25   | 182   | 379    | 203        |
| Q29   | 152   | 312    | 145        |
| Q7    | 224   | 297    | 220        |
| Q58   | 129   | 191    | 133        |
| Q64   | 28213 | 672    | 578        |
| Q9    | 228   | 96     | 87         |
| Q76   | 172   | 105    | 156        |

Q76 regresses slightly vs the no-fix branch (CASE/hash_lookup is CPU-heavy at row level) but still beats main.

Also updates dynamic_filter_pushdown_config.slt to match the Optional(DynamicFilter ...) display introduced earlier in the branch.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
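The byte_ratio_threshold heuristic the commit applies amounts to a single comparison. A minimal sketch, assuming the byte sizes are known up front; the function name `initial_placement` and its signature are hypothetical, not the real API:

```rust
/// Illustrative initial-placement decision: a filter starts as a row
/// filter only when the bytes its columns occupy are small relative
/// to the full projection (late materialization likely pays off).
#[derive(Debug, PartialEq)]
enum InitialPlacement {
    RowFilter,
    PostScan,
}

/// `filter_bytes`: estimated bytes of the columns the filter reads.
/// `projection_bytes`: estimated bytes of all projected columns.
fn initial_placement(
    filter_bytes: u64,
    projection_bytes: u64,
    threshold: f64,
) -> InitialPlacement {
    if projection_bytes == 0 {
        return InitialPlacement::PostScan; // nothing to save
    }
    let ratio = filter_bytes as f64 / projection_bytes as f64;
    if ratio <= threshold {
        // Filter columns are cheap relative to the projection:
        // pushing down can skip decoding the expensive columns.
        InitialPlacement::RowFilter
    } else {
        InitialPlacement::PostScan
    }
}

fn main() {
    // A narrow join-key column against a wide projection.
    assert_eq!(initial_placement(8, 400, 0.15), InitialPlacement::RowFilter);
    // Filter columns dominate the projection: no payoff, stay post-scan.
    assert_eq!(initial_placement(380, 400, 0.15), InitialPlacement::PostScan);
    println!("ok");
}
```

Note how this matches the projection ⊆ filter-columns case described later: there the ratio is 1.0, above any sensible threshold, so the filter lands at PostScan.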
After moving optional filters to RowFilter via byte_ratio, queries with 1-row-group-per-file inputs (e.g. TPC-DS) had no chance to demote when the chosen filter turned out to be CPU-dominated and ineffective: partition_filters runs once per file open, all 12 split openers fire in parallel and see no stats, and the existing Demote/Drop branches never re-trigger for the lifetime of the scan.

Add a per-FilterId Arc<AtomicBool> "skip flag" owned by SelectivityTracker. Once a filter has accumulated enough samples and its CI upper bound on bytes-per-second falls below min_bytes_per_sec, the hot per-batch update() path flips the flag, but only for filters recorded as optional at first encounter (mandatory filters must always execute or the result set changes).

Both consumers honour it:
* DatafusionArrowPredicate::evaluate returns an all-true mask without invoking physical_expr (filter columns are still decoded; CPU is reclaimed but I/O is not, pending arrow-rs API).
* apply_post_scan_filters_with_stats `continue`s past the filter, skipping evaluation and the per-batch tracker.update.

Local TPC-DS sf1 (M-series, pushdown_filters=true), worst regressors from main pushdown=off baseline:

| Query | Main(off) | Branch(byte-ratio) | +skip-flag |
|-------|-----------|--------------------|------------|
| Q72   | 619       | 554                | 261        |
| Q50   | 221       | 521                | 135        |
| Q23   | 892       | 1217               | 680        |
| Q67   | 310       | 510                | 306        |
| Q18   | 128       | 312                | 178        |
| Q13   | 399       | 558                | 363        |
| Q53   | 103       | 167                | 93         |
| Q63   | 106       | 173                | 93         |
| Q76   | 132       | 268                | 105        |

Q24-class wins are unaffected (Q24 holds at 70 ms vs 379 ms on main).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
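The per-FilterId skip flag described above can be sketched as follows. This is a simplified, hypothetical stand-in for the real SelectivityTracker internals: the struct `FilterSkip` and method names are illustrative, and the flag uses relaxed atomics since a late read merely delays skipping by one batch.

```rust
use std::sync::atomic::{AtomicBool, Ordering};
use std::sync::Arc;

/// Illustrative per-filter skip state: the Arc<AtomicBool> is shared
/// with every consumer of the filter (row-filter path and post-scan path).
struct FilterSkip {
    optional: bool,        // recorded at first encounter
    skip: Arc<AtomicBool>, // flipped once; consumers poll it per batch
}

impl FilterSkip {
    /// Hot per-batch path: once the CI upper bound on bytes-per-second
    /// saved falls below the configured floor, stop executing the
    /// filter. Mandatory filters are never skipped, because skipping
    /// them would change the result set.
    fn maybe_flip(&self, ci_upper_bytes_per_sec: Option<f64>, min_bytes_per_sec: f64) {
        if let Some(upper) = ci_upper_bytes_per_sec {
            if self.optional && upper < min_bytes_per_sec {
                self.skip.store(true, Ordering::Relaxed);
            }
        }
    }

    fn should_skip(&self) -> bool {
        self.skip.load(Ordering::Relaxed)
    }
}

fn main() {
    let optional = FilterSkip { optional: true, skip: Arc::new(AtomicBool::new(false)) };
    optional.maybe_flip(Some(1_000.0), 52_428_800.0); // far below the floor
    assert!(optional.should_skip());

    let mandatory = FilterSkip { optional: false, skip: Arc::new(AtomicBool::new(false)) };
    mandatory.maybe_flip(Some(1_000.0), 52_428_800.0);
    assert!(!mandatory.should_skip()); // mandatory filters always execute
    println!("ok");
}
```

Using a shared atomic rather than re-running partition_filters is what lets the decision take effect mid-scan even though placement is computed only once per file open.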
Q18 (and several other TPC-DS regressions) had a post-scan customer_demographics filter, `Optional(DynamicFilter)` on the single projected column `cd_demo_sk`, that burned 90 ms of CPU per scan but could never be skipped. The filter was correctly placed at PostScan (projection ⊆ filter columns ⇒ byte_ratio = 1.0 > threshold) but the mid-stream skip path never fired.

Root cause: `SelectivityStats::update` only incremented `sample_count` when `batch_bytes > 0`. When the projection is a subset of the filter columns, `other_bytes_per_row = 0` and therefore `batch_bytes = 0` on every call, so the Welford counter stayed at zero, the CI upper bound stayed `None`, and the skip check short-circuited. Meanwhile the filter kept running per batch.

Admit samples with `batch_bytes = 0`. The recorded effectiveness for those samples is legitimately zero (no late-materialization payoff), so the CI upper bound converges on zero after a few batches and the skip flag flips for optional filters. This is exactly what we want: CPU spent, no byte savings, optional ⇒ drop.

Local TPC-DS sf1 (M-series, pushdown=on) vs main pushdown=off:

| Query | Main(off) | Before | After |
|-------|-----------|--------|-------|
| Q18   | 99        | 182    | 118   |
| Q67   | 312       | 503    | 346   |
| Q26   | 80        | 151    | 94    |
| Q85   | 149       | 246    | 157   |
| Q91   | 64        | 108    | 58    |
| Q53   | 103       | 144    | 99    |
| Q63   | 103       | 148    | 99    |
| Q13   | 399       | 558    | 376   |
| Q72   | 619       | 489    | 277   |
| Q24   | 379       | 70     | 70    |
| Q64   | 28213     | --     | 519   |

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
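The bug and the fix can be reproduced with a minimal model. This is a hypothetical, simplified `Stats` (not the real SelectivityStats): it shows that gating `update` on `batch_bytes > 0` starves the sample counter so no confidence interval ever forms, while admitting zero-byte samples lets the CI upper bound converge to zero and arm the skip check.

```rust
/// Minimal Welford stats with a CI upper bound, mirroring the
/// commit-message description (names are illustrative).
#[derive(Default)]
struct Stats {
    n: u64,
    mean: f64,
    m2: f64,
}

impl Stats {
    fn update(&mut self, sample: f64) {
        self.n += 1;
        let d = sample - self.mean;
        self.mean += d / self.n as f64;
        self.m2 += d * (sample - self.mean);
    }

    fn ci_upper(&self, z: f64) -> Option<f64> {
        if self.n < 2 {
            return None;
        }
        let var = self.m2 / (self.n - 1) as f64;
        Some(self.mean + z * (var / self.n as f64).sqrt())
    }
}

fn main() {
    // Buggy behavior: only admit samples when batch_bytes > 0.
    let mut buggy = Stats::default();
    for _batch in 0..10 {
        let batch_bytes = 0.0; // projection ⊆ filter columns
        if batch_bytes > 0.0 {
            buggy.update(batch_bytes);
        }
    }
    assert_eq!(buggy.ci_upper(2.0), None); // skip check can never fire

    // Fixed behavior: zero savings is a legitimate sample.
    let mut fixed = Stats::default();
    for _batch in 0..10 {
        fixed.update(0.0);
    }
    // CI upper bound converges on zero, so an optional filter is skipped.
    assert!(fixed.ci_upper(2.0).unwrap() < 1.0);
    println!("ok");
}
```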
Which issue does this PR close?

Related to filter pushdown performance optimization work.

Rationale for this change

Currently, when `pushdown_filters = true`, DataFusion pushes all filter predicates into the Parquet reader as row-level filters (ArrowPredicates) unconditionally. This is suboptimal because:

- The `reorder_filters` heuristic was static. It used compressed column size as a proxy for cost and sorted filters by that metric, but never measured actual runtime selectivity or evaluation cost. It could not adapt to data skew or runtime conditions.
- Dynamic filters (e.g., those pushed down by `HashJoinExec`) cannot be dropped even when they provide no benefit. Without a way to mark filters as optional, the system was forced to always evaluate them.

This PR introduces an adaptive filter selectivity tracking system that observes filter behavior at runtime and makes data-driven decisions about whether each filter should be pushed down as a row-level predicate or applied post-scan.
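The adaptive decision loop this PR describes can be sketched as one transition function. The code below is a rough, hypothetical condensation: the `FilterState` variants come from the PR description, while `step` and its parameters are illustrative stand-ins for the tracker's internal logic.

```rust
#[derive(Debug, Clone, Copy, PartialEq)]
enum FilterState {
    New,
    RowFilter,
    PostScan,
    Dropped,
}

/// One adaptive decision step, driven by confidence bounds (`ci_lo`,
/// `ci_hi`) on the measured bytes-per-second a filter saves, compared
/// against the configured floor `min_bps`.
fn step(
    state: FilterState,
    byte_ratio: f64,   // filter_bytes / projection_bytes
    threshold: f64,    // byte-ratio cutoff for initial placement
    ci_lo: Option<f64>,
    ci_hi: Option<f64>,
    min_bps: f64,
    optional: bool,
) -> FilterState {
    match state {
        // Initial placement: cheap byte-ratio heuristic, no stats yet.
        FilterState::New if byte_ratio <= threshold => FilterState::RowFilter,
        FilterState::New => FilterState::PostScan,
        // Demote (or drop, if optional) once we are confident the
        // filter saves too few bytes per second of CPU spent.
        FilterState::RowFilter if ci_hi.map_or(false, |hi| hi < min_bps) => {
            if optional { FilterState::Dropped } else { FilterState::PostScan }
        }
        // Promote once we are confident it saves enough.
        FilterState::PostScan if ci_lo.map_or(false, |lo| lo > min_bps) => {
            FilterState::RowFilter
        }
        s => s, // not enough evidence: stay put
    }
}

fn main() {
    let s = step(FilterState::New, 0.05, 0.15, None, None, 50.0, true);
    assert_eq!(s, FilterState::RowFilter); // cheap columns: start pushed down
    let s = step(s, 0.05, 0.15, None, Some(10.0), 50.0, true);
    assert_eq!(s, FilterState::Dropped); // confidently ineffective + optional
    println!("ok");
}
```

The confidence bounds are what make the loop conservative: a filter only moves when the interval is entirely on one side of the floor, so a few unlucky batches do not bounce it between states.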
What changes are included in this PR?

1. New module: selectivity.rs (1,554 lines)

The core of this PR. Introduces `SelectivityTracker`, a shared, lock-guarded structure that:

- Moves each filter through `New -> RowFilter | PostScan -> (promoted/demoted/dropped)` states based on runtime measurements.
- Uses a byte-ratio heuristic (`filter_bytes / projection_bytes`) to cheaply decide whether a new filter starts as a row filter or post-scan filter.
- Promotes or demotes filters by comparing confidence bounds on measured throughput against `filter_pushdown_min_bytes_per_sec`.
- Filters wrapped in `OptionalFilterPhysicalExpr` can be dropped entirely when ineffective.
- Tracks `snapshot_generation()`, resetting statistics when a filter's predicate changes (e.g., when a `DynamicFilterPhysicalExpr` from a hash join updates its value set).

Key types:

- `SelectivityTracker` -- cross-file tracker shared by all `ParquetOpener` instances
- `TrackerConfig` -- immutable configuration (built from `ParquetOptions`)
- `SelectivityStats` -- per-filter Welford statistics with confidence interval methods
- `FilterState` -- `RowFilter | PostScan | Dropped` enum
- `PartitionedFilters` -- output of `partition_filters()`, consumed by the opener
- `FilterId` -- stable `usize` identifier assigned by `ParquetSource::with_predicate`

2. New wrapper: OptionalFilterPhysicalExpr (in physical_expr_common)

A transparent `PhysicalExpr` wrapper that marks a filter as optional -- droppable without affecting query correctness. All `PhysicalExpr` trait methods delegate to the inner expression. The selectivity tracker detects this via `downcast_ref::<OptionalFilterPhysicalExpr>()` and can drop the filter entirely when it is ineffective, rather than demoting it to post-scan.

`HashJoinExec` now wraps its dynamic join filters in `OptionalFilterPhysicalExpr` before pushing them down. This is why plan output now shows `Optional(DynamicFilter [...])` instead of `DynamicFilter [...]`.

3. Removal of the reorder_filters config option

The old static `reorder_filters` boolean and its associated heuristic (sort by `required_bytes`, then `can_use_index`) are removed entirely. The adaptive system subsumes this:

- `FilterCandidate` no longer stores the `required_bytes` or `can_use_index` fields.
- The `size_of_columns()` and `columns_sorted()` helper functions in `row_filter.rs` are removed.
- Filter ordering is now decided by `SelectivityTracker::partition_filters()` based on measured effectiveness or the byte-ratio fallback.

4. Three new configuration options (in ParquetOptions)

- `filter_pushdown_min_bytes_per_sec` -- throughput floor for keeping a filter as a row filter; `0.0` = all promoted, `INFINITY` = none promoted (feature disabled).
- `filter_collecting_byte_ratio_threshold` -- byte-ratio cutoff used for a new filter's initial placement.
- `filter_confidence_z` -- z-score used when computing confidence intervals.

5. Changes to ParquetOpener / opener.rs

- Stores `Vec<(FilterId, Arc<dyn PhysicalExpr>)>` instead of a single combined `Arc<dyn PhysicalExpr>`.
- Calls `selectivity_tracker.partition_filters()` to split filters into row-level vs. post-scan.
- Row-level filters are handed to `build_row_filter()` (updated signature).
- Post-scan filters are handed to `apply_post_scan_filters_with_stats()`, a new function that evaluates each filter individually, reports per-filter timing and selectivity back to the tracker, and combines results into a single boolean mask.
- The `limit` is only applied to the Parquet reader when there are no post-scan filters (otherwise limiting would cut off rows before the filter could find matches).
- A new `filter_apply_time` metric tracks post-scan filter evaluation time.

6. Changes to ParquetSource / source.rs

- Predicate storage changes from `Option<Arc<dyn PhysicalExpr>>` to `Option<Vec<(FilterId, Arc<dyn PhysicalExpr>)>>`.
- `with_predicate()` now splits the predicate into conjuncts and assigns stable `FilterId` values (indices).
- The `SelectivityTracker` is stored as a shared `Arc` on `ParquetSource` and passed to all openers.
- `with_table_parquet_options()` now builds a fresh `SelectivityTracker` from the three new config values.
- The `with_reorder_filters()` and `reorder_filters()` methods are removed.

7. Changes to build_row_filter() / row_filter.rs

- Takes `Vec<(FilterId, Arc<dyn PhysicalExpr>)>` + `&Arc<SelectivityTracker>` instead of `&Arc<dyn PhysicalExpr>` + `reorder_predicates: bool`.
- Returns `RowFilterWithMetrics` (new struct) containing both the `RowFilter` and any unbuildable filters that must be applied post-scan.
- `DatafusionArrowPredicate` now carries a `FilterId` and `Arc<SelectivityTracker>`, reporting per-batch evaluation metrics back to the tracker after each `evaluate()` call.
- No reordering happens inside `build_row_filter` -- filters arrive pre-ordered by the tracker.

8. Changes to HashJoinExec

- Dynamic join filters are wrapped in `OptionalFilterPhysicalExpr` before being pushed down.
- Code that updates the dynamic filter unwraps the `OptionalFilterPhysicalExpr` to reach the inner `DynamicFilterPhysicalExpr`.

9. Protobuf schema updates

- The `reorder_filters` field (tag 6) is marked as `reserved` in `datafusion_common.proto`.
- New fields: `filter_pushdown_min_bytes_per_sec` (tag 35), `filter_collecting_byte_ratio_threshold` (tag 40), `filter_confidence_z` (tag 41).
- Regenerated `pbjson.rs` and `prost.rs`; updated `from_proto`, `to_proto`, and `file_formats.rs`.

10. Test and benchmark updates

- References to `reorder_filters` removed from tests and benchmarks.
- Many tests set `filter_pushdown_min_bytes_per_sec = 0.0` to preserve deterministic behavior (all filters always pushed down).
- Expected plan output changes from `DynamicFilter [...]` to `Optional(DynamicFilter [...])`.
- New unit tests in `selectivity.rs` covering: effectiveness calculation, Welford's algorithm, confidence intervals, state machine transitions (initial placement, promotion, demotion, dropping), dynamic filter generation tracking, filter ordering, and integration lifecycle tests.
- An expected-output update in `explain_analyze.rs` (`output_rows=8` -> `output_rows=5`) due to the adaptive system now placing some filters as post-scan that were previously row-level, causing slight row count differences in EXPLAIN ANALYZE output.

Are these changes tested?
Yes:

- All existing `pushdown_filters` and filter pushdown SLT tests pass (with `filter_pushdown_min_bytes_per_sec = 0.0` to force all filters to row-level for deterministic behavior).
- New unit tests in `selectivity.rs` (~450 lines of tests) cover the `SelectivityStats` calculator, the `TrackerConfig` builder, state machine transitions (initial placement, promotion, demotion, dropping, reset on generation change), filter ordering, and full promotion/demotion lifecycle integration tests.
- Plan output tests are updated for the `Optional(...)` wrapper on dynamic filters.
- Updated SLT files: `dynamic_filter_pushdown_config.slt`, `information_schema.slt`, `preserve_file_partitioning.slt`, `projection_pushdown.slt`, `push_down_filter.slt`, and `repartition_subset_satisfaction.slt`.
- `benchmarks/results.txt` shows TPC-H (13 faster, 6 slower, 3 unchanged), TPC-DS (33 faster, 31 slower, 35 unchanged, with a notable 24x improvement on Q64), and ClickBench (18 faster, 12 slower, 13 unchanged) results.

Are there any user-facing changes?
Yes:

- The `reorder_filters` config option is removed. This is a breaking change: users who set `SET datafusion.execution.parquet.reorder_filters = true` will get an error. The adaptive system replaces this functionality automatically.
- Three new config options are added under `datafusion.execution.parquet`:
  - `filter_pushdown_min_bytes_per_sec` (default: 52428800)
  - `filter_collecting_byte_ratio_threshold` (default: 0.15)
  - `filter_confidence_z` (default: 2.0)
- Default behavior changes when `pushdown_filters = true`. Previously, all filters were unconditionally pushed into the Parquet reader. Now, the adaptive system decides per filter based on byte-ratio thresholds and runtime effectiveness measurements. To restore the old behavior of pushing all filters unconditionally, set `filter_pushdown_min_bytes_per_sec = 0.0`.
- EXPLAIN plan output changes: dynamic join filters now display as `Optional(DynamicFilter [...])` instead of `DynamicFilter [...]`, reflecting their new optional wrapper.
- The deprecated `predicate()` method's signature changed: `ParquetSource::predicate()` now returns `Option<Arc<dyn PhysicalExpr>>` (owned) instead of `Option<&Arc<dyn PhysicalExpr>>` (reference). This method was already deprecated in favor of `filter()`.
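The `Optional(...)` wrapper behind that EXPLAIN change is a transparent-wrapper pattern: delegate every method, and let consumers detect the marker by downcasting. The sketch below uses a toy `Expr` trait in place of DataFusion's `PhysicalExpr`, so the type and function names (`OptionalFilterExpr`, `is_optional`) are illustrative only; the real wrapper delegates the full `PhysicalExpr` trait the same way.

```rust
use std::any::Any;
use std::sync::Arc;

/// Toy stand-in for PhysicalExpr.
trait Expr: Any {
    fn evaluate(&self, row: i64) -> bool;
    fn as_any(&self) -> &dyn Any;
}

/// An ordinary predicate: row > 0.
struct GtZero;
impl Expr for GtZero {
    fn evaluate(&self, row: i64) -> bool { row > 0 }
    fn as_any(&self) -> &dyn Any { self }
}

/// Marks the inner expression as droppable without changing
/// correctness; all behavior delegates to the inner expression.
struct OptionalFilterExpr {
    inner: Arc<dyn Expr>,
}

impl Expr for OptionalFilterExpr {
    fn evaluate(&self, row: i64) -> bool { self.inner.evaluate(row) }
    fn as_any(&self) -> &dyn Any { self }
}

/// How a tracker detects optionality: a downcast, analogous to
/// `downcast_ref::<OptionalFilterPhysicalExpr>()` in the PR.
fn is_optional(expr: &dyn Expr) -> bool {
    expr.as_any().downcast_ref::<OptionalFilterExpr>().is_some()
}

fn main() {
    let wrapped = OptionalFilterExpr { inner: Arc::new(GtZero) };
    assert!(wrapped.evaluate(1));   // behavior is unchanged
    assert!(is_optional(&wrapped)); // but the marker is detectable
    assert!(!is_optional(&GtZero));
    println!("ok");
}
```

The wrapper carries no data of its own, which is why it is safe to drop: removing it (or the whole filter, when the tracker deems it ineffective) can only let extra rows through to operators that would filter them anyway.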