Commit 1416ed4
authored
Add benchmarks for Parquet struct leaf-level projection pruning (#21180)
## Rationale for this change
This PR adds benchmarks that measure the perf of projecting individual
fields from struct columns in Parquet files. #20925 introduced
leaf-level projection masking so that `select s['small_int']` on a
struct with large string fields only reads the small integer leaf,
skipping the expensive string decoding entirely
3 dataset shapes are coevered, each with ~262K rows of 8kb string
payloads: a narrow struct (2 leaves), a wide struct (5 leaves), and a
nested struct. Each shape benchmarks full-struct reads against
single-field projections1 parent ba399a8 commit 1416ed4
2 files changed
Lines changed: 408 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
247 | 247 | | |
248 | 248 | | |
249 | 249 | | |
| 250 | + | |
| 251 | + | |
| 252 | + | |
| 253 | + | |
| 254 | + | |
250 | 255 | | |
251 | 256 | | |
252 | 257 | | |
| |||
0 commit comments