perf: Cast entire Date32 array to Date64 on 1st failure #21948
huymq1710 wants to merge 9 commits into apache:main
Thank you. I updated
Previous work on this might show some insight - #15361 (comment)
alamb left a comment:
Thanks @huymq1710 and @kumarUjjawal and @Omega359
This looks good to me at a high level -- I would like to get the benchmark in place so we can verify performance
    // handle time specifiers and falls back to a Date64 cast.
    // Covers full fallback (every row triggers the cast)
    c.bench_function("to_char_array_date32_datetime_patterns_1000", |b| {
Thank you @huymq1710
Can you please put this new benchmark into a separate PR so that we can use our automated benchmark runner to compare the performance of main and your proposal?
Got it. I have temporarily removed the benchmark changes and will open another PR to add them back after this PR is merged.
@alamb One clarification: do you want this PR merged first, or the PR with the benchmark?
Typically I like to merge the PR with the benchmark first, then merge up the PR with performance improvements, and then validate the benchmark results using the run benchmarks command.
@huymq1710 can you open the PR with the benchmark?
I see, created
## Which issue does this PR close?
- Closes #.
## Rationale for this change
## What changes are included in this PR?
Benchmark for apache#21948
## Are these changes tested?
## Are there any user-facing changes?
run benchmark
Hi @kumarUjjawal, Supported benchmarks:
Usage: Per-side configuration:
    env:
      SHARED_SETTING: enabled
    baseline:
      ref: v45.0.0
      env:
        DATAFUSION_RUNTIME_MEMORY_LIMIT: 1G
    changed:
      ref: v46.0.0
      env:
        DATAFUSION_RUNTIME_MEMORY_LIMIT: 2G
run benchmark to_char
🤖 Criterion benchmark running (GKE)
Comparing cast-entire-date32-array (205f116) to b2fd2d3 (merge-base)
🤖 Criterion benchmark completed (GKE)
Benchmark looks good!
Which issue does this PR close?
- `to_char` for array conversions #17152

Rationale for this change
1. `Vec<Option<String>>` extra allocations: `to_char` to allocate less, fix NULL handling #20635 by using `StringBuilder`
2. Per-row cast on fallback
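To illustrate the first rationale item, here is a minimal standard-library sketch of why a builder-style column avoids the per-row allocations of a `Vec<Option<String>>`. `StringColumnBuilder` and its methods are hypothetical names for this illustration, not Arrow's `StringBuilder` API; the sketch only models the general idea of one contiguous value buffer plus offsets and a validity list.

```rust
/// Hypothetical stand-in for a string column builder (not the Arrow type):
/// all row values share one contiguous buffer, so appending a row does not
/// allocate a separate `String` per row.
struct StringColumnBuilder {
    values: String,      // concatenated bytes of every non-null row
    offsets: Vec<usize>, // offsets[i]..offsets[i + 1] is the span of row i
    validity: Vec<bool>, // false marks a NULL row
}

impl StringColumnBuilder {
    fn with_capacity(rows: usize) -> Self {
        let mut offsets = Vec::with_capacity(rows + 1);
        offsets.push(0);
        Self {
            values: String::new(),
            offsets,
            validity: Vec::with_capacity(rows),
        }
    }

    /// Appends a non-null row by copying its bytes into the shared buffer.
    fn append_value(&mut self, s: &str) {
        self.values.push_str(s);
        self.offsets.push(self.values.len());
        self.validity.push(true);
    }

    /// Appends a NULL row: an empty span plus a cleared validity flag.
    fn append_null(&mut self) {
        self.offsets.push(self.values.len());
        self.validity.push(false);
    }

    /// Returns row `i`, or `None` if that row is NULL.
    fn get(&self, i: usize) -> Option<&str> {
        self.validity[i].then(|| &self.values[self.offsets[i]..self.offsets[i + 1]])
    }

    fn len(&self) -> usize {
        self.validity.len()
    }
}
```

In this model a NULL row costs one offset and one flag, and a non-null row costs a copy into the shared buffer, rather than a heap allocation per row.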
What changes are included in this PR?
Cast the entire Date32 array to Date64 once on first failure, instead of per-row
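The idea can be sketched with the standard library alone. This is an illustrative model, not the code in this PR: `format_date32`, `format_date64`, `cast_to_date64`, and `to_char_array` are hypothetical helpers, dates are plain slices instead of Arrow arrays, and the toy formatters understand only `%d` (raw day count) and `%H` (hour of day). The point it tries to show is that the widened array is built lazily on the first row that fails and then reused for every later failure.

```rust
/// Milliseconds in one day: widens a Date32 value (days since the Unix epoch)
/// into a Date64 value (milliseconds since the Unix epoch).
const MILLIS_PER_DAY: i64 = 86_400_000;

/// Hypothetical Date32 formatter: fails when the pattern has a time specifier.
/// In this toy model the only time specifier is `%H`.
fn format_date32(days: i32, pattern: &str) -> Result<String, String> {
    if pattern.contains("%H") {
        return Err(format!("time specifier in {pattern:?} needs Date64"));
    }
    Ok(pattern.replace("%d", &days.to_string()))
}

/// Hypothetical Date64 formatter: understands `%d` and `%H`.
fn format_date64(millis: i64, pattern: &str) -> String {
    let days = millis.div_euclid(MILLIS_PER_DAY);
    let hour = millis.rem_euclid(MILLIS_PER_DAY) / 3_600_000;
    pattern
        .replace("%d", &days.to_string())
        .replace("%H", &format!("{hour:02}"))
}

/// Whole-array cast, standing in for a Date32 -> Date64 cast kernel.
fn cast_to_date64(dates: &[Option<i32>]) -> Vec<Option<i64>> {
    dates
        .iter()
        .map(|d| d.map(|days| i64::from(days) * MILLIS_PER_DAY))
        .collect()
}

/// Formats each row and returns the strings plus the number of array casts.
fn to_char_array(dates: &[Option<i32>], patterns: &[&str]) -> (Vec<Option<String>>, usize) {
    let mut widened: Option<Vec<Option<i64>>> = None; // filled on first failure
    let mut casts = 0;
    let mut out = Vec::with_capacity(dates.len());
    for (i, (date, pattern)) in dates.iter().zip(patterns).enumerate() {
        let Some(days) = date else {
            out.push(None); // NULL in, NULL out
            continue;
        };
        match format_date32(*days, pattern) {
            Ok(s) => out.push(Some(s)),
            Err(_) => {
                // Cast the whole array once; later failures reuse the result.
                let wide = widened.get_or_insert_with(|| {
                    casts += 1;
                    cast_to_date64(dates)
                });
                out.push(wide[i].map(|ms| format_date64(ms, pattern)));
            }
        }
    }
    (out, casts)
}
```

With this shape, an input where every row needs the fallback performs one array cast in total, and an input with only date patterns performs none.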
Are these changes tested?
Yes
- Date32 + datetime patterns (all rows trigger the fallback)
- Date32 + mixed patterns (roughly half do)

Detail

    cargo bench --bench to_char
       Compiling datafusion-functions v53.1.0 (/Users/qmac/Documents/GitHub/datafusion/datafusion/functions)
        Finished `bench` profile [optimized] target(s) in 1m 10s
         Running benches/to_char.rs (target/release/deps/to_char-d2acea4b7a3e2fba)
    Gnuplot not found, using plotters backend

    to_char_array_date_only_patterns_1000
        time:   [104.01 µs 104.37 µs 104.74 µs]
        change: [−0.5692% +0.3244% +1.0788%] (p = 0.47 > 0.05)
        No change in performance detected.
    Found 3 outliers among 100 measurements (3.00%)
      1 (1.00%) low mild
      2 (2.00%) high mild

    to_char_array_datetime_patterns_1000
        time:   [165.96 µs 166.43 µs 166.94 µs]
        change: [−4.1660% −3.2055% −2.3626%] (p = 0.00 < 0.05)
        Performance has improved.
    Found 6 outliers among 100 measurements (6.00%)
      2 (2.00%) low mild
      3 (3.00%) high mild
      1 (1.00%) high severe

    to_char_array_mixed_patterns_1000
        time:   [139.58 µs 140.02 µs 140.49 µs]
        change: [+1.0116% +1.4122% +1.8178%] (p = 0.00 < 0.05)
        Performance has regressed.
    Found 3 outliers among 100 measurements (3.00%)
      1 (1.00%) low mild
      2 (2.00%) high mild

    to_char_scalar_date_only_pattern_1000
        time:   [60.073 µs 63.184 µs 66.309 µs]
        change: [−4.0228% +0.6839% +6.1581%] (p = 0.79 > 0.05)
        No change in performance detected.

    to_char_scalar_datetime_pattern_1000
        time:   [117.65 µs 124.76 µs 131.84 µs]
        change: [−3.3009% +2.6889% +8.6148%] (p = 0.37 > 0.05)
        No change in performance detected.
    Found 30 outliers among 100 measurements (30.00%)
      16 (16.00%) low mild
      14 (14.00%) high mild

    to_char_array_date32_datetime_patterns_1000
        time:   [289.67 µs 290.77 µs 291.88 µs]
        change: [−20.661% −20.240% −19.863%] (p = 0.00 < 0.05)
        Performance has improved.

    to_char_array_date32_mixed_patterns_1000
        time:   [194.47 µs 195.28 µs 196.11 µs]
        change: [−16.230% −15.738% −15.285%] (p = 0.00 < 0.05)
        Performance has improved.

Are there any user-facing changes?
No