Commit 5a427cb
authored
perf: Optimize
## Which issue does this PR close?
- Closes #21364.
## Rationale for this change
For `Utf8` and `LargeUtf8` inputs, we can optimize `substr` to avoid
copying the output strings; instead, we can return a `StringViewArray`
that points into the input value buffer.
Benchmarks (M4 Max):
no count, short strings (size=1024):
- string_view: 5.97 µs -> 5.96 µs (-0.2%)
- string: 7.80 µs -> 4.99 µs (-36.1%)
- large_string: 8.47 µs -> 4.90 µs (-42.2%)
no count, short strings (size=4096):
- string_view: 23.10 µs -> 22.90 µs (-0.9%)
- string: 31.24 µs -> 18.31 µs (-41.4%)
- large_string: 34.10 µs -> 17.70 µs (-48.1%)
with count, long strings (size=1024, count=64, strlen=128):
- string_view: 10.16 µs -> 10.79 µs (+6.2%)
- string: 11.90 µs -> 8.38 µs (-29.6%)
- large_string: 11.93 µs -> 8.30 µs (-30.5%)
with count, long strings (size=4096, count=64, strlen=128):
- string_view: 39.37 µs -> 38.79 µs (-1.5%)
- string: 46.22 µs -> 30.25 µs (-34.6%)
- large_string: 46.57 µs -> 30.49 µs (-34.5%)
short count, long strings (size=1024, count=6, strlen=128):
- string_view: 11.65 µs -> 11.57 µs (-0.7%)
- string: 14.97 µs -> 11.37 µs (-24.1%)
- large_string: 14.92 µs -> 11.37 µs (-23.8%)
short count, long strings (size=4096, count=6, strlen=128):
- string_view: 45.88 µs -> 43.82 µs (-4.5%)
- string: 58.38 µs -> 43.55 µs (-25.4%)
- large_string: 58.59 µs -> 43.58 µs (-25.6%)
scalar start, no count, short strings (size=1024, strlen=12):
- string_view: 6.07 µs -> 6.10 µs (+0.5%)
- string: 7.81 µs -> 5.06 µs (-35.2%)
scalar start, no count, short strings (size=4096, strlen=12):
- string_view: 23.08 µs -> 22.62 µs (-2.0%)
- string: 31.07 µs -> 18.86 µs (-39.3%)
scalar start, no count, long strings (size=1024, strlen=128):
- string_view: 9.99 µs -> 10.65 µs (+6.6%)
- string: 12.01 µs -> 8.17 µs (-32.0%)
scalar start, no count, long strings (size=4096, strlen=128):
- string_view: 38.57 µs -> 39.79 µs (+3.2%)
- string: 46.83 µs -> 31.67 µs (-32.4%)
scalar start=1, no count, long strings (size=1024, strlen=128):
- string_view: 9.78 µs -> 10.48 µs (+7.2%)
- string: 12.02 µs -> 8.16 µs (-32.1%)
scalar start=1, no count, long strings (size=4096, strlen=128):
- string_view: 38.54 µs -> 40.18 µs (+4.3%)
- string: 46.36 µs -> 31.73 µs (-31.6%)
scalar args, short strings (size=1024, count=6, strlen=12):
- string_view: 11.30 µs -> 11.23 µs (-0.7%)
- string: 15.04 µs -> 11.52 µs (-23.4%)
scalar args, short strings (size=4096, count=6, strlen=12):
- string_view: 44.34 µs -> 43.98 µs (-0.8%)
- string: 59.63 µs -> 45.02 µs (-24.5%)
scalar args, long strings (size=1024, count=64, strlen=128):
- string_view: 10.51 µs -> 12.05 µs (+14.6%)
- string: 12.21 µs -> 8.67 µs (-28.9%)
- large_string: 12.20 µs -> 8.66 µs (-29.0%)
scalar args, long strings (size=4096, count=64, strlen=128):
- string_view: 40.13 µs -> 41.89 µs (+4.4%)
- string: 46.96 µs -> 32.44 µs (-30.9%)
- large_string: 47.24 µs -> 32.49 µs (-31.2%)
This PR doesn't modify the `string_view` code path; I've included the
benchmark results above for completeness, but any changes should just be
benchmarking noise.
## What changes are included in this PR?
* Implement optimization
* Other minor code cleanup
* Add a benchmark (only somewhat related to this optimization but
related to future optimization work)
## Are these changes tested?
Yes.
## Are there any user-facing changes?
No.substr for Utf8, LargeUtf8 (#21366)1 parent fab3b71 commit 5a427cb
3 files changed
Lines changed: 153 additions & 21 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
201 | 201 | | |
202 | 202 | | |
203 | 203 | | |
204 | | - | |
| 204 | + | |
205 | 205 | | |
206 | 206 | | |
207 | 207 | | |
| |||
220 | 220 | | |
221 | 221 | | |
222 | 222 | | |
| 223 | + | |
| 224 | + | |
| 225 | + | |
| 226 | + | |
| 227 | + | |
| 228 | + | |
| 229 | + | |
| 230 | + | |
| 231 | + | |
| 232 | + | |
| 233 | + | |
| 234 | + | |
| 235 | + | |
| 236 | + | |
| 237 | + | |
| 238 | + | |
| 239 | + | |
| 240 | + | |
| 241 | + | |
| 242 | + | |
223 | 243 | | |
224 | 244 | | |
225 | 245 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
424 | 424 | | |
425 | 425 | | |
426 | 426 | | |
427 | | - | |
| 427 | + | |
428 | 428 | | |
429 | 429 | | |
430 | 430 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
20 | 20 | | |
21 | 21 | | |
22 | 22 | | |
23 | | - | |
24 | | - | |
| 23 | + | |
| 24 | + | |
25 | 25 | | |
26 | 26 | | |
27 | 27 | | |
| |||
134 | 134 | | |
135 | 135 | | |
136 | 136 | | |
137 | | - | |
| 137 | + | |
138 | 138 | | |
139 | 139 | | |
140 | 140 | | |
141 | | - | |
| 141 | + | |
142 | 142 | | |
143 | 143 | | |
144 | 144 | | |
| |||
275 | 275 | | |
276 | 276 | | |
277 | 277 | | |
278 | | - | |
| 278 | + | |
279 | 279 | | |
280 | 280 | | |
281 | 281 | | |
| |||
296 | 296 | | |
297 | 297 | | |
298 | 298 | | |
299 | | - | |
300 | | - | |
301 | | - | |
| 299 | + | |
| 300 | + | |
302 | 301 | | |
303 | | - | |
| 302 | + | |
304 | 303 | | |
305 | 304 | | |
306 | 305 | | |
| |||
319 | 318 | | |
320 | 319 | | |
321 | 320 | | |
322 | | - | |
323 | | - | |
324 | | - | |
325 | | - | |
| 321 | + | |
| 322 | + | |
| 323 | + | |
| 324 | + | |
| 325 | + | |
| 326 | + | |
| 327 | + | |
| 328 | + | |
| 329 | + | |
| 330 | + | |
| 331 | + | |
| 332 | + | |
| 333 | + | |
| 334 | + | |
| 335 | + | |
| 336 | + | |
| 337 | + | |
| 338 | + | |
| 339 | + | |
| 340 | + | |
| 341 | + | |
| 342 | + | |
| 343 | + | |
| 344 | + | |
| 345 | + | |
| 346 | + | |
| 347 | + | |
| 348 | + | |
| 349 | + | |
| 350 | + | |
| 351 | + | |
| 352 | + | |
| 353 | + | |
| 354 | + | |
| 355 | + | |
| 356 | + | |
326 | 357 | | |
327 | 358 | | |
328 | 359 | | |
329 | | - | |
330 | | - | |
| 360 | + | |
| 361 | + | |
| 362 | + | |
| 363 | + | |
| 364 | + | |
| 365 | + | |
| 366 | + | |
| 367 | + | |
| 368 | + | |
| 369 | + | |
| 370 | + | |
| 371 | + | |
| 372 | + | |
| 373 | + | |
| 374 | + | |
| 375 | + | |
| 376 | + | |
| 377 | + | |
| 378 | + | |
| 379 | + | |
| 380 | + | |
| 381 | + | |
| 382 | + | |
| 383 | + | |
| 384 | + | |
| 385 | + | |
| 386 | + | |
| 387 | + | |
| 388 | + | |
| 389 | + | |
| 390 | + | |
| 391 | + | |
| 392 | + | |
| 393 | + | |
| 394 | + | |
| 395 | + | |
| 396 | + | |
| 397 | + | |
| 398 | + | |
| 399 | + | |
| 400 | + | |
| 401 | + | |
| 402 | + | |
| 403 | + | |
| 404 | + | |
| 405 | + | |
| 406 | + | |
| 407 | + | |
| 408 | + | |
| 409 | + | |
| 410 | + | |
| 411 | + | |
| 412 | + | |
| 413 | + | |
| 414 | + | |
| 415 | + | |
| 416 | + | |
| 417 | + | |
| 418 | + | |
331 | 419 | | |
332 | 420 | | |
333 | 421 | | |
| |||
347 | 435 | | |
348 | 436 | | |
349 | 437 | | |
350 | | - | |
351 | | - | |
352 | | - | |
| 438 | + | |
| 439 | + | |
353 | 440 | | |
354 | 441 | | |
355 | 442 | | |
356 | 443 | | |
357 | 444 | | |
358 | 445 | | |
359 | 446 | | |
360 | | - | |
| 447 | + | |
| 448 | + | |
| 449 | + | |
| 450 | + | |
| 451 | + | |
361 | 452 | | |
362 | 453 | | |
363 | 454 | | |
| |||
734 | 825 | | |
735 | 826 | | |
736 | 827 | | |
| 828 | + | |
| 829 | + | |
| 830 | + | |
| 831 | + | |
| 832 | + | |
| 833 | + | |
| 834 | + | |
| 835 | + | |
| 836 | + | |
| 837 | + | |
| 838 | + | |
| 839 | + | |
| 840 | + | |
| 841 | + | |
| 842 | + | |
| 843 | + | |
| 844 | + | |
| 845 | + | |
| 846 | + | |
| 847 | + | |
| 848 | + | |
737 | 849 | | |
0 commit comments