Skip to content

Spark make_valid_utf8 function implementation#20633

Merged
Jefffrey merged 20 commits intoapache:mainfrom
kazantsev-maksim:spark_make_valid_utf8
Apr 16, 2026
Merged

Spark make_valid_utf8 function implementation#20633
Jefffrey merged 20 commits intoapache:mainfrom
kazantsev-maksim:spark_make_valid_utf8

Conversation

@kazantsev-maksim
Copy link
Copy Markdown
Contributor

Which issue does this PR close?

N/A

Rationale for this change

Add new spark function: https://spark.apache.org/docs/latest/api/sql/index.html#make_valid_utf8

What changes are included in this PR?

  • Implementation
  • SLT tests

Are these changes tested?

Yes, tests added as part of this PR.

Are there any user-facing changes?

No, these are new function.

@github-actions github-actions Bot added sqllogictest SQL Logic Tests (.slt) spark labels Mar 1, 2026
@kazantsev-maksim kazantsev-maksim changed the title Spark make_valid_utf8 function implementation Draft: Spark make_valid_utf8 function implementation Mar 1, 2026
@kazantsev-maksim kazantsev-maksim changed the title Draft: Spark make_valid_utf8 function implementation Spark make_valid_utf8 function implementation Mar 1, 2026
@kazantsev-maksim kazantsev-maksim marked this pull request as draft March 1, 2026 18:56
@kazantsev-maksim kazantsev-maksim marked this pull request as ready for review March 5, 2026 17:00
@kazantsev-maksim
Copy link
Copy Markdown
Contributor Author

kazantsev-maksim commented Mar 25, 2026

@Jefffrey Could you please do a review when you have time?

Copy link
Copy Markdown
Contributor

@Jefffrey Jefffrey left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Implementation looks good, but we need more test cases; as it is none of the tests actually test invalid utf8 path

@Jefffrey Jefffrey added this pull request to the merge queue Apr 16, 2026
Merged via the queue into apache:main with commit bd2af68 Apr 16, 2026
31 checks passed
@Jefffrey
Copy link
Copy Markdown
Contributor

Thanks @kazantsev-maksim

@kazantsev-maksim kazantsev-maksim deleted the spark_make_valid_utf8 branch April 16, 2026 07:36
@kazantsev-maksim
Copy link
Copy Markdown
Contributor Author

Thanks for the review @Jefffrey.

Rich-T-kid pushed a commit to Rich-T-kid/datafusion that referenced this pull request Apr 21, 2026
## Which issue does this PR close?

N/A

## Rationale for this change

Add new spark function:
https://spark.apache.org/docs/latest/api/sql/index.html#make_valid_utf8

## What changes are included in this PR?

- Implementation
- SLT tests

## Are these changes tested?

Yes, tests added as part of this PR.

## Are there any user-facing changes?

No, these are new function.

---------

Co-authored-by: Kazantsev Maksim <mn.kazantsev@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

spark sqllogictest SQL Logic Tests (.slt)

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants