[FEA] Regexp: Add support for word and non-word boundaries in regexp pattern #4289
Labels
cudf_dependency
An issue or PR with this label depends on a new feature in cudf
feature request
New feature or request
Is your feature request related to a problem? Please describe.
We currently fall back to CPU for regexp patterns containing word (
\b
) and non-word (\B
) boundaries in regexp patterns.Here is one example of a difference between CPU and GPU for regexp_replace.
The test is effectively running
regexp_replace("A\nB", "\b", "_REPLACE")
.CPU output:
GPU output:
Describe the solution you'd like
Support on GPU consistently with Spark.
Describe alternatives you've considered
None
Additional context
None
The text was updated successfully, but these errors were encountered: