-
Notifications
You must be signed in to change notification settings - Fork 240
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add support for regexp_extract on the GPU #4285
Conversation
build |
- `$` does not match the end of a string if the string ends with a line-terminator | ||
([cuDF issue #9620](https://github.com/rapidsai/cudf/issues/9620)) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This issue was resolved in #4239 but the docs did not get updated. I can raise a separate PR for this change if necessary.
build |
3ace09d
to
62ac1ef
Compare
Signed-off-by: Andy Grove <andygrove@nvidia.com>
90f174f
to
d8c9c0e
Compare
withResource(str.getBase.extractRe(cudfRegexPattern)) { extract => | ||
withResource(str.getBase.matchesRe(cudfRegexPattern)) { matches => | ||
withResource(str.getBase.isNull) { isNull => | ||
withResource(extract.getColumn(i - 1)) { extractedGroup => |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It would be nice if we could just ask cuDF to extract a single column and I have filed rapidsai/cudf#9855 to request that capability
sql-plugin/src/main/scala/org/apache/spark/sql/rapids/stringFunctions.scala
Show resolved
Hide resolved
Signed-off-by: Andy Grove <andygrove@nvidia.com>
Signed-off-by: Andy Grove <andygrove@nvidia.com>
sql-plugin/src/main/scala/org/apache/spark/sql/rapids/stringFunctions.scala
Outdated
Show resolved
Hide resolved
sql-plugin/src/main/scala/org/apache/spark/sql/rapids/stringFunctions.scala
Outdated
Show resolved
Hide resolved
sql-plugin/src/main/scala/org/apache/spark/sql/rapids/stringFunctions.scala
Outdated
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to me, just a small comment on a test.
build |
Signed-off-by: Andy Grove andygrove@nvidia.com
Closes #4002 and #4284
This PR adds support for
regexp_extract
.