Replace usage of PreprocessLogParsingStage with PreprocessNLPStage #842

dagardner-nv
Contributor

Description

  • The code in PreprocessLogParsingStage was about 99% identical to the code in PreprocessNLPStage.
  • The only things PreprocessLogParsingStage provided were different default constructor values and some special handling of punctuation. However, the cudf subword_tokenizer removes all punctuation, so that special handling was redundant.
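
To illustrate the first point, here is a minimal sketch of why a stage that differs only in its constructor defaults can be replaced by the general stage with those values passed explicitly. The class names, field names, and default values below are hypothetical stand-ins chosen for illustration; they are not the actual Morpheus signatures or defaults.

```python
from dataclasses import asdict, dataclass


@dataclass
class NLPPreprocessSketch:
    """Hypothetical stand-in for the general stage (not the Morpheus API)."""
    vocab_hash_file: str
    truncation: bool = False
    do_lower_case: bool = False
    stride: int = -1


@dataclass
class LogParsingPreprocessSketch(NLPPreprocessSketch):
    """Hypothetical near-duplicate: same fields, only the defaults differ."""
    truncation: bool = True
    stride: int = 64


# Old usage: rely on the specialized class for its defaults.
old_style = LogParsingPreprocessSketch(vocab_hash_file="vocab.txt")

# New usage: use the general class and spell out the values that differed.
new_style = NLPPreprocessSketch(vocab_hash_file="vocab.txt",
                                truncation=True,
                                stride=64)

# Both objects end up with the same configuration.
same_config = asdict(old_style) == asdict(new_style)
```

Under these assumptions, call sites only need to pass the previously implicit values as keyword arguments, and the specialized class can be dropped.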

fixes #801

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

@dagardner-nv added the "help wanted", "non-breaking", and "3 - Ready for Review" labels Apr 4, 2023
@dagardner-nv requested a review from a team as a code owner April 4, 2023 16:26
@dagardner-nv added the "improvement" label and removed the "help wanted" label Apr 4, 2023
@mdemoret-nv
Contributor

/merge

@rapids-bot rapids-bot bot merged commit bfcee5f into nv-morpheus:branch-23.07 Apr 10, 2023
Labels
improvement, non-breaking
Projects
Archived in project
Development

Successfully merging this pull request may close these issues.

[FEA]: PreprocessLogParsingStage should be folded into PreprocessNLPStage
2 participants