ValueError: Transformation generated invalid chunkstring #10

Closed
Kowsalya-Mouttouramane opened this issue May 16, 2022 · 2 comments
Labels
bug Something isn't working

Comments

@Kowsalya-Mouttouramane

Kowsalya-Mouttouramane commented May 16, 2022

from keyphrase_vectorizers import KeyphraseCountVectorizer

test = ["Les voitures autonomes déplacent la responsabilité de l'assurance vers les constructeurs"]
vectorizer_fr = KeyphraseCountVectorizer(spacy_pipeline='fr_dep_news_trf', pos_pattern='<N.*>+', stop_words='french')
vectorizer_fr.fit(test)

This raises a ValueError: Transformation generated invalid chunkstring:
<><><><><><><><><><><><>

The same code works with other languages; the problem occurs only with the French spaCy models, whichever French model I use.
Can anyone help me solve this error, please?

@hboisgibault

Hi @Kowsalya-Mouttouramane,
See issue #2. For the French pipeline, you need to add the transformer pipeline component.
You can use my fork, or create a pull request to enable custom components.
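
For anyone trying to confirm the cause: the run of `<>` in the error message suggests that every token reached the chunker with an empty POS tag, which would fit a tagging component that did not run. The snippet below is only a minimal sketch of that idea in plain Python. It does not call spaCy, NLTK, or KeyphraseVectorizers; `build_chunkstring` and `tokens_without_tags` are hypothetical helper names, and the tagged tokens are made-up sample data, not real model output.

```python
def build_chunkstring(tagged_tokens):
    """Join POS tags as <TAG> items, roughly how a regex chunker sees them."""
    return "".join(f"<{tag}>" for _, tag in tagged_tokens)


def tokens_without_tags(tagged_tokens):
    """Return the tokens whose POS tag is empty or missing."""
    return [token for token, tag in tagged_tokens if not tag]


# Hypothetical tagger output with every tag empty (assumption: this is
# what the chunker receives when the tagging component did not run).
untagged = [("Les", ""), ("voitures", ""), ("autonomes", "")]

# Hypothetical output when tagging works.
tagged = [("Les", "DET"), ("voitures", "NOUN"), ("autonomes", "ADJ")]

print(build_chunkstring(untagged))    # <><><>
print(build_chunkstring(tagged))      # <DET><NOUN><ADJ>
print(tokens_without_tags(untagged))  # ['Les', 'voitures', 'autonomes']
```

If a check like this on the real `(token.text, token.tag_)` pairs from your spaCy pipeline reports untagged tokens, the pipeline configuration is the likely culprit rather than the `pos_pattern`.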

@TimSchopf
Owner

Closing this as a duplicate of issue #2.

@TimSchopf TimSchopf added the bug Something isn't working label Jun 19, 2022
@TimSchopf TimSchopf self-assigned this Jun 19, 2022