Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[ENH] Guardian - infer language and add to corpus #925

Merged
merged 1 commit into from
Mar 14, 2023

Conversation

PrimozGodec
Copy link
Collaborator

@PrimozGodec PrimozGodec commented Jan 11, 2023

Derived from #874

Description of changes

Infer language from articles and add it to the corpus

In the current implementation, Corpus variables are created when GuardianAPI is initialized, so they are shared between multiple corpora. The problem appears when different values are added to DiscreteVariable by different calls, so values accumulate. This PR changes that variables are defined in each call separately.

Includes
  • Code changes
  • Tests
  • Documentation

@codecov-commenter
Copy link

codecov-commenter commented Jan 11, 2023

Codecov Report

Merging #925 (904e1b8) into master (25952ea) will increase coverage by 0.19%.
The diff coverage is 87.19%.

❗ Current head 904e1b8 differs from pull request most recent head 1de1617. Consider uploading reports for the commit 1de1617 to get more accurate results

Additional details and impacted files
@@            Coverage Diff             @@
##           master     #925      +/-   ##
==========================================
+ Coverage   77.53%   77.73%   +0.19%     
==========================================
  Files          86       86              
  Lines       12261    12300      +39     
  Branches     1608     1614       +6     
==========================================
+ Hits         9507     9561      +54     
+ Misses       2455     2441      -14     
+ Partials      299      298       -1     

@PrimozGodec
Copy link
Collaborator Author

/rebase

@VesnaT
Copy link
Contributor

VesnaT commented Mar 7, 2023

The change breaks the Publication Date:

image

@PrimozGodec
Copy link
Collaborator Author

I forgot about setting have_time and have_date attributes. Now it should be fixed

@VesnaT VesnaT merged commit 20dc93f into biolab:master Mar 14, 2023
@PrimozGodec PrimozGodec deleted the lagdetect-guardian branch March 14, 2023 09:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants