Skip to content

v0.9.0

Latest
Compare
Choose a tag to compare
@vmenger vmenger released this 10 Jul 13:17
b5f046c

0.9.0 (2024-07-10)

Added

  • Mantra GSC corpus for evaluation
  • Loading and exporting InfoExtractionDataset as dictionaries or JSON files
  • Metric support for multi-class qualifiers
  • In the RuleBasedEntityMatcher, option to add terms as a dict (in addition to str, list and Term)
  • In the RuleBasedEntityMatcher, option to add terms from dict (add_terms_from_dict), json (add_terms_from_json) or csv (add_terms_from_csv)
  • In the Term class, an option to override arguments that were not set

Changed

  • Moved regression test cases to data directory in more open format, so they are re-usable
  • Made the default field for Qualifier optional
  • InfoExtractionDataset and InfoExtractionMetrics use Qualifier objects for qualifiers rather than dict
  • InfoExtractionDataset and InfoExtractionMetrics no longer track or use qualifier defaults
  • Made qualifiers optional for metrics in Annotation
  • Added a normalize method to Normalizer, so it can be used/tested directly
  • The logic for determining whether the RuleBasedEntityMatcher should internally use the phrase matcher or the matcher is simplified

Deprecated

  • ❗ The create_concept_dict method, which is now replaced by add_terms_from_csv in RuleBasedEntityMatcher
  • ❗ In the RuleBasedEntityMatcher, the load_concepts method, which is now replaced by add_terms_from_dict and add_terms_from_json