Skip to content

Commit

Permalink
Merge pull request #1157 from weaviate/add-kagome-kr-tokenizer
Browse files Browse the repository at this point in the history
Add support for the kagome_kr tokenizer
  • Loading branch information
dirkkul committed Jul 4, 2024
2 parents 2541408 + 60d7849 commit a09598b
Showing 1 changed file with 3 additions and 0 deletions.
3 changes: 3 additions & 0 deletions weaviate/collections/classes/config.py
Original file line number Diff line number Diff line change
Expand Up @@ -134,6 +134,8 @@ class Tokenization(str, Enum):
Tokenize using GSE (for Chinese and Japanese).
`TRIGRAM`
Tokenize into trigrams.
`KAGOME_KR`
Tokenize using the 'Kagome' tokenizer and a Korean MeCab dictionary (for Korean).
"""

WORD = "word"
Expand All @@ -142,6 +144,7 @@ class Tokenization(str, Enum):
FIELD = "field"
GSE = "gse"
TRIGRAM = "trigram"
KAGOME_KR = "kagome_kr"


class GenerativeSearches(str, Enum):
Expand Down

0 comments on commit a09598b

Please sign in to comment.