-
Notifications
You must be signed in to change notification settings - Fork 67
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Some issue with Unicode characters, maybe #5
Comments
Hi, we have a similar issue. The issue appeared here:
(UkkonenTrie.cs -> Line: 207) word: "walross" I guess, that you internally transform the "ß" to ss? Or that the code is interpretating the "ss" as "ß"? Our current workaround is to tralce all "ß" with "ss" and thats it, but it has annoying implications. Thank you Kind Regards |
This issue seems like being caused by globalization, I solved this issue by add the following runtime option: UPDATED====== For my case, the root cause of this issue is that at least in "en-US" and "中文(中国)" Culture, "ANYSTR".StartsWith("ANYSTR\u200B") returns True |
Got an exception
System.ArgumentOutOfRangeException: startIndex cannot be larger than length of string. Parameter name: startIndex at System.String.Substring(Int32 startIndex, Int32 length) at Gma.DataStructures.StringSearch.UkkonenTrie
1.TestAndSplit(Node
1 inputs, String stringPart, Char t, String remainder, T value) at Gma.DataStructures.StringSearch.UkkonenTrie1.Update(Node
1 inputNode, String stringPart, String rest, T value) at Gma.DataStructures.StringSearch.UkkonenTrie`1.Add(String key, T value) at TPB.Business.PirateBayDumpProcessor.Process(FileInfo file) in D:_Projects\TPB\TPB.Business\PirateBayDumpProcessor.cs:line 57 at TPB.ConsoleTester.Program.Main(String[] args) in D:_Projects\TPB\TPB.ConsoleTester\Program.cs:line 12} | System.ArgumentOutOfRangeExceptionwhen trying (pun not intended): trie.Add(entry.Name, entry);
where entry.Name was Tjockare än vatten (Thicker Than Water) - S02 E08 - 720p x265 H
The text was updated successfully, but these errors were encountered: