[DETECTION] EUC-KR files are not detected correctly. #356
Labels: detection (Related to the charset detection mechanism, chaos/mess/coherence)
kzrnm added the labels detection (Related to the charset detection mechanism, chaos/mess/coherence) and help wanted (Extra attention is needed) on Oct 5, 2023
Ousret added a commit that referenced this issue on Oct 19, 2023
I could reproduce this and propose a patch that improves the situation. The file will be kept in our data collection if you don't oppose it.
EUC-KR files are not detected correctly, although Charset-Normalizer 2.1.1 detected them correctly.
Notice
I hereby announce that my raw input is not :
Provide the file
https://github.com/competitive-verifier/competitive-verifier/blob/bc30581761d4ae94f79f1daf8e9647dc2a7a67f0/examples/tests/encoding/EUC-KR.txt
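A minimal reproduction sketch, offered as an illustration rather than the reporter's actual script. `SAMPLE_TEXT` is a hypothetical stand-in for the linked file, and the helper names `decodable_with` and `guess_encoding` are made up for this example. The first helper uses only the standard library to show why EUC-KR is hard to pin down: the same bytes also decode cleanly under other codecs, so a detector has to rank candidates instead of relying on decode errors. The second helper calls `charset_normalizer.from_bytes(...).best()` when the package is installed and returns `None` otherwise.

```python
# Hypothetical stand-in for the linked EUC-KR.txt; not the file's real content.
SAMPLE_TEXT = "안녕하세요. 한국어 문서입니다."


def decodable_with(payload: bytes, codecs: list[str]) -> list[str]:
    """Return the codec names that decode `payload` without raising."""
    accepted = []
    for name in codecs:
        try:
            payload.decode(name)
        except UnicodeDecodeError:
            continue
        accepted.append(name)
    return accepted


def guess_encoding(payload: bytes):
    """Return charset-normalizer's best guess, or None if the package
    is not installed or no plausible match was found."""
    try:
        from charset_normalizer import from_bytes
    except ImportError:
        return None
    best = from_bytes(payload).best()
    return best.encoding if best is not None else None


payload = SAMPLE_TEXT.encode("euc_kr")

# Several codecs accept the same bytes; only some yield the original text.
print(decodable_with(payload, ["utf_8", "euc_kr", "cp949", "latin_1"]))

# The detector's guess depends on the installed version and the input length.
print(guess_encoding(payload))
```

To check the report itself, read the linked file in binary mode and pass its bytes to `guess_encoding` under each Charset-Normalizer version being compared. A short sample like the one above may not behave the same way as the full file.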
Verbose output
Expected encoding
A clear and concise description of the encoding you expected. Any further details about how the current guess is wrong are very much appreciated.
Desktop (please complete the following information):
Additional context
Add any other context about the problem here.