handle unicode strings properly #167

jodok · 2018-06-01T07:48:37Z

#36
Azure/azure-cli#6408

wiggin15 · 2018-06-14T06:36:49Z

This change is not compatible with Python 3 (see Travis build). Do you think you can fix it? Thanks

Delgan · 2018-06-16T18:07:57Z

Hi @jodok

Are you sure the issue is from colorama side?

Encoding in Python is notoriously known to be hard.

I think the problem may come from environment or terminal.

I could not reproduce your issue, but as you linked to #36, I tried to print s = u'í' with and without calling colorama.init().
In both cases, the unicode í character was properly displayed.
Then I tried with your patch, and when printing í after colorama.init(), it was replaced by an empty string ''. 😕

I guess my Windows terminal is for some reason correctly configured to handle unicode, I would be disappointed not to be able to enjoy it with colorama.

jodok · 2018-06-17T07:41:34Z

my root cause is this issue: Azure/azure-cli#6408 - and there the issue only occurs if it is called in an non-interactive session (e.g. from jenkins).

jodok · 2018-07-20T20:23:27Z

@wiggin15 are you happy with my fixup?

wiggin15 · 2018-07-21T18:32:48Z

Hi @jodok . The fixup for Python 3 looks good but I'm not sure about the solution. If I understand correctly, this change will skip and ignore non-ascii character, which may be a problem, for two reasons:

The underlying terminal may support non-ascii characters. In this case we can write them and don't need to ignore them.
This can cause strange output - e.g. when the user prints "Bokmål" (or, say, u'Bokm\xe5l') we will write "Bokml" which doesn't look right.

Perhaps we should consider trying to encode with self.stream.encoding instead of ascii?
(by the way, I am also unable to reproduce this reliably)

tartley · 2020-10-13T14:13:52Z

Hey. FYI, yesterday I created a PR to test releases before we push them to PyPI. When that is merged, I'll be more confident about resuming merges and releases. I'll try to look at this PR soon. Thank you for creating it!

tartley · 2021-10-07T04:51:52Z

Hey. Huge thanks for this idea. But I'm going to close this for now, because:

In order to be considered for merging, any changes needs to come with accompanying tests that demonstrate the problem that they fix.
wiggin's reservations here don't seem to have been addressed.

Please do re-open or re-submit if you think I'm making a mistake, or if you address the above problems. Many thanks!

jodok added 2 commits June 16, 2018 13:40

handle unicode strings properly (tartley#36, Azure/azure-cli#6408)

49f2f54

!fixup python3 compatibility

3562d1a

jodok force-pushed the jb/fix-unicode branch from 9c703ca to 3562d1a Compare June 16, 2018 10:41

jodok mentioned this pull request Jul 20, 2018

az acr build throws UnicodeEncodeError: 'ascii' codec can't encode character Azure/azure-cli#6408

Closed

tartley closed this Oct 7, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

handle unicode strings properly #167

handle unicode strings properly #167

jodok commented Jun 1, 2018

wiggin15 commented Jun 14, 2018

Delgan commented Jun 16, 2018 •

edited

Loading

jodok commented Jun 17, 2018

jodok commented Jul 20, 2018

wiggin15 commented Jul 21, 2018

tartley commented Oct 13, 2020

tartley commented Oct 7, 2021 •

edited

Loading

handle unicode strings properly #167

handle unicode strings properly #167

Conversation

jodok commented Jun 1, 2018

wiggin15 commented Jun 14, 2018

Delgan commented Jun 16, 2018 • edited Loading

jodok commented Jun 17, 2018

jodok commented Jul 20, 2018

wiggin15 commented Jul 21, 2018

tartley commented Oct 13, 2020

tartley commented Oct 7, 2021 • edited Loading

Delgan commented Jun 16, 2018 •

edited

Loading

tartley commented Oct 7, 2021 •

edited

Loading