Everything you did not want to know about Unicode in Python 3

“Readers of this blog on my twitter feed know me as a person that likes to rant about Unicode in Python 3 a lot. This time will be no different. I’m going to tell you more about how painful “doing Unicode right” is and why. “Can you not just shut up Armin?”. I spent two weeks fighting with Python 3 again and I need to vent my frustration somewhere. On top of that there is still useful information in those rants because it teaches you how to deal with Python 3. Just don’t read it if you get annoyed by me easily.

There is one thing different about this rant this time. It won’t be related to WSGI or HTTP or any of that other stuff at all. Usually I’m told that I should stop complaining about the Python 3 Unicode system because I wrote code nobody else writes (HTTP libraries and things of that sort) I decided to write something else this time: a command line application. And not just the app, I wrote a handy little library called click to make this easier.

Note that I’m doing what about every newby Python programmer does: writing a command line application. The “Hello World” of Python programs. But unlike the newcomer to Python I wanted to make sure the application is as stable and Unicode supporting as possible for both Python 2 and Python 3 and make it possible to unittest it. So this is my report on how that went…”