Unicode

Locale magic (literally)

Submitted by Christoph on 9 August, 2009 - 13:42

Another programming-centric post and follow-up on Thursday's post about locale issues with Turkish.

So, I showed some Problems with locale-dependant mappings using the case of Turkish, that mapps small Latin character i to uppercase İ, which also has a dot on top. Now join me on some more magic. On Unix you need to have the proper locale generated, which under debian works with dpkg-reconfigure locales.

»

Wrong Diacritics with Pinyin

Submitted by Christoph on 2 August, 2009 - 16:46

Dear lazyweb,

I am looking for examples of bad Pinyin where invalid diacritics are being used.

I have two already for the third tone, but I am curious if other tones also see similar errors.

Xiàndài Hànyû Dàcídiân (Circumflex)
Wŏ huì shuō yìdiănr (Breve)

»

Followup on "Python doctest and Unicode"

Submitted by Christoph on 21 July, 2009 - 12:05

I complained about Python doctest and Unicode some time ago. This was an itch I finally wanted to scratch, so I followed the popular saying: "Luke, read the source".

Turns out the error in question is fixed pretty easily. Python needs to properly encode the output, so a conversion to the output stream's encoding did the trick. Now a new issue came up.

»

Write upside down using Unicode IPA characters

Submitted by Christoph on 20 October, 2008 - 16:34

¡ǝpoɔıun oʇ sʞuɐɥʇ uʍop ǝpısdn ǝʇıɹʍ uɐɔ noʎ 'ʇou ɹo ʇı ǝʌǝılǝq.

This is nothing new (see [1] or [2]), but I wrote a small Python program for this to pipe my instant messaging through it. Needless to say that with KDE's Kopete I found yet another application that seems not to work with non-ASCII characters.

»

Read more

Python doctest and Unicode

Submitted by Christoph on 25 September, 2008 - 06:18

Today I found out about Python's doctest module. "The doctest module searches for pieces of text that look like interactive Python sessions, and then executes those sessions to verify that they work exactly as shown."

»

Christoph's CJK-centered concerns

Navigation

tags in site content

Archive

Blogs I read

Locale magic (literally)

Wrong Diacritics with Pinyin

Followup on "Python doctest and Unicode"

Write upside down using Unicode IPA characters

Python doctest and Unicode