A few weeks ago I started a beginner level Chinese (Mandarin) evening class at the local community college. We're about four weeks in and I'm really enjoying it, will definitely do more.
I thought it would be an interesting project to load a Chinese-English dictionary onto my Kindle for reference. I've already played with kindlegen, which takes a
(
Read more... )
Comments 9
I wonder whether the reason is that the Kindle is not displaying Chinese text, but rather Japanese text!
That would explain why characters that are used in Japanese (such as 他) would display but but characters that aren't used in Japanese (such as 她) would not.
And might explain why setting the locale to P.R. China would work: it might select a different default font, one with coverage of Simplified Chinese characters rather than Japanese ones.
Reply
Reply
Reply
That's true, too, due to "Han unification".
Most shapes are identical or at least very similar, but in some cases, there are noticeable differences. (Perhaps a comparison in Latin script terms, which allows for different character shapes, too, is hand-written vs. printed lower-case "a" or "g".)
However, I don't see why that would result in a character not displaying at all...
Because the font it uses doesn't cover every single Unicode character but only the ones it thinks will be needed?
If that were the reason, though, then it would mean that Amazon-provided eBooks (where the characters show up) must use a different font than self-made ones or plain-text UTF8 files (where certain characters don't).
Reply
Perhaps the Kindle prefers to show characters in a consistent style based on the language setting, rather than mixing up different styles because not all characters are available in each style. Since it's a reading device, I can appreciate that.
Reply
I don't know much about this particular issue. It sounds like a disaster, if there are Unicode characters that don't have a single meaning but depend on having barely documented settings set correctly in one's device. Was there really not a better way for Unicode to be designed? I thought the whole point of it was to avoid this code of alternate character set problem...?
Reply
Leave a comment