Category Archives: Unicode

Unicode – the basics

An introduction to the basics of Unicode, distilled from several earlier posts. In the interests of presenting the big picture, I have painted with a broad brush — large areas are summarized; nits are not picked; hairs are not split; … Continue reading

Posted in Unicode | 3 Comments

Unicode for dummies — Encoding

Another entry in an irregular series of posts about Unicode. Typos fixed 2012-02-22. Thanks Anonymous, and Clinton, for reporting the typos. This is a story about encoding and decoding, with a minor subplot involving Unicode. As our story begins — … Continue reading


Posted in Unicode | 5 Comments

Unicode for dummies – just use UTF-8

Revised 2012-03-18 — fixed a bad link, and removed an incorrect statement about the origin of the terms “big-endian” and “little-endian”. Commenting on my previous post about Unicode, an anonymous commentator noted that the usage of the BOM [the Unicode … Continue reading

Posted in Unicode | 9 Comments

Unicode Beginners Introduction for Dummies Made Simple

I’ve been trying to grok Unicode, and it hasn’t been easy.  But today, I finally got it.  And, as it turns out, the basics of Unicode aren’t too difficult. The problems that I’ve been having turn out not to be … Continue reading

Posted in Unicode | 12 Comments