<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: Unicode for dummies – just use UTF-8</title>
	<atom:link href="http://pythonconquerstheuniverse.wordpress.com/2010/06/04/unicode-for-dummies-just-use-utf-8/feed/" rel="self" type="application/rss+xml" />
	<link>http://pythonconquerstheuniverse.wordpress.com/2010/06/04/unicode-for-dummies-just-use-utf-8/</link>
	<description>Adventures across space and time with the Python programming language</description>
	<lastBuildDate>Sat, 04 May 2013 20:54:54 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: Aditya</title>
		<link>http://pythonconquerstheuniverse.wordpress.com/2010/06/04/unicode-for-dummies-just-use-utf-8/#comment-2897</link>
		<dc:creator><![CDATA[Aditya]]></dc:creator>
		<pubDate>Mon, 29 Apr 2013 10:50:49 +0000</pubDate>
		<guid isPermaLink="false">http://pythonconquerstheuniverse.wordpress.com/?p=430#comment-2897</guid>
		<description><![CDATA[Nice article,lucid ,concise and to the point.Though a small paragraph on why UTF-16 did come up or why someone/some people felt the need for UTF-16 would make &quot;always using UTF-8&quot; an even better &quot;informed&quot; decision. Juxtapose them ,I feel it may help. Anyways it&#039;s just a suggestion :)]]></description>
		<content:encoded><![CDATA[<p>Nice article,lucid ,concise and to the point.Though a small paragraph on why UTF-16 did come up or why someone/some people felt the need for UTF-16 would make &#8220;always using UTF-8&#8243; an even better &#8220;informed&#8221; decision. Juxtapose them ,I feel it may help. Anyways it&#8217;s just a suggestion <img src='http://s0.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steve Ferg</title>
		<link>http://pythonconquerstheuniverse.wordpress.com/2010/06/04/unicode-for-dummies-just-use-utf-8/#comment-1391</link>
		<dc:creator><![CDATA[Steve Ferg]]></dc:creator>
		<pubDate>Sun, 18 Mar 2012 20:42:07 +0000</pubDate>
		<guid isPermaLink="false">http://pythonconquerstheuniverse.wordpress.com/?p=430#comment-1391</guid>
		<description><![CDATA[You&#039;re absolutely correct.  This has been fixed.  Thanks!]]></description>
		<content:encoded><![CDATA[<p>You&#8217;re absolutely correct.  This has been fixed.  Thanks!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Pim</title>
		<link>http://pythonconquerstheuniverse.wordpress.com/2010/06/04/unicode-for-dummies-just-use-utf-8/#comment-858</link>
		<dc:creator><![CDATA[Pim]]></dc:creator>
		<pubDate>Mon, 12 Sep 2011 09:20:13 +0000</pubDate>
		<guid isPermaLink="false">http://pythonconquerstheuniverse.wordpress.com/?p=430#comment-858</guid>
		<description><![CDATA[I agree with Paul that the BOM is a useful signature check for UTF-8 files.
In fact, it&#039;s more useful for UTF-8 than for UTF-16 or UTF-32. If a computer program has to guess what encoding a BOM-less file is in, checking the first four bytes will provide enough information in the majority of cases. But not for UTF-8! UTF-8 will look just like any 1-byte charset in the absense of a BOM.]]></description>
		<content:encoded><![CDATA[<p>I agree with Paul that the BOM is a useful signature check for UTF-8 files.<br />
In fact, it&#8217;s more useful for UTF-8 than for UTF-16 or UTF-32. If a computer program has to guess what encoding a BOM-less file is in, checking the first four bytes will provide enough information in the majority of cases. But not for UTF-8! UTF-8 will look just like any 1-byte charset in the absense of a BOM.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: costy</title>
		<link>http://pythonconquerstheuniverse.wordpress.com/2010/06/04/unicode-for-dummies-just-use-utf-8/#comment-766</link>
		<dc:creator><![CDATA[costy]]></dc:creator>
		<pubDate>Wed, 27 Apr 2011 17:07:45 +0000</pubDate>
		<guid isPermaLink="false">http://pythonconquerstheuniverse.wordpress.com/?p=430#comment-766</guid>
		<description><![CDATA[Hello! I’m at work surfing around your blog from my new iphone 3gs! Just wanted to say I love reading through your blog and look forward to all your posts! Carry on the great work!]]></description>
		<content:encoded><![CDATA[<p>Hello! I’m at work surfing around your blog from my new iphone 3gs! Just wanted to say I love reading through your blog and look forward to all your posts! Carry on the great work!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Walter Dörwald</title>
		<link>http://pythonconquerstheuniverse.wordpress.com/2010/06/04/unicode-for-dummies-just-use-utf-8/#comment-419</link>
		<dc:creator><![CDATA[Walter Dörwald]]></dc:creator>
		<pubDate>Thu, 28 Oct 2010 09:36:41 +0000</pubDate>
		<guid isPermaLink="false">http://pythonconquerstheuniverse.wordpress.com/?p=430#comment-419</guid>
		<description><![CDATA[Also see http://docs.python.org/library/codecs.html#encodings-and-unicode for an explaination of UTF-8 and the BOM]]></description>
		<content:encoded><![CDATA[<p>Also see <a href="http://docs.python.org/library/codecs.html#encodings-and-unicode" rel="nofollow">http://docs.python.org/library/codecs.html#encodings-and-unicode</a> for an explaination of UTF-8 and the BOM</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Simon Hibbs</title>
		<link>http://pythonconquerstheuniverse.wordpress.com/2010/06/04/unicode-for-dummies-just-use-utf-8/#comment-418</link>
		<dc:creator><![CDATA[Simon Hibbs]]></dc:creator>
		<pubDate>Thu, 28 Oct 2010 08:30:21 +0000</pubDate>
		<guid isPermaLink="false">http://pythonconquerstheuniverse.wordpress.com/?p=430#comment-418</guid>
		<description><![CDATA[Quote: These terms are derived from “Big End In” and “Little End In.”

I&#039;ve seen that theory posited before, but Danny Cohen explicitly refers to Guliver in his article, so it&#039;s pretty clear his usage of the terms derives from Swift and the Lilliputians dispute as to which end of an egg should be eaten first.]]></description>
		<content:encoded><![CDATA[<p>Quote: These terms are derived from “Big End In” and “Little End In.”</p>
<p>I&#8217;ve seen that theory posited before, but Danny Cohen explicitly refers to Guliver in his article, so it&#8217;s pretty clear his usage of the terms derives from Swift and the Lilliputians dispute as to which end of an egg should be eaten first.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Paul</title>
		<link>http://pythonconquerstheuniverse.wordpress.com/2010/06/04/unicode-for-dummies-just-use-utf-8/#comment-296</link>
		<dc:creator><![CDATA[Paul]]></dc:creator>
		<pubDate>Mon, 07 Jun 2010 16:14:29 +0000</pubDate>
		<guid isPermaLink="false">http://pythonconquerstheuniverse.wordpress.com/?p=430#comment-296</guid>
		<description><![CDATA[Its a shame that the BOM is frowned upon when a file is encoded UTF8, since it provides a useful signature check.]]></description>
		<content:encoded><![CDATA[<p>Its a shame that the BOM is frowned upon when a file is encoded UTF8, since it provides a useful signature check.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: flow</title>
		<link>http://pythonconquerstheuniverse.wordpress.com/2010/06/04/unicode-for-dummies-just-use-utf-8/#comment-292</link>
		<dc:creator><![CDATA[flow]]></dc:creator>
		<pubDate>Sat, 05 Jun 2010 15:41:03 +0000</pubDate>
		<guid isPermaLink="false">http://pythonconquerstheuniverse.wordpress.com/?p=430#comment-292</guid>
		<description><![CDATA[you are perfectly right that character encoding schemes often feel, and act, like ‘brittle ice’. however, i want to promote the idea that especially in an environment like ms windows, with all the legacy stuff lying around, where several layers of the system apply differing heuristics to render text and rely on differing assumptions, trying to ‘max out on utf-8’ is the only thing that will work in the long run. this strategy means (eg in a web application) that when tests or actual usage reveal an encoding error, go through the entire tool stack — os, web server, database, everything — and re-affirm that yes, i want to use utf-8 here. make sure the web page comes out with a meta header stating utf-8, and that the http response header also states utf-8. make sure your source files are all in utf-8, and that your editor / ide also uses that. ah, and for filenames: [-a-z0-8_.], except for tested exceptions that work. this last one is mainly intended for when you want to package your sources in a *.tgz or *.zip and unpack it on another platform; this will often lead to funny results when you go beyond 7-bit us ascii.]]></description>
		<content:encoded><![CDATA[<p>you are perfectly right that character encoding schemes often feel, and act, like ‘brittle ice’. however, i want to promote the idea that especially in an environment like ms windows, with all the legacy stuff lying around, where several layers of the system apply differing heuristics to render text and rely on differing assumptions, trying to ‘max out on utf-8’ is the only thing that will work in the long run. this strategy means (eg in a web application) that when tests or actual usage reveal an encoding error, go through the entire tool stack — os, web server, database, everything — and re-affirm that yes, i want to use utf-8 here. make sure the web page comes out with a meta header stating utf-8, and that the http response header also states utf-8. make sure your source files are all in utf-8, and that your editor / ide also uses that. ah, and for filenames: [-a-z0-8_.], except for tested exceptions that work. this last one is mainly intended for when you want to package your sources in a *.tgz or *.zip and unpack it on another platform; this will often lead to funny results when you go beyond 7-bit us ascii.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: ΤΖΩΤΖΙΟΥ</title>
		<link>http://pythonconquerstheuniverse.wordpress.com/2010/06/04/unicode-for-dummies-just-use-utf-8/#comment-290</link>
		<dc:creator><![CDATA[ΤΖΩΤΖΙΟΥ]]></dc:creator>
		<pubDate>Fri, 04 Jun 2010 23:13:07 +0000</pubDate>
		<guid isPermaLink="false">http://pythonconquerstheuniverse.wordpress.com/?p=430#comment-290</guid>
		<description><![CDATA[The “Just use UTF-8 and forget about the BOM” is a sound advice, one with which I couldn&#039;t agree more. However, this leaves millions of MS Windows users on brittle ice. Which version of Windows supports correctly UTF-8 as the default “codepage” for the system (and accordingly the “chcp 65001” thing for the command prompt ecosystem)?]]></description>
		<content:encoded><![CDATA[<p>The “Just use UTF-8 and forget about the BOM” is a sound advice, one with which I couldn&#8217;t agree more. However, this leaves millions of MS Windows users on brittle ice. Which version of Windows supports correctly UTF-8 as the default “codepage” for the system (and accordingly the “chcp 65001” thing for the command prompt ecosystem)?</p>
]]></content:encoded>
	</item>
</channel>
</rss>
