<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: SGML Python parsers benchmark</title>
	<atom:link href="http://www.oluyede.org/blog/2007/08/25/sgml-python-parsers-benchmark/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.oluyede.org/blog/2007/08/25/sgml-python-parsers-benchmark/</link>
	<description>Everything considered harmful</description>
	<lastBuildDate>Thu, 04 Mar 2010 12:28:24 +0100</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: A song for the lovers &#187; html5lib is getting faster</title>
		<link>http://www.oluyede.org/blog/2007/08/25/sgml-python-parsers-benchmark/comment-page-1/#comment-49771</link>
		<dc:creator>A song for the lovers &#187; html5lib is getting faster</dc:creator>
		<pubDate>Sat, 22 Sep 2007 15:08:06 +0000</pubDate>
		<guid isPermaLink="false">http://www.oluyede.org/blog/2007/08/25/sgml-python-parsers-benchmark/#comment-49771</guid>
		<description>&lt;p&gt;[...] ran the benchmark again with the 1014 revision of html5lib and I noticed a major speedup (altough miles far from the [...]&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>[...] ran the benchmark again with the 1014 revision of html5lib and I noticed a major speedup (altough miles far from the [...]</p>]]></content:encoded>
	</item>
	<item>
		<title>By: ludo</title>
		<link>http://www.oluyede.org/blog/2007/08/25/sgml-python-parsers-benchmark/comment-page-1/#comment-47383</link>
		<dc:creator>ludo</dc:creator>
		<pubDate>Sun, 02 Sep 2007 21:21:17 +0000</pubDate>
		<guid isPermaLink="false">http://www.oluyede.org/blog/2007/08/25/sgml-python-parsers-benchmark/#comment-47383</guid>
		<description>&lt;p&gt;I told you sgmlop kick ass. :)&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>I told you sgmlop kick ass. <img src='http://www.oluyede.org/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>]]></content:encoded>
	</item>
	<item>
		<title>By: Lawrence</title>
		<link>http://www.oluyede.org/blog/2007/08/25/sgml-python-parsers-benchmark/comment-page-1/#comment-47105</link>
		<dc:creator>Lawrence</dc:creator>
		<pubDate>Thu, 30 Aug 2007 23:09:58 +0000</pubDate>
		<guid isPermaLink="false">http://www.oluyede.org/blog/2007/08/25/sgml-python-parsers-benchmark/#comment-47105</guid>
		<description>&lt;p&gt;I used the SVN version, sorry I forgot to mention that.&lt;/p&gt;

&lt;p&gt;I guess you&#039;re right, maybe you can optimize the Python code a bit but I guess the real improvement is to port the tokenizer to C or maybe pyrex.&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>I used the SVN version, sorry I forgot to mention that.</p>

<p>I guess you&#8217;re right, maybe you can optimize the Python code a bit but I guess the real improvement is to port the tokenizer to C or maybe pyrex.</p>]]></content:encoded>
	</item>
	<item>
		<title>By: jgraham</title>
		<link>http://www.oluyede.org/blog/2007/08/25/sgml-python-parsers-benchmark/comment-page-1/#comment-47103</link>
		<dc:creator>jgraham</dc:creator>
		<pubDate>Thu, 30 Aug 2007 21:56:20 +0000</pubDate>
		<guid isPermaLink="false">http://www.oluyede.org/blog/2007/08/25/sgml-python-parsers-benchmark/#comment-47103</guid>
		<description>&lt;p&gt;Which version of html5lib did you use? The version in SVN should be somewhat faster than the quite outdated 0.9 release, but not enough to make up the difference compared to BeautifulSoup. I have a few ideas for making html5lib faster but it&#039;s basically limited by the need to process the input character so I don&#039;t think it will ever be truly fast without significant rearchitecture or a port of the tokenizer stage to C.&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>Which version of html5lib did you use? The version in SVN should be somewhat faster than the quite outdated 0.9 release, but not enough to make up the difference compared to BeautifulSoup. I have a few ideas for making html5lib faster but it&#8217;s basically limited by the need to process the input character so I don&#8217;t think it will ever be truly fast without significant rearchitecture or a port of the tokenizer stage to C.</p>]]></content:encoded>
	</item>
	<item>
		<title>By: karl dubost, W3C</title>
		<link>http://www.oluyede.org/blog/2007/08/25/sgml-python-parsers-benchmark/comment-page-1/#comment-47064</link>
		<dc:creator>karl dubost, W3C</dc:creator>
		<pubDate>Thu, 30 Aug 2007 10:27:09 +0000</pubDate>
		<guid isPermaLink="false">http://www.oluyede.org/blog/2007/08/25/sgml-python-parsers-benchmark/#comment-47064</guid>
		<description>&lt;p&gt;You have missed one in your candidates. HTML 5 Editor&#039;s draft defines a parsing for HTML Tag soup with a well define error recovery mechanism to create a DOM. Anne Van Kesteren and a few other persons built an HTML 5 parser in python.&lt;/p&gt;

&lt;p&gt;http://www.w3.org/html/wg/html5/
http://code.google.com/p/html5lib/&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>You have missed one in your candidates. HTML 5 Editor&#8217;s draft defines a parsing for HTML Tag soup with a well define error recovery mechanism to create a DOM. Anne Van Kesteren and a few other persons built an HTML 5 parser in python.</p>

<p><a href="http://www.w3.org/html/wg/html5/" rel="nofollow">http://www.w3.org/html/wg/html5/</a>
<a href="http://code.google.com/p/html5lib/" rel="nofollow">http://code.google.com/p/html5lib/</a></p>]]></content:encoded>
	</item>
	<item>
		<title>By: Just another WordPress weblog</title>
		<link>http://www.oluyede.org/blog/2007/08/25/sgml-python-parsers-benchmark/comment-page-1/#comment-46720</link>
		<dc:creator>Just another WordPress weblog</dc:creator>
		<pubDate>Sun, 26 Aug 2007 19:46:19 +0000</pubDate>
		<guid isPermaLink="false">http://www.oluyede.org/blog/2007/08/25/sgml-python-parsers-benchmark/#comment-46720</guid>
		<description>&lt;p&gt;[...] Lawrence Oluyede&#8217;s Blog (Lawrence Oluyede): SGML Python parsers benchmark [...]&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>[...] Lawrence Oluyede&#8217;s Blog (Lawrence Oluyede): SGML Python parsers benchmark [...]</p>]]></content:encoded>
	</item>
</channel>
</rss>
