<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Importing the complete wikipedia database in 5 hours</title>
	<atom:link href="http://www.eriky.com/2008/11/importing-the-complete-english-wikipedia-database/feed" rel="self" type="application/rss+xml" />
	<link>http://www.eriky.com/2008/11/importing-the-complete-english-wikipedia-database</link>
	<description>Just another geek with a blog</description>
	<lastBuildDate>Mon, 02 Aug 2010 19:20:22 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
	<item>
		<title>By: Erik-Jan</title>
		<link>http://www.eriky.com/2008/11/importing-the-complete-english-wikipedia-database/comment-page-1#comment-120</link>
		<dc:creator>Erik-Jan</dc:creator>
		<pubDate>Mon, 02 Aug 2010 19:20:22 +0000</pubDate>
		<guid isPermaLink="false">http://www.eriky.com/?p=3#comment-120</guid>
		<description>Sorry for the late reply, but yes this can take very long. The more memory you put in your pc, the faster the indexing goes because swapping is the main cause I think.</description>
		<content:encoded><![CDATA[<p>Sorry for the late reply, but yes this can take very long. The more memory you put in your pc, the faster the indexing goes because swapping is the main cause I think.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: GeBoZhuang</title>
		<link>http://www.eriky.com/2008/11/importing-the-complete-english-wikipedia-database/comment-page-1#comment-117</link>
		<dc:creator>GeBoZhuang</dc:creator>
		<pubDate>Mon, 28 Jun 2010 14:04:37 +0000</pubDate>
		<guid isPermaLink="false">http://www.eriky.com/?p=3#comment-117</guid>
		<description>Hi Erik, I&#039;m also trying to import wikipedia using Plan B:) But after many tries, there is some problem remained and I wanna to know whether you had encountered it.

After DROP PRIMARY KEY, DROP INDEX and MODIFY (auto_increment column) on tables &quot;page&quot;, &quot;revision&quot; and &quot;text&quot;, the importing speed is indeed very fast(about 2000/sec). But when the importing is finished and I want to ADD the primary key and the indexes, the speed is very slow, especialy on table &quot;text&quot;. It taking more than 4 hours before I canceled it, since I don&#039;t know whether there is some problems.   

On your computer, how much time did it take to recreate the indexes after importing ? Thank you :)</description>
		<content:encoded><![CDATA[<p>Hi Erik, I&#8217;m also trying to import wikipedia using Plan B:) But after many tries, there is some problem remained and I wanna to know whether you had encountered it.</p>
<p>After DROP PRIMARY KEY, DROP INDEX and MODIFY (auto_increment column) on tables &#8220;page&#8221;, &#8220;revision&#8221; and &#8220;text&#8221;, the importing speed is indeed very fast(about 2000/sec). But when the importing is finished and I want to ADD the primary key and the indexes, the speed is very slow, especialy on table &#8220;text&#8221;. It taking more than 4 hours before I canceled it, since I don&#8217;t know whether there is some problems.   </p>
<p>On your computer, how much time did it take to recreate the indexes after importing ? Thank you <img src='http://www.eriky.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
</channel>
</rss>
