On Thu, Dec 04, 2008 at 07:09:36PM -0500, Gregory Maxwell wrote:
It takes about ~10 minutes to create a mwdump to git converter using git-fast-import. I did this for amusement once in order to run git-blame on articles.
How fast was the git import? How many articles did you try to import? How was the storage requirements? How effective was the git blame, since it would only work at the line (paragraph) level?
I considered doing this but I got sidetracked doing a word-level blame function (see http://hewgill.com/journal/entries/461-wikipedia-blame) and never got back to the git import.
I would like to see a properly maintained copy of wikipedia in git, particularly so I could clone and keep it up to date.
Greg Hewgill http://hewgill.com