I'm a student looking to work on MediaWiki during this year's Google Summer of Code, and one of the ideas I've been interested in is in various formats for the data dumps (and dump work in general).
How useful would dumps from wikipedia be, if they were in sqlite databases? Would it be useful to have all the dumps as sqlite (history, stubs, current, etc)? Or are there certain dumps (current, for example) which would be very useful as databases?
The dumps wouldn't be direct dumps from the mysql database (unlike the old SQL Dumps) - they'll be in a format optimized for data processing and imports. I'll also write supporting code such as libraries for reading the databases, etc.
What do you folks think?