On Wed 02/12/09 21:32, "Madeleine Price Ball" meprice@fas.harvard.edu wrote:
We think that the specific knowledge of the publishers should be how to select the content - which content goes where in which form - and not technical questions such as compression, storage or retrieving the data on the user's end.
OK, if I shouldn't be talking to you guys, tell me who to talk to.
Yes, selecting content is very difficult. I couldn't get Peru or SJ to contribute meaningfully to generating a simple blacklist of articles that should NOT be included on the OLPC activity. (Recall it is being given to young children!) I ended up making the blacklist myself based on my own gut feelings. If Peru's board of education or OLPC's "director of content" couldn't get their act together for this simple task, expecting others to do this task for you will be a huge roadblock to getting content out.
Traffic-based content selection is simple and effective, and it doesn't involve a lot of opinions on what should or should not be included.
Yes, by combining such traffic stats with incoming-link counts and interwiki-link counts, it is possible to get a pretty accurate and "neutral" selection quickly for any Wikipedia. I have scripts to do that automatically: http://kiwix.svn.sourceforge.net/viewvc/kiwix/selection_tools/
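
To give a rough idea of the principle, here is a minimal sketch in Python. It is not the code from the scripts linked above: the names (ArticleStats, select_articles), the rank-sum combination of the three counters and the optional blacklist are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ArticleStats:
    """Per-article counters (hypothetical structure, for illustration only)."""
    title: str
    page_views: int        # traffic over some period
    incoming_links: int    # links from other articles of the same wiki
    interwiki_links: int   # number of other language editions with this article


def rank_positions(articles, key):
    """Map each title to its position (0 = best) when sorted by `key`, descending."""
    ordered = sorted(articles, key=key, reverse=True)
    return {a.title: position for position, a in enumerate(ordered)}


def select_articles(articles, limit, blacklist=frozenset()):
    """Return up to `limit` titles with the lowest sum of ranks.

    Assumption: the three counters are weighted equally by summing their
    rank positions; a real selection could weight them differently.
    """
    candidates = [a for a in articles if a.title not in blacklist]
    ranks = [
        rank_positions(candidates, key=lambda a: a.page_views),
        rank_positions(candidates, key=lambda a: a.incoming_links),
        rank_positions(candidates, key=lambda a: a.interwiki_links),
    ]

    def combined(article):
        return sum(rank[article.title] for rank in ranks)

    # Sort by combined rank; break ties alphabetically for a stable result.
    best = sorted(candidates, key=lambda a: (combined(a), a.title))
    return [a.title for a in best[:limit]]


if __name__ == "__main__":
    sample = [
        ArticleStats("A", page_views=100, incoming_links=50, interwiki_links=30),
        ArticleStats("B", page_views=10, incoming_links=5, interwiki_links=2),
        ArticleStats("C", page_views=50, incoming_links=60, interwiki_links=40),
        ArticleStats("D", page_views=1, incoming_links=1, interwiki_links=1),
    ]
    print(select_articles(sample, limit=2))
    print(select_articles(sample, limit=2, blacklist={"C"}))
```

Summing ranks rather than raw counts avoids having to normalise counters with very different scales, and a hand-made blacklist can be applied before ranking without changing the rest of the selection.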
Emmanuel