On Wed 02/12/09 21:32, "Madeleine Price Ball" meprice(a)fas.harvard.edu wrote:
We think that the specific knowledge of the publishers should be how to
select the content - which content goes where in which form - and not
technical questions such as compression, storage or retrieving the data
on the user's end.
OK, if I shouldn't be talking to you guys, tell me who to talk to.
Yes, selecting content is very difficult. I couldn't get Peru or SJ to
contribute meaningfully to generating a simple blacklist of articles
that should NOT be included on the OLPC activity. (Recall it is being
given to young children!) I ended up making the blacklist myself based
on my own gut feelings. If Peru's board of education or OLPC's
"director of content" couldn't get their act together for this simple
task, expecting others to do this task for you will be a huge
roadblock to getting content out.
Traffic-based content selection is simple and effective, and it doesn't
involve a lot of opinions on what should or should not be included.
Yes. By combining such traffic stats with incoming-link and interwiki
counts, it is possible to quickly get a pretty accurate and "neutral"
selection for any Wikipedia.
I have scripts to do that automatically:
http://kiwix.svn.sourceforge.net/viewvc/kiwix/selection_tools/
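
To make the idea concrete, here is a minimal sketch in Python. It is not the code from the scripts linked above. The function names, the log-based weighting, and the equal default weights are assumptions chosen for illustration. It ranks articles by page views, incoming links, and interwiki links, and skips anything on a manually maintained blacklist.

```python
import math

def score(page_views, incoming_links, interwiki_links, weights=(1.0, 1.0, 1.0)):
    """Combine the three counters into one score.

    Assumption: each counter is log-scaled so that a single very large
    value (typically page views) does not drown out the other two.
    """
    w_views, w_links, w_iw = weights
    return (w_views * math.log1p(page_views)
            + w_links * math.log1p(incoming_links)
            + w_iw * math.log1p(interwiki_links))

def select_articles(stats, limit, blacklist=frozenset()):
    """Return the `limit` best-scoring titles that are not blacklisted.

    `stats` maps a title to (page_views, incoming_links, interwiki_links).
    Ties are broken alphabetically so the result is deterministic.
    """
    candidates = [
        (title, score(*counts))
        for title, counts in stats.items()
        if title not in blacklist
    ]
    candidates.sort(key=lambda item: (-item[1], item[0]))
    return [title for title, _ in candidates[:limit]]

# Made-up sample numbers, for illustration only.
sample_stats = {
    "A": (1000, 50, 20),
    "B": (10, 1, 0),
    "C": (500, 500, 100),
    "D": (100000, 1000, 200),
}
top_two = select_articles(sample_stats, 2)
top_two_filtered = select_articles(sample_stats, 2, blacklist={"D"})
```

The blacklist stays a separate, human-maintained input, so the ranking itself needs no editorial opinions, and the weights can be tuned per Wikipedia if one counter turns out to be noisy.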
Emmanuel