Hi all,
I've been tinkering with a tool for importing Project
Gutenberg books into Wikisource:
http://tools.wmflabs.org/pg2ws/
The idea is that, if Wikidata makes a link between a PG ID number and a
Wikisource Index page, then we can go through that Index page one page
at a time, and copy the page's text from the PG book to the WS page.
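To make the page-by-page idea concrete, here is a minimal sketch in Python. It is not the tool's actual code: build_query and page_titles are hypothetical helper names, the Wikidata property IDs (P2034 for the Project Gutenberg ebook ID, P1957 for the Wikisource index page URL) are assumptions that should be checked against Wikidata, and the page titles assume the usual "Index:Foo.djvu" / "Page:Foo.djvu/N" naming.

```python
def build_query(pg_id: str) -> str:
    """Build a SPARQL query for items that have both a PG ID and an Index page.

    Assumption: P2034 = Project Gutenberg ebook ID,
                P1957 = Wikisource index page URL.
    """
    return (
        "SELECT ?item ?index WHERE { "
        f'?item wdt:P2034 "{pg_id}" ; '
        "wdt:P1957 ?index . }"
    )


def page_titles(index_title: str, page_count: int) -> list[str]:
    """List the Page: titles that belong to an Index: page, in order."""
    prefix = "Index:"
    if not index_title.startswith(prefix):
        raise ValueError(f"not an Index page title: {index_title!r}")
    base = index_title[len(prefix):]
    # Pages are numbered from 1, one per scan image.
    return [f"Page:{base}/{n}" for n in range(1, page_count + 1)]


if __name__ == "__main__":
    print(build_query("1342"))
    for title in page_titles("Index:Example.djvu", 3):
        print(title)
```

The query is only built here, not sent anywhere, so nothing in the sketch touches the network.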
The interface so far isn't very brilliant, but I'm just trying to figure
out whether this is worthwhile. Basically, you select the right chunk
of text in the right-most text box (the full PG text) and hit the
button to move it left into the centre box. Then you clean it up
(manually and with the magic cleaning button) to make it match the
page image, and upload it to Wikisource.
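As a rough illustration of the kind of cleanup involved (an assumption about what is useful, not a description of what the cleaning button really does): PG plain text is hard-wrapped, so one obvious step is to rejoin the wrapped lines inside each paragraph while keeping the blank lines between paragraphs. unwrap_paragraphs is a hypothetical name.

```python
import re


def unwrap_paragraphs(text: str) -> str:
    """Join hard-wrapped lines within each paragraph into a single line.

    Paragraphs are taken to be separated by one or more blank lines.
    """
    paragraphs = re.split(r"\n\s*\n", text.strip())
    # Collapse every run of whitespace (including newlines) to one space.
    return "\n\n".join(" ".join(p.split()) for p in paragraphs)


if __name__ == "__main__":
    sample = "It was a\ndark night.\n\nThe end\ncame."
    print(unwrap_paragraphs(sample))
```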
It's a bad tool though, because it doesn't handle the running header,
and the copy-across button doesn't do nice things with {{hws}} etc.,
not to mention all the other things it doesn't do.
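For the {{hws}} case, this is roughly what the copy-across step would need to produce when a word is split across two pages. It is only a sketch: hyphenated_word_templates is a hypothetical helper, and it assumes the whole word is simply the two halves joined with no hyphen kept (a genuinely hyphenated compound would need the hyphen retained).

```python
def hyphenated_word_templates(start: str, end: str) -> tuple[str, str]:
    """Return the ({{hws}}, {{hwe}}) pair for a word split across two pages.

    start: the part at the bottom of the first page, without its hyphen
    end:   the part at the top of the following page
    """
    # Assumption: the joined word has no hyphen of its own.
    whole = start + end
    hws = "{{hws|" + start + "|" + whole + "}}"
    hwe = "{{hwe|" + end + "|" + whole + "}}"
    return hws, hwe


if __name__ == "__main__":
    first_page, second_page = hyphenated_word_templates("hyphen", "ated")
    print(first_page)
    print(second_page)
```

The first template goes at the end of the earlier page and the second at the start of the later one.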
Anyway, just thought I'd mention it. :-) Anyone think this is an avenue
worth exploring? Certainly I'd love to be able to say we've got
everything PG has /and more/!
—Sam
PS: changes made by this tool are all tagged "OAuth CID: 638":
https://en.wikisource.org/w/index.php?title=Special:RecentChanges&tagfi…