When featuring the script, don't limit it to just excluding en.wikipedia.org-- in fact, filter out any mention of "wikipedia". And yes, it's good that you're setting it so that it'll alert you instead of immediately removing the text, since Google can provide false positives.
On 11/23/06, Chris Picone ccool2ax@gmail.com wrote:
I've been seeing a rise in in-article copyvios. Last night I got one in [[Content managment system]]. I know that only some paragrahs have these copyvios, and not entire articles, so complete rewirtes aren't necessary. Thus, I'm attempting to write a script that (a) opens tabs with "Special:Random" on them (b) select the first setence from each paragraph (line break) (c) Google the sentence (d) If there are any exact matches not from en.wikipedia.org, put up a little message for me to check and remove the copyvio. (e) repeat.
Problem is, all I know is Applescript. If any of you Perl or pywikipedia or AWB-types have another way of writing this, can someone write it so the general community can use it to remove copyvios? (or is this possible with AWB?)
Chris (Ccool2ax) _______________________________________________ WikiEN-l mailing list WikiEN-l@Wikipedia.org To unsubscribe from this mailing list, visit: http://mail.wikipedia.org/mailman/listinfo/wikien-l