1) Yes the source code is available. User:Eran has posted it here https://github.com/valhallasw/plagiabot
2) This bot ONLY works on new edits within a couple of hours of them occurring. This reducing the number of false positives. It DOES NOT look at old edits.
3) This requires human follow up and common sense. One needs to make sure that a) the source is not PD/CCBYSA b) that it is not wiki text that has been moved around c) that the authors of both are not the same, etc
4) True positive rate is around 50% which is from my perspective good / useful. This bot has flagged a lot of copyright issues would have been missed otherwise.
Thanks James
Just out of curiosity, the other day I found two articles with a long section with identical wording, only names and numbers had been changed. Example: The town of ....... has a population of ...... . The town is know for its challenges in fighting poverty. According to local authorities, trhey have undertaken housing and sanitation projects bla bla bla.
When I queried it, the author of the earlier article responded to say that 'it was acceptable' so that beginners could find it easier to start writing articles. From that I dug deeper and discovered that he had tutored the writer of the derived article.
Regards, and a great weekend,
Rui
2015-04-04 3:49 GMT+02:00 James Heilman jmh649@gmail.com:
- Yes the source code is available. User:Eran has posted it here
https://github.com/valhallasw/plagiabot
- This bot ONLY works on new edits within a couple of hours of them
occurring. This reducing the number of false positives. It DOES NOT look at old edits.
- This requires human follow up and common sense. One needs to make sure
that a) the source is not PD/CCBYSA b) that it is not wiki text that has been moved around c) that the authors of both are not the same, etc
- True positive rate is around 50% which is from my perspective good /
useful. This bot has flagged a lot of copyright issues would have been missed otherwise.
-- James Heilman MD, CCFP-EM, Wikipedian
The Wikipedia Open Textbook of Medicine www.opentextbookofmedicine.com _______________________________________________ Wikimedia-l mailing list, guidelines at: https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines Wikimedia-l@lists.wikimedia.org Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, mailto:wikimedia-l-request@lists.wikimedia.org?subject=unsubscribe
wikimedia-l@lists.wikimedia.org