It should be relatively easy to catch a significant percentage of those
copyright violations with the assistance of automated search tools. The
trick is to do it at a large scale in near real time, which might require
some computationally and bandwidth-intensive work. James, can I
suggest that you take this discussion to Wiki-Research-l? There are a
number of ways that the copyright violation problem could be addressed and
I think this would be a good subject for discussion on that list, or at
Wikimania. Depending on how the discussion on Research goes, it might be
good to invite some dev or tech ops people to participate in the discussion
as well.
Pine
On Sun, Jul 20, 2014 at 7:05 PM, Leigh Thelmadatter <osamadre(a)hotmail.com>
wrote:
This is one of the best ideas I've read on here!
Date: Sun, 20 Jul 2014 20:00:28 -0600
From: jmh649(a)gmail.com
To: wikimedia-l(a)lists.wikimedia.org; eloquence(a)gmail.com;
fschulenburg(a)wikimedia.org; ladsgroup(a)gmail.com; jorlowitz(a)gmail.com;
madman.enwiki(a)gmail.com; west.andrew.g(a)gmail.com
Subject: [Wikimedia-l] Catching copy and pasting early
Came across another few thousand edits of copy and paste violations again
today. These have occurred over more than a year. It is wearing me out.
Really, what is the point of collaborating on Wikipedia if it is simply a
copyright violation? We need a solution, and one was proposed here a
couple of years ago:
https://en.wikipedia.org/wiki/Wikipedia:Turnitin
We now need programmers to carry it out. The Wiki Education Foundation has
expressed interest. We will need support from the foundation as this
software will likely need to mesh closely with edits as they come in. I am
willing to offer $5,000 Canadian (almost the same as American) for
a working solution that tags potential copyright issues in near real time
with greater than 90% accuracy. It is to function on at least all medical
and pharmacology articles, but I would not complain if it worked on all of
Wikipedia. The WMF is free to apply.
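
For illustration only, here is a minimal sketch of the kind of check such a
tool might run on each incoming edit: compare the text an edit adds against a
candidate source and measure how many of its word 5-grams also appear there.
This is not the Turnitin proposal or any existing bot. The function names, the
5-word window, and the 0.5 threshold are all assumptions, and finding the
candidate source in the first place (the expensive search step) is left out.

```python
import re


def shingles(text, n=5):
    """Return the set of lowercase word n-grams (shingles) in text."""
    words = re.findall(r"\w+", text.lower())
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def containment(added_text, source_text, n=5):
    """Fraction of the edit's shingles that also occur in the source."""
    added = shingles(added_text, n)
    if not added:
        # Edits shorter than n words produce no shingles and are not scored.
        return 0.0
    return len(added & shingles(source_text, n)) / len(added)


def looks_copied(added_text, source_text, threshold=0.5, n=5):
    """Flag the edit when enough of it overlaps the source (threshold is a guess)."""
    return containment(added_text, source_text, n) >= threshold
```

An edit whose added text is lifted word for word from the source scores close
to 1, while unrelated text scores close to 0. Whether any fixed threshold would
reach the 90% accuracy asked for above is an open question that would need
testing against real edits.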
--
James Heilman
MD, CCFP-EM, Wikipedian
The Wikipedia Open Textbook of Medicine
www.opentextbookofmedicine.com
_______________________________________________
Wikimedia-l mailing list, guidelines at:
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
Wikimedia-l(a)lists.wikimedia.org
Unsubscribe:
https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
<mailto:wikimedia-l-request@lists.wikimedia.org?subject=unsubscribe>