[Wikipedia-l] Automatically checking for copyright violations

Mark Williamson node.ue at gmail.com
Mon Jun 20 21:11:06 UTC 2005


...and it would also flag every single page in Wikipedia, because they
can also be found in absoluteastronomy, etc.

If you're talking about new pages only, it might be OK but it depends
how long search strings are -- is it looking for 7 words in a row that
are identical, 10 words, or 50?

Mark

On 20/06/05, Angela <beesley at gmail.com> wrote:
> The message below was sent to the Board today. Would implementing some
> sort of automatic copyvio checker be feasible?
> 
> The second part of the email suggests it is too difficult to contact
> us about copyright violations. With the addition of the "contact us"
> link in the sidebar, I thought this would stop being a problem. Is
> there any other way of making it easier?
> 
> Angela.
> 
> ---- Forwarded message ----
> 
> > In regards to the continuing copyright issues because some members do
> > not
> > respect copyrights, I might recommend implementing something like what
> > http://copyscape.com uses.  From what I can tell, they use a Google API
> > to do a
> > search of text found in one page to see what other pages have the same
> > text.
> > Using a similar methodology, you could flag new pages that are
> > substantially
> > like pages that exist on the Internet for further review.   While this
> > wouldn't
> > tackle all of the copyright violations, it would go a long way towards
> > making
> > it easier to weed out blatant violations like the one I reported.
> >
> > The issue of some individuals having absolutely no respect for
> > copyrights and
> > plagiarism is a serious problem that Wikipedia needs to address.  Some
> > people
> > seem to think that Wikipedia is their personal means of bringing down
> > copyright
> > laws and "freeing" content.  This is a shame because these individuals
> > threaten
> > the long term possibilities for Wikipedia.
> >
> > On a related note, it should be easier to report copyright violations on
> > the
> > Wikipedia website.  The current set up is tremendously burdensome to
> > figure how
> > to report copyright violations.   There needs to be a simple link from
> > all
> > pages to a simple contact form that allows one to report a violation
> > without
> > having any knowledge of how Wikipedia works.  Doing this would put
> > members on
> > notice that Wikipedia isn't a rogue operation where anything goes and
> > that it
> > takes copyright issues seriously.
> >
> _______________________________________________
> Wikipedia-l mailing list
> Wikipedia-l at Wikimedia.org
> http://mail.wikipedia.org/mailman/listinfo/wikipedia-l
> 


-- 
SI HOC LEGERE SCIS NIMIVM ERVDITIONIS HABES
QVANTVM MATERIAE MATERIETVR MARMOTA MONAX SI MARMOTA MONAX MATERIAM
POSSIT MATERIARI
ESTNE VOLVMEN IN TOGA AN SOLVM TIBI LIBET ME VIDERE



More information about the Wikipedia-l mailing list