[Foundation-l] Wikipedia meets git

Luca de Alfaro luca at dealfaro.org
Sun Oct 18 00:26:58 UTC 2009


Whoops, sorry, due to a glitch of a DNS setting, the demo on
pt.wikipedia.org will be up later today or tomorrow.  The demo on
it.wikipedia.org is up now.

Luca

On Sat, Oct 17, 2009 at 4:48 PM, Luca de Alfaro <luca at dealfaro.org> wrote:

> Dear James,
>
> you are absolutely right that we were lacking demos: we worked flat out to
> produce some, and if you visit http://wikitrust.soe.ucsc.edu/ , you can
> see that there are now a couple of Wikipedias on which you can try this.
>
> We wrote our own text analysis engine.  The reason is that the typical diff
> algorithms you find in git, svn, etc, are very fragile for the analysis of
> wiki text:
>
>    - They are typically not able to deal with text reordering.  If you
>    swap the order of two paragraphs, it will look to them as if you inserted
>    one of the two paragraphs.  We wanted to be able to trace text across block
>    moves.
>    - They typically analyze text across the two last revisions only.  We
>    wanted to be able to remember which text used to be present, and has
>    subsequently been deleted, so that if the text is later reinserted, we can
>    still correctly attribute it to the original author.   Otherwise, if I want
>    to look like the author of text, I can simply delete (or replace) the
>    content of a page, do a few quick-fire edits to confuse the system, and then
>    reinsert the content with some minor changes.
>
> We took a lot of pain to make sure that the text attribution system works
> in a robust way with respect to these kind of phenomena.   I am sure it is
> not perfect yet, and we welcome all feedback.
>
> Luca
>
>
> On Fri, Oct 16, 2009 at 5:17 AM, jamesmikedupont at googlemail.com <
> jamesmikedupont at googlemail.com> wrote:
>
>> On Fri, Oct 16, 2009 at 2:08 PM, Gerard Meijssen
>> <gerard.meijssen at gmail.com> wrote:
>> > Hoi,
>> > After a minute of googling I find http://wikitrust.soe.ucsc.edu/home ..
>> I am
>> > sure it is there for you as well.
>>
>>
>> Yes the page is there, it seems to be a good idea.
>>
>> only I am missing some html pages so that we can see what it looks
>> like, a wordlevel blame.
>> the colorized pages are missing.
>>
>> On this page: http://wikitrust.soe.ucsc.edu/home
>> it says : "In the meantime, you can look at our list of colored pages,
>> or look at screenshots of English Wikipedia pages analyzed by
>> WikiTrust. " and the colored pages link to
>> http://wikitrust.soe.ucsc.edu/index.php/Colored_pages which are
>> missing....
>>
>> mike
>>
>> _______________________________________________
>> foundation-l mailing list
>> foundation-l at lists.wikimedia.org
>> Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/foundation-l
>>
>
>


More information about the foundation-l mailing list