Hello, all.
After receiving the feedback from our presentation of the status of Flagged-Revision study in Wikimania 2010, we now need some help from the community to finalize this challenging project.
Our purpose is to compare the patters we have identified for vandalism revertion in GE Wikipedia with 2 other communities also using Flagged-Revisions, namely Polish and Russian Wikipedias.
So, we would need:
* If there exist a list of well-known tags added to the comments field whenever a revert is performed in any of these 2 editions.
* If so, a list of matching words or tags we should look for using regexps to identify such reverts.
* Any other useful hints (like tags that show up frequently but whose meaning could change depending on subsequent words).
Any comment tag identifying a revert is useful, but we're mainly focused on vandalism revert.
That's also why we are not currently using the MD5 hash approach (we do need to differentiate among different types of reverts, not only detect them).
Please, forward this message to any relevant mailing list where local wikimedians contributing to these languages could help us.
You can answer to the list, or directly to this email account. Thanks in advance.
Felipe.
Hi;
2010/8/20 Felipe Ortega glimmer_phoenix@yahoo.es
That's also why we are not currently using the MD5 hash approach (we do need to differentiate among different types of reverts, not only detect them).
But you can use MD5 to identify *all* the reverts, and then classify them with their comments.
Regards, emijrp
--- El vie, 20/8/10, emijrp emijrp@gmail.com escribió:
De: emijrp emijrp@gmail.com Asunto: Re: [Wiki-research-l] Polish and Russian vandalism revert tags Para: "Research into Wikimedia content and communities" wiki-research-l@lists.wikimedia.org Fecha: viernes, 20 de agosto, 2010 18:22
Hi;
2010/8/20 Felipe Ortega glimmer_phoenix@yahoo.es
That's also why we are not currently using the MD5 hash approach (we do need to differentiate among different types of reverts, not only detect them).
But you can use MD5 to identify *all* the reverts, and then classify them with their comments.
Sure, and I think MD5 is a pretty efficient way of identify reverts. The point is that, once we start looking at comments, it's better to go ahead with them, since in theory we don't need to check against MD5 if we missed some revert.
However, we will have to compare MD5 and comments procedures. Comments are considered a fairly conservative approach, though a good proxy, and that's why we selected them.
F.
Regards, emijrp
-----Adjunto en línea a continuación-----
_______________________________________________ Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
Also, you can ask in the village pumps in PL and RU. You will get replies from the recent changes patrollers.
2010/8/20 Felipe Ortega glimmer_phoenix@yahoo.es
--- El *vie, 20/8/10, emijrp emijrp@gmail.com* escribió:
De: emijrp emijrp@gmail.com Asunto: Re: [Wiki-research-l] Polish and Russian vandalism revert tags Para: "Research into Wikimedia content and communities" < wiki-research-l@lists.wikimedia.org> Fecha: viernes, 20 de agosto, 2010 18:22
Hi;
2010/8/20 Felipe Ortega <glimmer_phoenix@yahoo.eshttp://mc/compose?to=glimmer_phoenix@yahoo.es
That's also why we are not currently using the MD5 hash approach (we do need to differentiate among different types of reverts, not only detect them).
But you can use MD5 to identify *all* the reverts, and then classify them with their comments.
Sure, and I think MD5 is a pretty efficient way of identify reverts. The point is that, once we start looking at comments, it's better to go ahead with them, since in theory we don't need to check against MD5 if we missed some revert.
However, we will have to compare MD5 and comments procedures. Comments are considered a fairly conservative approach, though a good proxy, and that's why we selected them.
F.
Regards, emijrp
-----Adjunto en línea a continuación-----
Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.orghttp://mc/compose?to=Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
--- El sáb, 21/8/10, emijrp emijrp@gmail.com escribió:
De: emijrp emijrp@gmail.com Asunto: Re: [Wiki-research-l] Polish and Russian vandalism revert tags Para: "Research into Wikimedia content and communities" wiki-research-l@lists.wikimedia.org Fecha: sábado, 21 de agosto, 2010 12:07
Also, you can ask in the village pumps in PL and RU. You will get replies from the recent changes patrollers.
Will do, thanks for the tip! :-)
F.
2010/8/20 Felipe Ortega glimmer_phoenix@yahoo.es
--- El vie, 20/8/10, emijrp emijrp@gmail.com escribió:
De: emijrp emijrp@gmail.com Asunto: Re: [Wiki-research-l] Polish and Russian vandalism revert tags Para: "Research into Wikimedia content and communities" wiki-research-l@lists.wikimedia.org
Fecha: viernes, 20 de agosto, 2010 18:22
Hi;
2010/8/20 Felipe Ortega glimmer_phoenix@yahoo.es
That's also why we are not currently using the MD5 hash approach (we do need to differentiate among different types of reverts, not only detect them).
But you can use MD5 to identify *all* the reverts, and then classify them with their comments.
Sure, and I think MD5 is a pretty efficient way of identify reverts. The point is that, once we start looking at comments, it's better to go ahead with them, since in theory we don't need to check against MD5 if we missed some revert.
However, we will have to compare MD5 and comments procedures. Comments are considered a fairly conservative approach, though a good proxy, and that's why we selected them.
F.
Regards, emijrp
-----Adjunto en línea a continuación-----
_______________________________________________ Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
_______________________________________________
Wiki-research-l mailing list
Wiki-research-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
-----Adjunto en línea a continuación-----
_______________________________________________ Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
wiki-research-l@lists.wikimedia.org