On 29/12/2008, Brian <Brian.Mingus(a)colorado.edu> wrote:
Using logistic regression we achieve 83% precision at 77% recall with
our model. Compared to the rule-based methods that are currently
applied in Wikipedia, our approach increases the F-Measure performance
by 49% while being faster at the same time.
In my experience, and with reasonably expert knowledge of spam
fighting, these are not very good statistics. If they had achieved over
99% then I would have been impressed, and if they had done that with
even fewer false positives then I would have been thoroughly impressed.
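
To put the quoted figures in one number, here is a minimal sketch that
combines precision and recall into an F-measure. It assumes the balanced
F1 (harmonic mean) is what is meant, which the quoted abstract does not
state; the function name f_measure and the beta parameter are my own
choices for illustration.

```python
def f_measure(precision: float, recall: float, beta: float = 1.0) -> float:
    """Return the F-beta score; beta=1 is the harmonic mean of the two."""
    # Guard against division by zero when either rate is zero.
    if precision <= 0.0 or recall <= 0.0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Figures from the quoted abstract: 83% precision at 77% recall.
f1 = f_measure(0.83, 0.77)
print(f"F1 = {f1:.3f}")
```

Under that assumption the score comes out at about 0.80. Put another
way, 83% precision means roughly 17 of every 100 flagged edits are false
positives, and 77% recall means about 23 of every 100 bad edits get
through.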
And I don't consider it either-or. We should fight spammers of all
kinds with all techniques that work.
--
-Ian Woollard
We live in an imperfectly imperfect world. Life in a perfectly
imperfect world would be much better.