Re: [Wikitech-l] Re: [WikiEN-l] Ranking articles using machine-generated stats

5 Oct 2005

On 10/5/05, Neil Harris &lt;usenet(a)tonal.clara.co.uk&gt; wrote:
...
  Neil Harris wrote:
  The corpus-based measures are particularly
interesting; they mean we
 don't need to bug Google for a million search keys. 
 Although if anyone from Google is monitoring this list, and wants to
 give me a Google Account with 1.25M search keys, I'd be happy to set off
 the appropriate script... or send it to you to run. 
In any case, the number of results reported is a very approximate
estimate.  See, for example,
http://blog.outer-court.com/archive/2005-02-08.html#n72

I think it'd be much easier to use a standard measure of usefulness:
look at access logs on wikipedia's end.  If article A gets twice the
number of hits per day as article B, it would seem natural that
someone would be twice as likely to look it up in a paper-based
encyclopedia.  (There are certainly exceptions like hot news stories
or controversial topics during a revert war, but I think it'd take you
a long way...)

I like Neil's list too, but that, as they observed, is a lot more work.

-- Evan, monitoring this list :)

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

2003

2002

2001

Re: [Wikitech-l] Re: [WikiEN-l] Ranking articles using machine-generated stats