Hi all,
On 22 April 2010 23:31, Platonides Platonides@gmail.com wrote:
No. With the amount of traffic it has, space needs would be immense, and Wikimedia is not interested in logging all accesses.
I understand, but I think that they might be discarding relevant meta-information that would enrich Wikipedia. Also, a short sample (i.e. last week) would suffice for many exploratory works.
You may get a sampled feed for processing after contacting the foundation.
Can you tell me what is the best way to "contact the foundation" ?
How does knowing the page from which they reached wikipedia help to estimate the document relevance?
I'm interested in information from all the web. About document relevance - anchor information is a very valuable signal in web information retrieval. With referral information it would be possible to extract this and link it to the corresponding article. Also, by looking at referral data from Google, Bing or Yahoo, we could identify and use the query terms used to reach each article.
I think that there are some possibilities worth exploring here.
What if your referer was your facebook personal page leaking your full real name?
This is a valid concern, but is this a possible scenario? If so, it seems that this could be seen as a FB security breach - if a profile is private, its information should not be passed to others.
Thanks again for all feedback, -- Sérgio Nunes