Hi all,
On 22 April 2010 23:31, Platonides <Platonides(a)gmail.com> wrote:
No. With the amount of traffic it has, space needs would be immense, and
Wikimedia is not interested in logging all accesses.
I understand, but I think that they might be discarding relevant
meta-information that would enrich Wikipedia. Also, a short sample
(i.e. last week) would suffice for many exploratory works.
You may get a sampled feed for processing after
contacting the foundation.
Can you tell me what is the best way to "contact the foundation" ?
How does knowing the page from which they reached
wikipedia help to
estimate the document relevance?
I'm interested in information from all the web.
About document relevance - anchor information is a very valuable
signal in web information retrieval. With referral information it
would be possible to extract this and link it to the corresponding
article. Also, by looking at referral data from Google, Bing or Yahoo,
we could identify and use the query terms used to reach each article.
I think that there are some possibilities worth exploring here.
What if your referer was your facebook personal page
leaking your full
real name?
This is a valid concern, but is this a possible scenario?
If so, it seems that this could be seen as a FB security breach - if a
profile is private, its information should not be passed to others.
Thanks again for all feedback,
--
Sérgio Nunes