All,
I'm trying to gauge the value of my internal company Wiki, and I want to produce some usage statistics (primarily around pages being read, rather than edited). Typically, things like number of pages read today, or this week etc. Can anyone point me to where either the database structures are described, so I can create some offline queries myself ?
Also, anyone know if there is a way to get the Statistics page (Special:Statistics) to Exclude hits from the various indexers/spiders for my internal company Wiki - as currently I suspect the huge number of daily page reads are due to search engines... I think I asked the latter Question before - but got no replies...
Many thanks, Jon
----------------------------------------------------------- This e-mail was sent by GlaxoSmithKline Services Unlimited (registered in England and Wales No. 1047315), which is a member of the GlaxoSmithKline group of companies. The registered address of GlaxoSmithKline Services Unlimited is 980 Great West Road, Brentford, Middlesex TW8 9GS. -----------------------------------------------------------
Jon Bartlett wrote:
All,
I'm trying to gauge the value of my internal company Wiki, and I want to produce some usage statistics (primarily around pages being read, rather than edited). Typically, things like number of pages read today, or this week etc. Can anyone point me to where either the database structures are described, so I can create some offline queries myself ?
Also, anyone know if there is a way to get the Statistics page (Special:Statistics) to Exclude hits from the various indexers/spiders for my internal company Wiki - as currently I suspect the huge number of daily page reads are due to search engines... I think I asked the latter Question before - but got no replies...
Many thanks, Jon
The wiki doesn't store the reads. It can have a per page read count, but nothing more. What you can do is analyse your server access log to get the data you want.
Hi Platonides, I am also very interested in counting the number a particular page has been viewed on the corporate wiki I am administering, even if it is only few times. which code lines should be add to localssetings? Is there?
2009/11/25 Platonides Platonides@gmail.com
Jon Bartlett wrote:
All,
I'm trying to gauge the value of my internal company Wiki, and I want to
produce some usage statistics (primarily around pages being read, rather than edited). Typically, things like number of pages read today, or this week etc.
Can anyone point me to where either the database structures are
described, so I can create some offline queries myself ?
Also, anyone know if there is a way to get the Statistics page
(Special:Statistics) to Exclude hits from the various indexers/spiders for my internal company Wiki - as currently I suspect the huge number of daily page reads are due to search engines...
I think I asked the latter Question before - but got no replies...
Many thanks, Jon
The wiki doesn't store the reads. It can have a per page read count, but nothing more. What you can do is analyse your server access log to get the data you want.
MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
Nevio,
Platonides suggested looking at your web server access log. The following works for me:
[2864](root@ncq-wiki01)/usr/local/apache2/logs/Nov: grep 'GET.*a_page_name' 20091126_access.log | wc -l 8 [2865](root@ncq-wiki01)/usr/local/apache2/logs/Nov:
You'd need to script that to generalise it and identify all the pages [you need] - eliminating *.css, *.js, all flavours of images etc. is a start.
/Sam
-----Original Message----- From: mediawiki-l-bounces@lists.wikimedia.org [mailto:mediawiki-l-bounces@lists.wikimedia.org] On Behalf Of nevio carlos de alarcão Sent: 26 November 2009 10:50 To: MediaWiki announcements and site admin list Subject: Re: [Mediawiki-l] Usage stats questions
Hi Platonides, I am also very interested in counting the number a particular page has been viewed on the corporate wiki I am administering, even if it is only few times. which code lines should be add to localssetings? Is there?
2009/11/25 Platonides Platonides@gmail.com
Jon Bartlett wrote:
All,
I'm trying to gauge the value of my internal company Wiki, and I want to
produce some usage statistics (primarily around pages being read, rather than edited). Typically, things like number of pages read today, or this week etc.
Can anyone point me to where either the database structures are
described, so I can create some offline queries myself ?
Also, anyone know if there is a way to get the Statistics page
(Special:Statistics) to Exclude hits from the various indexers/spiders for my internal company Wiki - as currently I suspect the huge number of daily page reads are due to search engines...
I think I asked the latter Question before - but got no replies...
Many thanks, Jon
The wiki doesn't store the reads. It can have a per page read count, but nothing more. What you can do is analyse your server access log to get the data you want.
MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
-- {+}Nevinho Venha para o Movimento Colaborativo http://sextapoetica.com.br !! _______________________________________________ MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
This email was sent to you by Thomson Reuters, the global news and information company. Any views expressed in this message are those of the individual sender, except where the sender specifically states them to be the views of Thomson Reuters.
Hi Sam.Sexton, thanks for clarification. Is Thomson Reuters that news agency you work on? It got a internal wiki? Could you send an email to nevinho@bb.com.br for us to engage in a conversation? Thanks for attention, Nevio
2009/11/26 Sam.Sexton@thomsonreuters.com
Nevio,
Platonides suggested looking at your web server access log. The following works for me:
[2864](root@ncq-wiki01)/usr/local/apache2/logs/Nov: grep 'GET.*a_page_name' 20091126_access.log | wc -l 8 [2865](root@ncq-wiki01)/usr/local/apache2/logs/Nov:
You'd need to script that to generalise it and identify all the pages [you need] - eliminating *.css, *.js, all flavours of images etc. is a start.
/Sam
-----Original Message----- From: mediawiki-l-bounces@lists.wikimedia.org [mailto: mediawiki-l-bounces@lists.wikimedia.org] On Behalf Of nevio carlos de alarcão Sent: 26 November 2009 10:50 To: MediaWiki announcements and site admin list Subject: Re: [Mediawiki-l] Usage stats questions
Hi Platonides, I am also very interested in counting the number a particular page has been viewed on the corporate wiki I am administering, even if it is only few times. which code lines should be add to localssetings? Is there?
2009/11/25 Platonides Platonides@gmail.com
Jon Bartlett wrote:
All,
I'm trying to gauge the value of my internal company Wiki, and I want to
produce some usage statistics (primarily around pages being read, rather than edited). Typically, things like number of pages read today, or this week etc.
Can anyone point me to where either the database structures are
described, so I can create some offline queries myself ?
Also, anyone know if there is a way to get the Statistics page
(Special:Statistics) to Exclude hits from the various indexers/spiders for my internal company Wiki - as currently I suspect the huge number of daily page reads are due to search engines...
I think I asked the latter Question before - but got no replies...
Many thanks, Jon
The wiki doesn't store the reads. It can have a per page read count, but nothing more. What you can do is analyse your server access log to get the data you want.
MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
-- {+}Nevinho Venha para o Movimento Colaborativo http://sextapoetica.com.br !! _______________________________________________ MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
This email was sent to you by Thomson Reuters, the global news and information company. Any views expressed in this message are those of the individual sender, except where the sender specifically states them to be the views of Thomson Reuters.
MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
I can see in the Wiki db that there is a page_counter (in table 'page')- I can only assume this is a count of number of page 'hits'. It would be possible to automate queries that run daily/hourly etc to look at these values, and then interrogate them over time. However, I don't understand several of the fields in the MW database - what for example is the 'Page_latest', 'page_touched' and 'Page_latest' fields ? is there a db descriptions somewhere ?
Jon
-----Original Message----- From: mediawiki-l-bounces@lists.wikimedia.org [mailto:mediawiki-l-bounces@lists.wikimedia.org] On Behalf Of Sam.Sexton@thomsonreuters.com Sent: 26 November 2009 11:02 To: mediawiki-l@lists.wikimedia.org Subject: Re: [Mediawiki-l] Usage stats questions
Nevio,
Platonides suggested looking at your web server access log. The following works for me:
[2864](root@ncq-wiki01)/usr/local/apache2/logs/Nov: grep 'GET.*a_page_name' 20091126_access.log | wc -l 8 [2865](root@ncq-wiki01)/usr/local/apache2/logs/Nov:
You'd need to script that to generalise it and identify all the pages [you need] - eliminating *.css, *.js, all flavours of images etc. is a start.
/Sam
-----Original Message----- From: mediawiki-l-bounces@lists.wikimedia.org [mailto:mediawiki-l-bounces@lists.wikimedia.org] On Behalf Of nevio carlos de alarcão Sent: 26 November 2009 10:50 To: MediaWiki announcements and site admin list Subject: Re: [Mediawiki-l] Usage stats questions
Hi Platonides, I am also very interested in counting the number a particular page has been viewed on the corporate wiki I am administering, even if it is only few times. which code lines should be add to localssetings? Is there?
2009/11/25 Platonides Platonides@gmail.com
Jon Bartlett wrote:
All,
I'm trying to gauge the value of my internal company Wiki, and I want to
produce some usage statistics (primarily around pages being read, rather than edited). Typically, things like number of pages read today, or this week etc.
Can anyone point me to where either the database structures are
described, so I can create some offline queries myself ?
Also, anyone know if there is a way to get the Statistics page
(Special:Statistics) to Exclude hits from the various indexers/spiders for my internal company Wiki - as currently I suspect the huge number of daily page reads are due to search engines...
I think I asked the latter Question before - but got no replies...
Many thanks, Jon
The wiki doesn't store the reads. It can have a per page read count, but nothing more. What you can do is analyse your server access log to get the data you want.
MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
-- {+}Nevinho Venha para o Movimento Colaborativo http://sextapoetica.com.br !! _______________________________________________ MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
This email was sent to you by Thomson Reuters, the global news and information company. Any views expressed in this message are those of the individual sender, except where the sender specifically states them to be the views of Thomson Reuters.
_______________________________________________ MediaWiki-l mailing list MediaWiki-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
----------------------------------------------------------- This e-mail was sent by GlaxoSmithKline Services Unlimited (registered in England and Wales No. 1047315), which is a member of the GlaxoSmithKline group of companies. The registered address of GlaxoSmithKline Services Unlimited is 980 Great West Road, Brentford, Middlesex TW8 9GS. -----------------------------------------------------------
Jon Bartlett wrote:
I can see in the Wiki db that there is a page_counter (in table 'page')- I can only assume this is a count of number of page 'hits'.
Right. Note however that it will be undercounted if you use a caching proxy, like squid.
It would be possible to automate queries that run daily/hourly etc to look at these values, and then interrogate them over time.
Yes. You could. But looking at the server logs is probably easier.
However, I don't understand several of the fields in the MW database - what for example is the 'Page_latest', 'page_touched' and 'Page_latest' fields ? is there a db descriptions somewhere ?
Jon
id of the last revision, the timestamp of last time the page cache has been invalidated (copies previous to that timestamp are outdated).
The schema is described at http://svn.wikimedia.org/viewvc/mediawiki/trunk/phase3/maintenance/tables.sq...
There're also descriptions about it at www.mediawiki.org
mediawiki-l@lists.wikimedia.org