Is it possible to get an impression on how often a set of Wikidata items is being used?
I am interested in two ways of usage.
1. User statistics on a specific wikidata item. i.e. how often is a page visited and from where and how often does a wikidata item shows up in a query result?
2. How connected a Wikidata item is. Once an item is created how does it connect to other area's? Is everything connected, or are we actually looking at a collection of isolated data clusters? Is there a way to visualize this?
Any ideas on how to answer this?
Cheers,
Andra
Hi,
At least for the very first point, you have the usual stat tool: https://tools.wmflabs.org/pageviews/?project=wikidata.org&platform=all-a... for « how often is a page visited ».
Maybe the WDCM ( http://wdcm.wmflabs.org/ ) can help too but I never fully understood this tool.
Cdlt, ~nicolas
Le sam. 9 févr. 2019 à 10:25, Andra Waagmeester andra@micel.io a écrit :
Is it possible to get an impression on how often a set of Wikidata items is being used?
I am interested in two ways of usage.
- User statistics on a specific wikidata item. i.e. how often is a page
visited and from where and how often does a wikidata item shows up in a query result?
- How connected a Wikidata item is. Once an item is created how does it
connect to other area's? Is everything connected, or are we actually looking at a collection of isolated data clusters? Is there a way to visualize this?
Any ideas on how to answer this?
Cheers,
Andra
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Hi,
as of
- How connected a Wikidata item is. Once an item is created how does
it connect to other area's?
as Nicolas has pointed out the Wikidata Concepts Monitor (WDCM) system was built with the purpose to answer to this and similar questions.
All WDCM dashboards can be accessed from the appropriate section of http://wmdeanalytics.wmflabs.org (or http://wdcm.wmflabs.org/, the old domain).
Is everything connected, or are we actually looking at a collection of
isolated data clusters? Is there a way to visualize this?
I hope you understand how complex your question is. There are different statistical, analytical strategies that could be employed to provide an answer. The WDCM system relies on some of them. I would suggest learning how to work with the following data products in order to start investigating these questions:
- The WDCM Semantics dashboard http://wmdeanalytics.wmflabs.org/WDCM_SemanticsDashboard/ - The WDCM (S)itelinks dashboard http://wmdeanalytics.wmflabs.org/WDCM_SitelinksDashboard/ - The WDCM (T)itles dashboard http://wmdeanalytics.wmflabs.org/WDCM_TitlesDashboard/.
We use directed graphs, similarity maps and dendograms to visualize the structure of Wikidata item and Wikimedia projects linkage in respect to how the Wikidata items map (i.e. how they are re-used) across the Wikimedia universe.
The WDCM documentation page is on Wikitech https://wikitech.wikimedia.org/wiki/Wikidata_Concepts_Monitor but it needs an update badly because of a thorough re-engineering that started in December 2019 and is now getting close to completion.
In the meantime, I will be more than glad to answer to any questions that you might have. Note: a Google Hangouts session works better than tons of emails. Thank you for your understanding.
With best regards,
Goran S. Milovanović, PhD Data Scientist, Software Department Wikimedia Deutschland
------------------------------------------------ "It's not the size of the dog in the fight, it's the size of the fight in the dog." - Mark Twain ------------------------------------------------
On Sat, Feb 9, 2019 at 10:51 AM Nicolas VIGNERON vigneron.nicolas@gmail.com wrote:
Hi,
At least for the very first point, you have the usual stat tool: https://tools.wmflabs.org/pageviews/?project=wikidata.org&platform=all-a... for « how often is a page visited ».
Maybe the WDCM ( http://wdcm.wmflabs.org/ ) can help too but I never fully understood this tool.
Cdlt, ~nicolas
Le sam. 9 févr. 2019 à 10:25, Andra Waagmeester andra@micel.io a écrit :
Is it possible to get an impression on how often a set of Wikidata items is being used?
I am interested in two ways of usage.
- User statistics on a specific wikidata item. i.e. how often is a page
visited and from where and how often does a wikidata item shows up in a query result?
- How connected a Wikidata item is. Once an item is created how does it
connect to other area's? Is everything connected, or are we actually looking at a collection of isolated data clusters? Is there a way to visualize this?
Any ideas on how to answer this?
Cheers,
Andra
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
On Mon, Feb 11, 2019 at 11:18 AM Goran Milovanovic goran.milovanovic_ext@wikimedia.de wrote:
Hi,
as of
- How connected a Wikidata item is. Once an item is created how does it connect to other area's?
as Nicolas has pointed out the Wikidata Concepts Monitor (WDCM) system was built with the purpose to answer to this and similar questions.
All WDCM dashboards can be accessed from the appropriate section of http://wmdeanalytics.wmflabs.org (or http://wdcm.wmflabs.org/, the old domain).
Is everything connected, or are we actually looking at a collection of isolated data clusters? Is there a way to visualize this?
I hope you understand how complex your question is. There are different statistical, analytical strategies that could be employed to provide an answer. The WDCM system relies on some of them. I would suggest learning how to work with the following data products in order to start investigating these questions:
- The WDCM Semantics dashboard
- The WDCM (S)itelinks dashboard
- The WDCM (T)itles dashboard.
We use directed graphs, similarity maps and dendograms to visualize the structure of Wikidata item and Wikimedia projects linkage in respect to how the Wikidata items map (i.e. how they are re-used) across the Wikimedia universe.
The WDCM documentation page is on Wikitech but it needs an update badly because of a thorough re-engineering that started in December 2019 and is now getting close to completion.
In the meantime, I will be more than glad to answer to any questions that you might have. Note: a Google Hangouts session works better than tons of emails. Thank you for your understanding.
That however only captures usage inside Wikimedia projects. It does not cover the query service, usage of the API, Linked data endpoint, dumps etc. I believe Andra is after that as well and we have no way to track that.
Cheers Lydia