Hello All!
Could anyone one give us some guidance on when to use a bot as opposed to, perhaps, a script fired off from a client box via cron or other?
Context: this is the PhiloBiblon project https://database.factgrid.de/wiki/Item:Q394169, and we are using FactGrid https://database.factgrid.de/wiki/Main_Page.
The use case I have in mind is integrity checking for certain constraints that our legacy data platform enforced but that are not directly enforceable in our current setup. For examples:
- Detecting when our external identifier https://database.factgrid.de/wiki/Property:P476 has an illegal value (which has sometimes happened and has been difficult to detect by eye) - Detecting when one of our objects (i.e. one that is marked with that external identifier) has an item-valued property that is filled with an object of the wrong type
I think it would give us some comfort to know that such integrity violations are detected. We would probably like to add more constraints as we go along. We might extend that code eventually to try to automatically fix such things, but for now, just detecting them would be great.
Does a bot make sense for that use case?
Thanks,
Max Ziff
Hi Max,
(not an expert, but to provide you with some feedback).
We have the same challenges with our Wikibase instance. We did a lot with the Quality Constraints extension ( https://www.mediawiki.org/wiki/Extension:Wikibase_Quality_Extensions). This requires a learning curve and setup on your local system. Your first case might actually still fit into the possibilities of your extensions.
For some cases, we developed our own extension (done by Professional Wiki: https://www.wikibase.consulting/automating-values-in-wikibase/). Highly recommend looking into that tool of course.
You could also do a lot with SPARQL and regular expressions, although that is another learning experience. Your second question would be suitable for this I assume.
Finally, we are looking into ShapeExpressions, but that does not really help to check whether the values of your item confirm a predetermined structure.
So what I am trying to say, there are lots of possibilities to validate the data within your instance. Clearly define what you want to check provides better way to find which solution might fit your need.
Does this help?
Best,
Maarten
On Mon, Aug 1, 2022 at 8:54 PM Donald Ziff daziff@berkeley.edu wrote:
Hello All!
Could anyone one give us some guidance on when to use a bot as opposed to, perhaps, a script fired off from a client box via cron or other?
Context: this is the PhiloBiblon project https://database.factgrid.de/wiki/Item:Q394169, and we are using FactGrid https://database.factgrid.de/wiki/Main_Page.
The use case I have in mind is integrity checking for certain constraints that our legacy data platform enforced but that are not directly enforceable in our current setup. For examples:
- Detecting when our external identifier
https://database.factgrid.de/wiki/Property:P476 has an illegal value (which has sometimes happened and has been difficult to detect by eye)
- Detecting when one of our objects (i.e. one that is marked with that
external identifier) has an item-valued property that is filled with an object of the wrong type
I think it would give us some comfort to know that such integrity violations are detected. We would probably like to add more constraints as we go along. We might extend that code eventually to try to automatically fix such things, but for now, just detecting them would be great.
Does a bot make sense for that use case?
Thanks,
Max Ziff
Wikibaseug mailing list -- wikibaseug@lists.wikimedia.org To unsubscribe send an email to wikibaseug-leave@lists.wikimedia.org
wikibaseug@lists.wikimedia.org