Hi all,
I have a question about the SPARQL endpoint running on https://query.wikidata.org for which I would like to ask for your help.
I am currently running an experiment to find out how many Wikidata entries refer to identifiers in our dataset (i.e. using property P727), but the results include entries that have apparently been deleted or deprecated (e.g. http://www.wikidata.org/entity/Q18573617). Is there a way to detect them using SPARQL, perhaps through some meta-property or some information in a statement? Or is it simply that the endpoint is not in sync with the main repository?
Thank you in advance!
Kind regards,
Hugo Manguinhas
Technical R&D Coordinator
Hi!
> I am currently running an experiment to figure out how many Wikidata entries refer to identifiers in our dataset (i.e. using property P727) but I am receiving in the results entries that have apparently been deleted/deprecated (e.g. http://www.wikidata.org/entity/Q18573617)... is
Could you send me the query and the items you see that are wrong?
> there a way to detect them using SPARQL, perhaps some meta-property or some information in a statement, or is it simply because the endpoint is not in sync with the main repo.
Short answer - unfortunately, no. Longer answer in https://phabricator.wikimedia.org/T128947#2104017
On 18-3-2016 at 19:30, Stas Malyshev wrote:
> Short answer - unfortunately, no. Longer answer in https://phabricator.wikimedia.org/T128947#2104017
That ticket is a restricted task, so not everybody can see the longer answer.
Mbch331
Hi Stas, Mbch331,
Thanks both for your help!
Is there an alternative to SPARQL that can either check for deleted records or return only non-deleted ones? Currently, I check whether each record is deleted by "pinging" it with an HTTP HEAD request, but this takes a huge amount of time since I need to check about 70k resources.
By the way, the SPARQL query that I am running is this one: SELECT ?wdt ?cho WHERE { ?wdt wdt:P727 ?cho }
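For illustration, the per-resource HEAD check described above could be replaced by batched lookups. The following is only a rough sketch, not something confirmed in this thread: it assumes the MediaWiki API module wbgetentities on www.wikidata.org accepts up to 50 ids per request for regular clients and flags a deleted or non-existent item with a "missing" key. The function names, the User-Agent string and the batch size constant are all made up for the example.

```python
import json
import re
from urllib.parse import urlencode
from urllib.request import Request, urlopen

API_URL = "https://www.wikidata.org/w/api.php"
BATCH_SIZE = 50  # assumption: per-request id limit for regular (non-bot) clients


def qids_from_sparql_json(results):
    """Collect Q-ids from the ?wdt column of a SPARQL JSON result document."""
    qids = []
    for row in results["results"]["bindings"]:
        match = re.search(r"/entity/(Q\d+)$", row["wdt"]["value"])
        if match:
            qids.append(match.group(1))
    return qids


def batches(items, size=BATCH_SIZE):
    """Yield consecutive slices of at most `size` items."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def build_url(qids):
    """Build one wbgetentities request URL covering a whole batch of ids."""
    params = {
        "action": "wbgetentities",
        "ids": "|".join(qids),
        "props": "info",  # metadata only, keeps responses small
        "format": "json",
    }
    return API_URL + "?" + urlencode(params)


def missing_ids(api_response):
    """Return ids that the API response marks with a 'missing' key (assumed)."""
    entities = api_response.get("entities", {})
    return [qid for qid, entity in entities.items() if "missing" in entity]


def fetch_json(url):
    """Fetch a URL and decode the JSON body (performs a real network call)."""
    request = Request(url, headers={"User-Agent": "deleted-item-check/0.1 (example)"})
    with urlopen(request) as response:
        return json.load(response)


def find_deleted(qids, fetch=fetch_json):
    """Check all ids in batches; `fetch` can be swapped out for offline testing."""
    deleted = []
    for batch in batches(qids):
        deleted.extend(missing_ids(fetch(build_url(batch))))
    return deleted
```

With about 70k resources this would need roughly 1,400 requests instead of 70,000. Calling find_deleted(qids_from_sparql_json(result)) on the JSON output of the query above would then give the list of "bad" entries, provided the assumptions about the API hold.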
Thanks again!
Best regards, Hugo
On 18 March 2016 at 20:41, Mbch331 wrote (Subject: Re: [Wikidata] Recognizing deleted resource in the Wikidata SPARQL endpoint):
> That ticket is a restricted task, so not everybody can see the longer answer.
Hi!
> Is there an alternative to SPARQL that can either check for deleted records or give only non-deleted ones? currently, I am checking if they are deleted by making a "ping" using HTTP head requests but this takes a huge amount of time since I need to check about 70k resources.
Well, the SPARQL data store is not supposed to contain any deleted entries... But it looks like there's a bug there. If you give me the list of the "bad" entries, it's easy to update them. Considerably harder is finding out *why* they weren't updated in the first place. I'm still looking into it.