Hi Yusra,

There are several distinct issues here. If the Reactome ID property is used in an incorrect context, the statement needs to be curated, which means either deleted or updated. I am not sure that looking at the bot alone is sufficient, since others might create Wikidata items to reflect a Reactome record, and those items can then be picked up in subsequent bot runs.

A possible solution is to rely on a combination of SPARQL [1] and an EntitySchema [2]. In the past, I worked with the Reactome team to align Reactome with Wikidata. The EntitySchema was created based on the data from that collaboration.

One possible workflow: use SPARQL to find the Wikidata items that come from Reactome (by looking at the reference), then validate those items against the EntitySchema and correct any inconsistencies with that schema.
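
As a rough illustration of the SPARQL step, here is a minimal Python sketch that only builds the query text. P3937 is the Reactome ID property mentioned in this thread; the use of P248 ("stated in") as the reference property and Q2134522 as the Reactome item are assumptions on my side and should be checked before running anything. The function name build_query is made up for this example. The prefixes (p:, ps:, pr:, prov:, wd:) are the ones predefined by the Wikidata Query Service.

```python
# Property and item IDs. P3937 comes from the thread; the others are assumptions.
REACTOME_ID = "P3937"      # Reactome ID property
STATED_IN = "P248"         # "stated in", assumed to be the reference property used
REACTOME_QID = "Q2134522"  # assumed item for the Reactome database; verify first


def build_query(referenced: bool, limit: int = 100) -> str:
    """Return a SPARQL query for items carrying a Reactome ID statement.

    referenced=True  -> statements that cite Reactome as their source
    referenced=False -> statements that do not cite Reactome
    """
    reference = (
        "?statement prov:wasDerivedFrom ?ref . "
        f"?ref pr:{STATED_IN} wd:{REACTOME_QID} ."
    )
    if not referenced:
        # Keep only statements for which no such reference exists.
        reference = f"FILTER NOT EXISTS {{ {reference} }}"
    return (
        "SELECT DISTINCT ?item ?reactomeId WHERE {\n"
        f"  ?item p:{REACTOME_ID} ?statement .\n"
        f"  ?statement ps:{REACTOME_ID} ?reactomeId .\n"
        f"  {reference}\n"
        "}\n"
        f"LIMIT {limit}"
    )


print(build_query(referenced=True, limit=10))
```

The printed query can be pasted into the query service as-is. Validation against the EntitySchema is a separate step and is not covered by this sketch.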

Then look at the Wikidata items that have a Reactome ID property but lack the reference to Reactome, and fix them accordingly. If a statement is incorrect but has a proper reference, it needs a closer look: the issue may originate outside Wikidata and need to be dealt with at the primary source; otherwise the incorrect use of the Reactome ID will persist. If a Reactome ID statement comes without a reference, then it is safe to simply delete that statement as a curation step.
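
The decision logic in the paragraph above could be sketched roughly as follows. This is only my reading of it expressed as code, not an existing tool: the function name triage and the action labels are hypothetical, and treating "fix accordingly" as "add the missing reference" when the ID itself is right is an interpretation.

```python
def triage(has_reactome_reference: bool, is_correct: bool) -> str:
    """Suggest a curation action for one Reactome ID statement.

    has_reactome_reference: the statement cites Reactome as its source
    is_correct: the statement agrees with the Reactome record / schema
    """
    if is_correct:
        # Correct statements stay; unreferenced ones only need a reference.
        return "keep" if has_reactome_reference else "add-reference"
    if has_reactome_reference:
        # Wrong but properly referenced: the error may come from upstream,
        # so it should be checked at the primary source.
        return "review-at-source"
    # Wrong and unreferenced: safe to remove as a curation step.
    return "delete"


for referenced in (True, False):
    for correct in (True, False):
        print(referenced, correct, "->", triage(referenced, correct))
```

How is_correct is determined is left open here; in the workflow described above it would come from comparing the item with the EntitySchema.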

I hope this helps.

Cheers, 

Andra

[1] https://w.wiki/WaB
[2] https://www.wikidata.org/wiki/EntitySchema:E39

On Fri, Jul 10, 2020 at 11:13 AM Yusra Haider <yhaider@ebi.ac.uk> wrote:

Hiya,

 

The issue is that other items might be using that property (https://www.wikidata.org/wiki/Property:P3937) in a different context altogether, and we have a specific format (specific fields) in which we export Reactome data and then put that data on Wikidata. It doesn’t suit other content creators if their description, for example, is suddenly updated to our specific description format. This is why it would be helpful to have a unique way to identify items created by our bot.

 

I hope I’ve been able to explain?

 

Thanks for your help!

Warm regards,

Yusra Haider

 

From: Wikidata-tech <wikidata-tech-bounces@lists.wikimedia.org> on behalf of Addshore <addshorewiki@gmail.com>
Reply to: Wikidata technical discussion <wikidata-tech@lists.wikimedia.org>
Date: Thursday, 9 July 2020 at 17:46
To: Wikidata technical discussion <wikidata-tech@lists.wikimedia.org>
Subject: Re: [Wikidata-tech] Uniquely Identifying Items Created by Bot

 

Hi there!

 

Not in a "nice way"

I'm intrigued, why don't you want to change items that have not been created by the bot?

Surely if they have the correct ID then that's all okay?

 

Mind going into a bit more detail? :)

If not I can point you in the direction of an API that would let you know who created an item.
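
For illustration, one API that can answer "who created this item" is the MediaWiki Action API, by asking for the oldest revision of the item page. Whether this is the API meant above is an assumption. A minimal sketch that builds the request URL and reads the creator from a response (the helper names first_revision_url and creator_of are made up for this example, and no network call is made):

```python
from typing import Optional
from urllib.parse import urlencode

API = "https://www.wikidata.org/w/api.php"


def first_revision_url(qid: str) -> str:
    """Build an Action API URL that returns the oldest revision of an item."""
    params = {
        "action": "query",
        "prop": "revisions",
        "titles": qid,
        "rvlimit": 1,
        "rvdir": "newer",  # oldest revision first
        "rvprop": "user|timestamp",
        "format": "json",
        "formatversion": 2,
    }
    return f"{API}?{urlencode(params)}"


def creator_of(response: dict) -> Optional[str]:
    """Return the user name of the first revision, or None if absent."""
    for page in response.get("query", {}).get("pages", []):
        revisions = page.get("revisions", [])
        if revisions:
            return revisions[0].get("user")
    return None


print(first_revision_url("Q42"))
```

Comparing the returned user name with the bot account name would tell you whether the bot created the item.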

 

On Wed, 8 Jul 2020 at 17:58, Yusra Haider <yhaider@ebi.ac.uk> wrote:

Hi,

 

I hope this is an appropriate question for this mailing list.

 

I am a software developer at Reactome and am working on our bot that writes Reactome data to Wikidata.

 

Our bot currently uses the property https://www.wikidata.org/wiki/Property:P3937 (the Reactome ID property) to identify and update Reactome items.

 

We are running into the issue that this property is being used by other people and our bot inadvertently ends up changing items that have not been created by the Reactome bot.

 

Is there any way to uniquely identify the items created by our bot?

 

Thanks and warm regards,

Yusra Haider

 

_______________________________________________
Wikidata-tech mailing list
Wikidata-tech@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-tech
