Magnus is right on here.  We are hoping to begin things by establishing wikidata as a central hub for gene identifier lookup and matching.  (This is a big a problem in bioinformatics).  With this (hopefully fairly bit stable) starting point of wikidata entities, we will be expanding the statements attached to the genes to enable many different kinds of queries.  For example, "what genes that encode proteins that are located in the cell membrane are known to be related to alzheimer's disease?".  In particular we are hoping to build a network of relationships between genes, drugs, and diseases..  

So, for now identifier matching queries, tomorrow the world ;).

And more importantly..  Given these gene starting points in the wikidata entity space, we can do queries over whatever anyone else decides to hook up to them!  Go nuts! (but add your evidence please..)

-Ben


 



On Wed, Oct 8, 2014 at 7:47 AM, Magnus Manske <magnusmanske@googlemail.com> wrote:
For one thing, they could serve as an alternative, stable identifier.

You can link up different, external identifiers, such as Entrez and HGNC:
https://tools.wmflabs.org/wikidata-todo/beacon.php?prop=351&source=354

(you can probably do that elsewhere, but Wikidata should attract more ID systems over time).

You can look up the Entrez ID:
https://tools.wmflabs.org/wikidata-todo/autolist2.php?language=en&project=wikipedia&category=&depth=12&wdq=string%5B351%3A%22103753527%22%5D&mode=undefined&statementlist=&run=Run&label_contains=&label_contains_not=&chunk_size=10000

You can ask "which genes encode proteins that are involved in cell cycle checkpointing?":
https://tools.wmflabs.org/wikidata-todo/autolist2.php?language=en&project=wikipedia&category=&depth=12&wdq=claim%5B688%3A(claim%5B682%3A14817998%5D)%5D&mode=undefined&statementlist=&run=Run&label_contains=&label_contains_not=&chunk_size=10000

Once we get "typed quantities" (how's that coming along??), you can also query on chromosome/region.





On Wed, Oct 8, 2014 at 2:28 PM, Derric Atzrott <datzrott@alizeepathology.com> wrote:
This sounds important, but what sort of queries might someone
want to run on this data?  Despite working at a bio-lab, I
am actually not terribly familiar with the human genome and
therefore the types of questions having this data on Wikidata
can answer.

Does anyone have a few example queries (don't have to be
possible to execute right now) to show what this could
be used for?

Thank you,
Derric Atzrott



_______________________________________________
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


_______________________________________________
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l