That sounds very interesting. Would a use case be that on the creation of a new article in wikipedia to suggest an existing wikidata entry it might be connected to?

Am 25.11.2013 17:16 schrieb "Daniel Kinzler" <daniel.kinzler@wikimedia.de>:
Hello Nilesh!

Good to hear from you. I was off for a couple of days, and asked Lydia to make
introductions. Thanks Lydia!

A quick heads up:

The architecture we have discussed with the team at the HPI is a bit different
from what we designed for the GSoC project. The idea is to have a MediaWiki
extension that relies directly on the data in a MySQL table, and generates
suggestions from that. It does not care where the data comes from, so the
database table(s) server as an interface between the "front" (mediawiki) part
and the "back" (data analysis) part. This has two advantages: 1) front and back
are decoupled and only have to agree on the structure and interpretation of the
data in the database (this is the current TODO). 2) No new services need to be
deployed in the public-facing subnet.

I think your expertise with data ingestion could help the folks at the HPI quite
a bit. Also, the modular architecture allows for data analysis components to be
swapped out easily, and we would like to try and compare different approaches
for data analysis. One based on Hadoop and/or Myrrix could well be an option -
though I'm not sure whether Myrrix would be very useful, since the actual
generation of suggestions from the pre-processed data would already be covered.

This is just an idea, I think you can best figure things out among yourself.

Cheers,
Daniel

Am 25.11.2013 17:01, schrieb Lydia Pintscher:
> Hey everyone,
>
> I have the feeling it would be good to make an official introduction.
> Nilesh has been working on the Wikidata entity suggester. There is now
> a team of students who are working on the entity suggester to get it
> finished and ready for production as part of their bachelor project.
> It would be good if you could work together and coordinate on the
> public wikidata-tech list. I'm sure with you all working together we
> can provide the Wikidata community with the great entity suggester
> they are waiting for.
> Virginia and co: Are you still having issues with the data import?
> Maybe Nilesh can help you with that as a first good step.
>
>
> Cheers
> Lydia
>


_______________________________________________
Wikidata-tech mailing list
Wikidata-tech@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-tech