That sounds very interesting. Would a use case be that on the creation of a new article in wikipedia to suggest an existing wikidata entry it might be connected to? Am 25.11.2013 17:16 schrieb "Daniel Kinzler" daniel.kinzler@wikimedia.de:
Hello Nilesh!
Good to hear from you. I was off for a couple of days, and asked Lydia to make introductions. Thanks Lydia!
A quick heads up:
The architecture we have discussed with the team at the HPI is a bit different from what we designed for the GSoC project. The idea is to have a MediaWiki extension that relies directly on the data in a MySQL table, and generates suggestions from that. It does not care where the data comes from, so the database table(s) server as an interface between the "front" (mediawiki) part and the "back" (data analysis) part. This has two advantages: 1) front and back are decoupled and only have to agree on the structure and interpretation of the data in the database (this is the current TODO). 2) No new services need to be deployed in the public-facing subnet.
I think your expertise with data ingestion could help the folks at the HPI quite a bit. Also, the modular architecture allows for data analysis components to be swapped out easily, and we would like to try and compare different approaches for data analysis. One based on Hadoop and/or Myrrix could well be an option - though I'm not sure whether Myrrix would be very useful, since the actual generation of suggestions from the pre-processed data would already be covered.
This is just an idea, I think you can best figure things out among yourself.
Cheers, Daniel
Am 25.11.2013 17:01, schrieb Lydia Pintscher:
Hey everyone,
I have the feeling it would be good to make an official introduction. Nilesh has been working on the Wikidata entity suggester. There is now a team of students who are working on the entity suggester to get it finished and ready for production as part of their bachelor project. It would be good if you could work together and coordinate on the public wikidata-tech list. I'm sure with you all working together we can provide the Wikidata community with the great entity suggester they are waiting for. Virginia and co: Are you still having issues with the data import? Maybe Nilesh can help you with that as a first good step.
Cheers Lydia
Wikidata-tech mailing list Wikidata-tech@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-tech