Hi Lydia,
That helps a lot, and makes it way more interesting. Rather than being a one-size-fits-all solution, as it seems to me, each property or each type of property (eg. different relationships) will need individual attention and different methods/metrics for recommendation.
The examples you gave, like continents, sex, relations like father/son, uncle/aunt/spouse, or place-oriented properties like place of birth, country of citizenship, ethnic group etc. - each type has a certain pattern to it (if a person was born in the US, US should be one of the countries he was a citizen of; US census/ethnicity statistics may be used to predict ethnic group etc.) I'm already starting to chalk out a few patterns and how they can be used for recommendation. In my proposal, should I go into details regarding these? Or should I just give a few examples and explain how the algorithms would work, to explain the idea?
Thanks, Nilesh
On Sun, Apr 28, 2013 at 2:10 AM, Lydia Pintscher < lydia.pintscher@wikimedia.de> wrote:
On Sat, Apr 27, 2013 at 10:25 PM, Nilesh Chakraborty nilesh@nileshc.com wrote:
I have a question - when someone creates a new statement, for suggesting "properties", I can use collaborative filtering to make suggestions. Example, explained in the simplest terms - suppose there are X cities in the dataset. The user is adding another city (writes 'city in Australia' for short description). The system checks all other cities, figures out
the
common properties and suggests them. Cool. But I can't get any "exact" ideas off the top of my head that can used to suggest "values" for the properties. Suppose one of the recommended properties is "population".
How
can I make the system guess its value? (Am I getting this right?) Have
you
guys got anything on your minds regarding this? Please point me to the right direction. :)
For your example I'd say that isn't really possible indeed. But take for example a country. Someone wants to add http://www.wikidata.org/wiki/Property:P30 to indicate which continent this country is on. Across all of Wikidata this property should have a very limited number of values. The same is true for things like the sex of a person. And then for something a bit more advanced: there are things like the property father. The suggested values for this should be other items that are persons. http://www.wikidata.org/wiki/Wikidata:List_of_properties has the list of all current properties. I am sure you can find more such cases.
Hope that makes it clearer.
Cheers Lydia
-- Lydia Pintscher - http://about.me/lydia.pintscher Community Communications for Technical Projects
Wikimedia Deutschland e.V. Obentrautstr. 72 10963 Berlin www.wikimedia.de
Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.
Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das Finanzamt für Körperschaften I Berlin, Steuernummer 27/681/51985.
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l