Hoi, When Wikipedia has an approach to specific articles that are not compatible with Wikidata, we can create items that fit our need and keep the original item for what it is .. for instance a list of people (in the case of the Wright brothers).
The notion that Wikidata defers to Wikipedia is not one can keep because there are bound to be Wikipedias who differ in their approach and have an article for both Wilbur and Orville Wright..
Yes, it is good to have a hope for algorithms in the future, in the mean time consider what percentage is wrong and that quite often not having data is more damaging than having data that can be manipulated with queries, tools. No data is no grip at all. We do have queries in WDQ/Autolist and we have tools in ToolScript and pywikipedia.
IMHO the most important thing we should do to get better quality is report on differences. This helps all projects involved in an import / export / comparison. Thanks, Gerard
On 29 April 2014 09:15, John Mark Vandenberg jayvdb@gmail.com wrote:
On Sun, Apr 27, 2014 at 8:28 PM, Amir Ladsgroup ladsgroup@gmail.com wrote:
there are some problems in using bio template for example they used it
for a
group of people
This is quite a difficult problem. Also look for infoboxes not at the top of a page, because the Wikipedia page contains two concepts. Here is an example with {{Bio}}:
https://it.wikipedia.org/wiki/Slashdot
In the journals area, I faced this many times with the article about a society not having an infobox for the society, but including an infobox in a section for their primary journal .
My bot has some very hacky code to detect the infobox type in a few languages
https://www.wikidata.org/wiki/User:JVbot/periodicalbot.py (the first function)
It would be good if we can create an algorithm that detects all these anomalies, or a special hidden parameter added to the invocation, to exclude those templates from automated parsing, but also lists all pages like this so that those pages can be split on the Wikipedias (unless notability rules prevent the split).
-- John Vandenberg
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l