Got it! Thanks. The developer of Openrefine-Wikibase
is telling me
that, inexplicably in my own instance, there is a discrepancy between
what is returned in the Mediawiki search UI:
(0 result)
What could be the reason for that? In theory both result sets should
match, do they? This problem should not be related to CirrusSearch,
but I may be wrong... Any idea?
Cheers,
Régis
Le jeu. 19 sept. 2019 à 12:15, Lucas Werkmeister
<mail(a)lucaswerkmeister.de <mailto:mail@lucaswerkmeister.de>> a écrit :
Special:Search is the general MediaWiki search, only partially
related to Wikibase, that’s why I specified you should test the
other one :)
Installing CirrusSearch is probably your best bet, if it’s
possible for you, yeah.
Cheers,
Lucas
On 19.09.19 10:52, Régis Robineau wrote:
Thank you for getting me on the right track.
You're right, we're
not using CirrusSearch for the moment, and the suggestion box (at
the top right corner) does not perform cross-languages search in
our instance: the autocomplete mechanism is only aware of the
current active language. But on the other hand the main Wikibase
search (via Special:Search) does seem to search all available
languages...
I will ask the openrefine-wikibase developer if he thinks there
is a way to solve this in his application. But anyway I guess the
best way to benefit from a proper cross-languages search, both
within Wikibase and for the reconciliation service, would be to
use CirrusSearch as well. What do you think?
Cheers,
Régis
Le jeu. 19 sept. 2019 à 00:08, Lucas Werkmeister
<mail(a)lucaswerkmeister.de <mailto:mail@lucaswerkmeister.de>> a
écrit :
It looks like the openrefine-wikibase reconciliation service
uses the wbsearchentities API to find items. As far as I’m
aware, the default SQL-based Wikibase search also searches
other languages, but still, I think the most likely reason
you’re getting different results is that Wikidata uses
WikibaseCirrusSearch
<https://www.mediawiki.org/wiki/Extension:WikibaseCirrusSearch>,
and I assume your wiki doesn’t. If you use entity search on
your wiki directly (i. e. not via Special:Search, but in
suggestion boxes), does cross-language search work as it
should or does it have the same problem?
Cheers,
Lucas
On 18.09.19 22:55, Régis Robineau wrote:
Hi all,
I'd need help on the Wikidata/Wikibase reconciliation
service for OpenRefine.
Context: I have my own Wikibase and WDQS instances in
production, and I want to set up a reconciliation service on
top of it, so that users can perform matchings from their
local OpenRefine. I'm using the same tool as Wikidata, i.e.
https://github.com/wetneb/openrefine-wikibase. The web
service works fine, I can reconcile strings in OpenRefine
against the data stored in my Wikibase instance...
Issue: But there is a noteworthy difference compared to how
the Wikidata reconciliation service works:
- with Wikidata, i.e. by using the web service URL with the
"en" language prefix
(
https://tools.wmflabs.org/openrefine-wikidata/en/api), I am
able to find matches among labels in any other language of a
Wikidata item. For instance, if I send a request for "Jacopo
de Fazio", which is an alias in French and Italian
for Q313460 <https://www.wikidata.org/wiki/Q313460>,
OpenRefine will match Q313460 as expected, even if I'm using
the "en" language code in the web service url.
- Whereas in my own instance, i.e. by using my own
"openrefine-wikibase" reconciliation service, it can only
perform matching of labels/aliases in the same language:
e.g. if I use
"https://my-service.org/openrefine-wikidata/en/api", the web
service only searches for labels in English in my Wikibase.
This means that I am forced to launch the reconciliation
process in OpenRefine for every single language, one by one.
I do not know how the Wikidata reconciliation service is
able to take into account all the labels/aliases in all the
languages of a given Wikidata item. The data is modeled in
the same way in Wikidata and in my Wikibase, and I do not
see any difference between the two in the way the RDF data
is structured into the respective triplestores...
How can I enable the same behaviour as in the Wikidata
reconciliation service? (i.e. to look for labels/aliases in
every languages in one API call)
This would heavily improve the reconciliation process in
OpenRefine for my users.
Thanks a lot for your help!
Régis
_______________________________________________
Wikidata-tech mailing list
Wikidata-tech(a)lists.wikimedia.org
<mailto:Wikidata-tech@lists.wikimedia.org>
https://lists.wikimedia.org/mailman/listinfo/wikidata-tech
_______________________________________________
Wikidata-tech mailing list
Wikidata-tech(a)lists.wikimedia.org
<mailto:Wikidata-tech@lists.wikimedia.org>
https://lists.wikimedia.org/mailman/listinfo/wikidata-tech
_______________________________________________
Wikidata-tech mailing list
Wikidata-tech(a)lists.wikimedia.org <mailto:Wikidata-tech@lists.wikimedia.org>
https://lists.wikimedia.org/mailman/listinfo/wikidata-tech
_______________________________________________
Wikidata-tech mailing list
Wikidata-tech(a)lists.wikimedia.org
<mailto:Wikidata-tech@lists.wikimedia.org>
_______________________________________________
Wikidata-tech mailing list
Wikidata-tech(a)lists.wikimedia.org