Hi! I'm interested in using SPARQL with my own Wikibase installation, but I am getting errors in the munge step.
To reproduce the problem, I set up a new Wikibase instance and tried to load it into wikidata-query-rdf. It has only one item, one property, and one statement.
First I used the included dumpRdf.php to create the RDF file, attached.
Then I did the munge step. It produced the errors, attached.
I don't understand why it reports "Unrecognized subjects" when the subjects listed do match the patterns in the message: "Expected only sitelinks and subjects starting with http://www.wikidata.org/wiki/Special:EntityData/ and http://www.wikidata.org/entity/".
I also don't understand which revision ID it is looking for.
What is the proper procedure for indexing? If necessary, I can code in Java and fix the tools. I just need an overview of how this system works.
Thanks! Regards, Miguel
Hi!
> First I used the included dumpRdf.php to create the RDF file, attached.
> Then I did the munge step. It produced the errors, attached.
Notice something weird: in your RDF, the wdata prefix is:
@prefix wdata: <http://www.wikidata.org/wiki/index.php/Special:EntityData/> .
Usually it is expected to be http://www.wikidata.org/wiki/Special:EntityData/
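To make the mismatch concrete, here is a minimal Python sketch of a prefix check like the one the error message describes. It is not the munger's actual code: the function names and the item ID Q1 are made up for illustration, and sitelinks are ignored. The two prefixes are taken from the error message itself.

```python
def expected_prefixes(host="www.wikidata.org"):
    """Build the two subject prefixes named in the munger's error message."""
    return (
        f"http://{host}/wiki/Special:EntityData/",
        f"http://{host}/entity/",
    )


def unrecognized_subjects(subjects, host="www.wikidata.org"):
    """Return the subjects that start with neither expected prefix."""
    prefixes = expected_prefixes(host)
    # str.startswith accepts a tuple and matches if any prefix fits.
    return [s for s in subjects if not s.startswith(prefixes)]


# Hypothetical subjects; Q1 is an assumed item ID.
subjects = [
    "http://www.wikidata.org/entity/Q1",
    "http://www.wikidata.org/wiki/Special:EntityData/Q1",
    "http://www.wikidata.org/wiki/index.php/Special:EntityData/Q1",
]
print(unrecognized_subjects(subjects))
```

Only the last subject is reported, because the extra index.php path segment means it no longer starts with the expected Special:EntityData prefix.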
Also, if you're running your own instance, you may want to set a different URL for the wikidata prefix by setting conceptBaseUri (see docs/options.wiki). You may then also want to use the -w option for the munger to set the correct hostname for your setup, and -DwikibaseHost when you're running Blazegraph/updater.
Thanks Stas!
Thanks for spotting that. I didn't enable URL rewriting on my MediaWiki instance, so index.php is exposed.
I just manually replaced my own URL with www.wikidata.org, hence the weird combination.
One more thing - do the tools support https URLs?
Cheers Miguel
Wikidata-tech mailing list Wikidata-tech@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-tech
Hi!
On 3/22/17 3:34 PM, Miguel Paraz wrote:
> Thanks Stas!
> Thanks for spotting that. I didn't enable URL rewriting on my Mediawiki instance, thus the index.php is exposed.
> I just manually replaced my own url with the www.wikidata.org, thus the weird combination.
> One more thing - do the tools support https URLs?
https should work. Ping me if you have any problem.
Hi,
For my https://example.com URLs, when I add "-w example.com" to the Java command line in munge.sh, I get:
Expected only sitelinks and subjects starting with http://example.com/wiki/Special:EntityData/ and http://example.com/entity/
When I replace https://example.com with http://example.com in my RDF, the munge works.
Thanks! Miguel
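The workaround described above, rewriting the scheme in the dump before munging, could be scripted roughly as follows. This is only a sketch of that manual substitution, not part of the wikidata-query-rdf tools. The function name and the sample triple (including Q1 and P1) are made up for illustration.

```python
def force_http(rdf_text, host):
    """Replace https://<host>/ with http://<host>/ throughout an RDF dump."""
    return rdf_text.replace(f"https://{host}/", f"http://{host}/")


# Hypothetical one-line sample; a real dump would be read from a file.
sample = '<https://example.com/entity/Q1> <https://example.com/prop/direct/P1> "value" .'
result = force_http(sample, "example.com")
print(result)
```

URLs on other hosts are left alone, since the replacement only matches the given hostname.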
Hi!
> For my https://example.com URLs, when I add "-w example.com" to the Java command line in munge.sh, I get:
> Expected only sitelinks and subjects starting with http://example.com/wiki/Special:EntityData/ and http://example.com/entity/
> When I replace https://example.com with http://example.com in my RDF, the munge works.
Hmm, it may not be accounting for https URLs. I'll check that, thanks.