Hi Finn,
you discovered a limitation of my tool. It currently does not support lexemes, since Wikidata Toolkit has not implemented the RDF export support for them. I am not even sure if there is a JSON-representation for lexemes? For now, I simply ignored lexemes. I should mention that somewhere in the interface.
Also, the "Filter entities" only works for predicates which are wikidata properties (it uses the WD search API), which is why dct:language and ontolex:sense do not appear there (even if lexemes were supported).
Regards, Benno
On 11.12.19 16:43, fn@imm.dtu.dk wrote:
Hi Benno,
Thanks for the contribution.
Does your tool work for lexemes and other lexicographic data. When I view "Filter entities" then I do not see the ability to set properties such as dct:language and ontolex:sense.
best regards Finn Årup Nielsen https://people.compute.dtu.dk/faan/
On 11/12/2019 15:08, Benno Fünfstück wrote:
Hi everyone,
I am happy to announce a new tool I've been working on for the last few months, WDumper. The tool is available at https://tools.wmflabs.org/wdumps/.
The idea is to provide a user interface to easily generate RDF dumps for subsets of the data contained in Wikidata. As an example, the tool can generate dumps with only english labels or for a subset of the properties.
The tool is based on Wikidata Toolkit and processes the original JSON dumps provided by Wikidata. When you submit a request to create a dump, it will be added to a queue. The queue is processed in regular intervals (the maximum wait time in queue is 1h).
You can view a list of created dumps on https://tools.wmflabs.org/wdumps/dumps. The generated dump can either be downloaded directly or uploaded to Zenodo for archival, which also generates a DOI for easy referencing in scientific publications.
I want to thank Prof. Dr. Markus Krötzsch for the original idea for this tool and support during the development of the tool. If you have any questions, feel free to ask them by mail or create an issue on the GitHub page: https://github.com/bennofs/wdumper. The current version does not have a lot of features yet, so ideas for extending the tool with additional filters or options that you'd like to use are valuable feedback as well.
Also a small word of caution: while I did of course test the tool, the Wikidata data model is quite complex. Since the tool is new, bugs are more likely, so always apply a sanity check to the results. If you find bugs, please tell me or create an issue on GitHub.
Regards, Benno Fünfstück
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata