I was trying to use this, but my Java is a bit rusty. How do I run the DumpProcessingExample?
Dear all,
I am happy to announce the very first release of Wikidata Toolkit [1],
the Java library for programming with Wikidata and Wikibase. This
initial release can download and parse Wikidata dump files for you, so
as to process all Wikidata content in a streaming fashion. An example
program is provided [2]. The libary can also be used with MediaWiki
dumps generated by other Wikibase installations (if you happen to work
in EAGLE ;-).
Maven users can get the library directly from Maven Central (see [1]);
this is the preferred method of installation. There is also an
all-in-one JAR at github [3] and of course the sources [4].
Version 0.1.0 is of course alpha, but the code that we have is already
well-tested and well-documented. Improvements that are planned for the
next release include:
* Faster and more robust loading of Wikibase dumps
* Support for various serialization formats, such as JSON and RDF
* Initial support for Wikibase API access
Nevertheless, you can already give it a try now. In later releases, it
is also planned to support more advanced processing after loading,
especially for storing and querying the data.
Feedback is welcome. Developers are also invited to contribute via github.
Cheers,
Markus
[1] https://www.mediawiki.org/wiki/Wikidata_Toolkit
[2]
https://github.com/Wikidata/Wikidata-Toolkit/blob/v0.1.0/wdtk-examples/src/main/java/org/wikidata/wdtk/examples/DumpProcessingExample.java
[3] https://github.com/Wikidata/Wikidata-Toolkit/releases
(you'll also need to install the third party dependencies manually when
using this)
[4] https://github.com/Wikidata/Wikidata-Toolkit/
_______________________________________________
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l