Hey Aleksey,

The library allows you to access the position of the DumpReader and to resume from a stored position.


This functionality is used by Replicator, a CLI tool build on top of the JSON Dump Reader library.


Neither the library or the CLI tool support streaming dumps (unless that somehow magically ends up working). I'm happy to review pull requests with additions or enhancements.

Note that Replicator supports import from the Wikidata web API, including automatic fetching of dependencies. This works if you want to get a specific set of entities or just a few thousand for testing purposes. If you want all entities from Wikidata this approach is of course not viable.


Cheers
Software Crafter | Speaker | Student | Strategist | Contributor to Wikimedia and Open Source
~=[,,_,,]:3