Congratulations on completing the huge task of producing the EN Wikip.
I also look forward to pyzim.
Looking at the notes I see an example of reading an article. However, I would like to be able to read the zim metadata. Is this possible? Even further afield would it be possible to extract the search index so as to merge it with another index, even one prepared from another source?
As always great work,
Tim
On Sat, Jul 4, 2020 at 8:00 AM offline-l-request@lists.wikimedia.org wrote:
Send Offline-l mailing list submissions to offline-l@lists.wikimedia.org
Today's Topics:
- At long last, a new version of offline enwp (Stephane Coillet-Matillon)
- Re: At long last, a new version of offline enwp (Emmanuel Engelhart)
- [OPENZIM] Introducing Python-libzim (Emmanuel Engelhart)
- Re: [OPENZIM] Introducing Python-libzim (Wilfredo Rodríguez)
Hi Tim,
Yes you can read metadata, those are stored in M prefix so retrieving the tags for example would be fetching article at M/Tags. There's no way to query the list of metadata at the moment. Indexes are stored in X namespace. I'd suggest you loop down from File.article_count using File.get_article_by_id() as indexes and metadata are stored last. With current pylibzim, you don't have control over the creation of indexes but you could still extract those.
On Sat, Jul 4, 2020 at 12:46 PM Tim Moody tim@timmoody.com wrote:
Congratulations on completing the huge task of producing the EN Wikip.
I also look forward to pyzim.
Looking at the notes I see an example of reading an article. However, I would like to be able to read the zim metadata. Is this possible? Even further afield would it be possible to extract the search index so as to merge it with another index, even one prepared from another source?
As always great work,
Tim
On Sat, Jul 4, 2020 at 8:00 AM offline-l-request@lists.wikimedia.org wrote:
Send Offline-l mailing list submissions to offline-l@lists.wikimedia.org
Today's Topics:
- At long last, a new version of offline enwp (Stephane Coillet-Matillon)
- Re: At long last, a new version of offline enwp (Emmanuel Engelhart)
- [OPENZIM] Introducing Python-libzim (Emmanuel Engelhart)
- Re: [OPENZIM] Introducing Python-libzim (Wilfredo Rodríguez)
Offline-l mailing list Offline-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/offline-l