I bring this old issue up because I want to know if (or if not) progress (or plans) are made to update the static HTML version of Wikipedia. B&H photos just leaked the next generation of Archos portable media players. Unbelievably, the rumors of a 500GB version is true! This is already tempting (especially the price at $420). Just waiting for specs on September 15, the Archos event. I really hope it will support NTFS so I can use the compression feature.
It would be really cool and convenient to have an offline copy of Wikipedia anywhere I go without the need of Wi-Fi. What am I gonna do with 500GB?
BTW, does anyone know what is the size of the current static HTML English Wikipedia version uncompressed? Thanks.
Chengbin Zheng wrote:
I bring this old issue up because I want to know if (or if not) progress (or plans) are made to update the static HTML version of Wikipedia. B&H photos just leaked the next generation of Archos portable media players. Unbelievably, the rumors of a 500GB version is true! This is already tempting (especially the price at $420). Just waiting for specs on September 15, the Archos event. I really hope it will support NTFS so I can use the compression feature.
It would be really cool and convenient to have an offline copy of Wikipedia anywhere I go without the need of Wi-Fi. What am I gonna do with 500GB?
BTW, does anyone know what is the size of the current static HTML English Wikipedia version uncompressed? Thanks.
I don't think a static dump is the best way to keep wikipedia on your hd.
On Tue, Sep 1, 2009 at 9:00 PM, Platonides Platonides@gmail.com wrote:
Chengbin Zheng wrote:
I bring this old issue up because I want to know if (or if not) progress
(or
plans) are made to update the static HTML version of Wikipedia. B&H photos just leaked the next generation of Archos portable media
players.
Unbelievably, the rumors of a 500GB version is true! This is already tempting (especially the price at $420). Just waiting for specs on
September
15, the Archos event. I really hope it will support NTFS so I can use the compression feature.
It would be really cool and convenient to have an offline copy of
Wikipedia
anywhere I go without the need of Wi-Fi. What am I gonna do with 500GB?
BTW, does anyone know what is the size of the current static HTML English Wikipedia version uncompressed? Thanks.
I don't think a static dump is the best way to keep wikipedia on your hd.
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
It is the only way actually. Although I'm curious on what other ways one can use to keep Wikipedia
Archos PMPs are not computers, but they do have the ability to go on the Internet, and read an HTML file offline through the hard drive.
On Tue, Sep 1, 2009 at 8:31 PM, Chengbin Zheng chengbinzheng@gmail.comwrote:
BTW, does anyone know what is the size of the current static HTML English Wikipedia version uncompressed? Thanks.
Based on some quick extrapolation (the smaller dumps seem to be compressed at ~21-22x), it seems like the dump from a year ago would be about 300GB.
On Tue, Sep 1, 2009 at 9:47 PM, Benjamin Leesemufarmers@gmail.com wrote:
On Tue, Sep 1, 2009 at 8:31 PM, Chengbin Zheng chengbinzheng@gmail.comwrote:
BTW, does anyone know what is the size of the current static HTML English Wikipedia version uncompressed? Thanks.
Based on some quick extrapolation (the smaller dumps seem to be compressed at ~21-22x), it seems like the dump from a year ago would be about 300GB.
The static HTML dumps seems to include all namespaces. I made an estimate a few weeks ago that the main namespace for enwiki is now about 250GB when rendered as HTML. (It will compress to 12 GB or so.) Keep in mind that these estimates don't include any images, which would eat up massive amounts of space if you include them.
-Robert Rohde
Hi Chengbin, hi list,
static.wikimedia.org is currently not being updated and while the dumps processing has been assigned to and completely rewritten by Tomasz Finc (developer at WMF), there has not been made any assignment concerning HTML dumps.
We had a Wikipedia Offline meeting at Wikimania last week and discussed several issues. One issue is the fact, that WMF wants to see the ZIM file format being used for offline dumps and has suggested to include it into the regular dumping process. So one question was: When will that happen, what is the status of WMF ZIM dumping? As ZIM uses HTML extracts Tomasz clarified that once static.wikimedia.org has been rebuild to be stable and sutainable, integrating ZIM would be trivial. But he also informed us that this task has not yet been assigned.
As Brion Vibber and Erik Möller have been at the meeting as well we hope that this assignment will be made soon and this task has got higher priority.
This said I may also advise you not to you use the pure HTML dumps but the ZIM files for your Archos, because that's what they are meant for. A ZIM file containing all german Wikipedia articles (>900,000) is 1,4 GB, an additional full text search index takes another 1 GB.
Greets,
Manuel
Am Mittwoch, 2. September 2009 schrieb Chengbin Zheng:
I bring this old issue up because I want to know if (or if not) progress (or plans) are made to update the static HTML version of Wikipedia. B&H photos just leaked the next generation of Archos portable media players. Unbelievably, the rumors of a 500GB version is true! This is already tempting (especially the price at $420). Just waiting for specs on September 15, the Archos event. I really hope it will support NTFS so I can use the compression feature.
It would be really cool and convenient to have an offline copy of Wikipedia anywhere I go without the need of Wi-Fi. What am I gonna do with 500GB?
BTW, does anyone know what is the size of the current static HTML English Wikipedia version uncompressed? Thanks. _______________________________________________ Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
On Wed, Sep 2, 2009 at 8:13 AM, Manuel Schneider < manuel.schneider@wikimedia.ch> wrote:
Hi Chengbin, hi list,
static.wikimedia.org is currently not being updated and while the dumps processing has been assigned to and completely rewritten by Tomasz Finc (developer at WMF), there has not been made any assignment concerning HTML dumps.
We had a Wikipedia Offline meeting at Wikimania last week and discussed several issues. One issue is the fact, that WMF wants to see the ZIM file format being used for offline dumps and has suggested to include it into the regular dumping process. So one question was: When will that happen, what is the status of WMF ZIM dumping? As ZIM uses HTML extracts Tomasz clarified that once static.wikimedia.orghas been rebuild to be stable and sutainable, integrating ZIM would be trivial. But he also informed us that this task has not yet been assigned.
As Brion Vibber and Erik Möller have been at the meeting as well we hope that this assignment will be made soon and this task has got higher priority.
This said I may also advise you not to you use the pure HTML dumps but the ZIM files for your Archos, because that's what they are meant for. A ZIM file containing all german Wikipedia articles (>900,000) is 1,4 GB, an additional full text search index takes another 1 GB.
Greets,
Manuel
Am Mittwoch, 2. September 2009 schrieb Chengbin Zheng:
I bring this old issue up because I want to know if (or if not) progress (or plans) are made to update the static HTML version of Wikipedia. B&H photos just leaked the next generation of Archos portable media players. Unbelievably, the rumors of a 500GB version is true! This is already tempting (especially the price at $420). Just waiting for specs
on
September 15, the Archos event. I really hope it will support NTFS so I
can
use the compression feature.
It would be really cool and convenient to have an offline copy of
Wikipedia
anywhere I go without the need of Wi-Fi. What am I gonna do with 500GB?
BTW, does anyone know what is the size of the current static HTML English Wikipedia version uncompressed? Thanks. _______________________________________________ Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
-- Regards Manuel Schneider
Wikimedia CH - Verein zur Förderung Freien Wissens Wikimedia CH - Association for the advancement of free knowledge www.wikimedia.ch
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
I'm not familiar with the file extension .zim. What is that? Some sort of compressed html format like .chm? Where can I get a .zim file? I need to get check if this format is compatible with my Archos's Opera browser.
Hoi, For you information Okawix is localised at translatewiki.net. Thanks, GerardM
http://translatewiki.net/wiki/Translating:Okawix
2009/9/2 Manuel Schneider manuel.schneider@wikimedia.ch
Hi Chengbin, hi list,
static.wikimedia.org is currently not being updated and while the dumps processing has been assigned to and completely rewritten by Tomasz Finc (developer at WMF), there has not been made any assignment concerning HTML dumps.
We had a Wikipedia Offline meeting at Wikimania last week and discussed several issues. One issue is the fact, that WMF wants to see the ZIM file format being used for offline dumps and has suggested to include it into the regular dumping process. So one question was: When will that happen, what is the status of WMF ZIM dumping? As ZIM uses HTML extracts Tomasz clarified that once static.wikimedia.orghas been rebuild to be stable and sutainable, integrating ZIM would be trivial. But he also informed us that this task has not yet been assigned.
As Brion Vibber and Erik Möller have been at the meeting as well we hope that this assignment will be made soon and this task has got higher priority.
This said I may also advise you not to you use the pure HTML dumps but the ZIM files for your Archos, because that's what they are meant for. A ZIM file containing all german Wikipedia articles (>900,000) is 1,4 GB, an additional full text search index takes another 1 GB.
Greets,
Manuel
Am Mittwoch, 2. September 2009 schrieb Chengbin Zheng:
I bring this old issue up because I want to know if (or if not) progress (or plans) are made to update the static HTML version of Wikipedia. B&H photos just leaked the next generation of Archos portable media players. Unbelievably, the rumors of a 500GB version is true! This is already tempting (especially the price at $420). Just waiting for specs
on
September 15, the Archos event. I really hope it will support NTFS so I
can
use the compression feature.
It would be really cool and convenient to have an offline copy of
Wikipedia
anywhere I go without the need of Wi-Fi. What am I gonna do with 500GB?
BTW, does anyone know what is the size of the current static HTML English Wikipedia version uncompressed? Thanks. _______________________________________________ Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
-- Regards Manuel Schneider
Wikimedia CH - Verein zur Förderung Freien Wissens Wikimedia CH - Association for the advancement of free knowledge www.wikimedia.ch
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
wikitech-l@lists.wikimedia.org