Kevin Webb wrote:
> I just managed to finish decompression. That took about 54 hours on an
> EC2 2.5x unit CPU. The final data size is 5469GB.
>
> As the process just finished I haven't been able to check the
> integrity of the XML, however, the bzip stream itself appears to be
> good.
>
> As was mentioned previously, it would be great if you could compress
> future archives using pbzib to allow for parallel decompression. As I
> understand it, the pbzip files are reverse compatible with all
> existing bzip2 utilities.
Looks like the trade off is slightly larger files due to pbzip2's
algorithm for individual chunking. We'd have to change the
buildFilters function in http://tinyurl.com/yjun6n5 and install the new
binary. Ubuntu already has it in 8.04 LTS making it easy.
Any takers for the change?
I'd also like to gauge everyones opinion on moving away from the large
file sizes of bz2 and going exclusively 7z. We'd save a huge amount of
space doing it at a slightly larger cost during compression.
Decompression of 7z these days is wicked fast.
let know
--tomasz
Hi everyone
I've been asked to add a contacts database to a mediawiki site so that users
can search for contacts. It will be a simple database. I have no idea how to
start this. Do i just create a contacts database on the mysql server? And
then add a search box?
I'm very new to MediaWiki but enjoying every minute of using it.
Thanks Mak.
Hello Everyone,
I was going through GSoc ideas for 2010 on Wikimedia and I am really
interested in Maps idea (
http://www.mediawiki.org/wiki/GSoC#Services_and_other_outside_technology). I
am writing this mail to show my interest in this project idea.
I have worked with maps earlier also and also I was one of the members of
Forum Nokia Developer Advisory Council for Nokia Maps Player. The Forum
Nokia Developer Advisory Council is a way to engage distinguished developers
early on in the development of Nokia products and materials. It enables
selected developers to incorporate their insights and feedback into
materials and products before they are released publicly.
I have adequate experience on working towards software development and
currently doing my master's at Arizona State university in computer science
department.
I was especially interested in mapping of tourist places to maps. What I
mean is that lets say there is a sentence "Eiffel tower is in Paris", when
you bring your mouse on Eiffel tower text written in this sentence, a map
pops up which displays Eiffel tower in Paris, the focus of map on Eiffel
tower. This will be interesting as then user wouldn't need to browse away
from page just to look up a map where she/he can find Eiffel tower location
in Paris.
If this project idea seems good, I can surely work towards it. At the moment
I am looking for someone who can mentor me regarding this project.
Hoping to hear some positive reply.
Regards,
Shubhendra Singh
Hi all!
This is a quick reminder that registration for the Wikimedia Developers'
Workshop (in Berlin, April 14-16) will end on Sunday, March 21. Most places are
already take, but we have room for 16 more people to attend.
So, if you want to come, sign up now at <http://www.amiando.com/WMCON10DEV.html>!
More information about the workshop is available at <http://tinyurl.com/wmdev10>.
-- Daniel Kinzler
Hi,
is there a MediaWiki feature or external tool to get a live feed of Commons file
uploads? (By live I mean something that can be used to show a realtime slideshow
of new images. I vaguely remember someone saying that the WMF office has a
screen with such a slideshow.)
If there is no such thing, what would be the best starting point to write one?
#commons.wikimedia(a)irc.wikimedia.org?
thanks
Gergő
>> Perhaps a separate IRC channel for the GSOC students/mentors (#wikimedia-gsoc?) (and who ever else wants to hangout in there) so that it doesn't get flooded with random questions and such and wouldn't be as hectic (or scary) for the newer irc users.
> That would be worse, splitting what knowledge/time is available into disparate segments is not great.
I agree that a seperate IRC channel is probably not a good idea. (There are enough mw channels already, and in most of them people give really great support, and there is no reason to cut the students off the main community.) However, a mailing list for the people involved with GSoC would be a great tool. Last year I did not really know who the other students and mentors where, and even although I did considerable effort in finding out, there was no place that had their contact info, or where we could have a discussion. A mailing list would go a long way in solving that. Last year I created a google group for this purpose about half way through the project, but this didn't take off. I suspect it's critical to have this sort of infrastructure in place before the projects start, and to give all students a poke towards it. This would also help detecting problems, both on student and mentor side, that would remain hidden if there is only communication
between the students and their mentors.
Cheers
--
Jeroen De Dauw
* Wiki: wiki.bn2vs.com
* Blog: blog.bn2vs.com
* Skype: rts.bn.vs
--
Don't panic. Don't be evil.70 72 6F 67 72 61 6D 6D 69 6E 67 20 34 20 6C 69 66 65!
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
The first beta release of the 1.16 branch is now available for
download. Please try it and tell us if it works for you. This beta
release is not recommended for use in a production environment.
Selected changes since MediaWiki 1.15 that may be of interest:
* Watchlists now have RSS/Atom feeds. RSS feeds generally are now
hidden, since Atom is a better protocol and is supported by virtually
all clients.
* It's now possible to block users from sending email via
Special:Emailuser.
* The maintenance script system was overhauled. Most maintenance
scripts now have a useful help page when you run them with --help.
* AdminSettings.php is no longer required in order to run maintenance
scripts. You can just set $wgDBadminuser and $wgDBadminpassword in
your LocalSettings.php instead.
* The preferences system was overhauled. Preferences are stored in a
more compact format. Changes to site default preferences will
automatically affect all users who have not chosen a different preference.
* Support for SQLite was improved. Some broken features were fixed,
and it now has an efficient full-text search.
* The user groups ACL system was improved by allowing rights to be
revoked, instead of just granted.
* A new localisation caching system was introduced, which will make
MediaWiki faster for almost everyone, especially when lots of
extensions are enabled.
By default, this new system makes a lot of database queries. If your
database is particularly slow, or if your system administrator limits
your query count, or if you want to squeeze as much performance as
possible out of Mediawiki, set $wgCacheDirectory to a writable path on
the local filesystem. Make sure you have the DBA extension for PHP
installed, this will improve performance further.
Full release notes:
http://svn.wikimedia.org/svnroot/mediawiki/tags/REL1_16_0beta1/phase3/RELEA…
**********************************************************************
Download:
http://download.wikimedia.org/mediawiki/1.16/mediawiki-1.16.0beta1.tar.gz
GPG signatures:
http://download.wikimedia.org/mediawiki/1.16/mediawiki-1.16.0beta1.tar.gz.s…
Public keys:
https://secure.wikimedia.org/keys.html
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.9 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org
iEYEARECAAYFAkua4+gACgkQgkA+Wfn4zXmEegCfaZ153qv8zNiQk18JNwiTfi3w
NZcAni7Yf8v/lQcctKGya4JH4ow4hAFb
=JK00
-----END PGP SIGNATURE-----
Dear All,
Many thanks for all your work with Wikipedia, we use it daily for various
tasks and in our research on Wikipedia.
("we" = a subset of a research group of the Hungarian Academy of Sciences)
I just managed to wget the history
dump enwiki-20100130-pages-meta-history.xml.bz2 a week ago, and I would like
to combine this with status information about users. Do you happen to know
if the status (admin, steward, bot, etc.) of all users has been logged for
enwiki? In what way would this be available for download and
for non-profit basic research?
Thanks,
Illes
--
http://hal.elte.hu/fij
Good afternoon Wikimedians,
The deadline for application from Open Source Projects to Google Summer
of Code 2010 is looming (in about 48 hours), and I'm coordinating the
formal Wikimedia Foundation entry. There has already been some
excellent discussion on
http://www.mediawiki.org/wiki/Summer_of_Code_2010 but *we definitely
need more mentors*.
I've asked volunteer Rob Lanphier ([[user:RobLa]], IRC: robla) to help
me wrangle the conversation over the next 36 hours or so on
*#wikimedia-tech* to flush out folks who were thinking about mentoring
as well as additional project ideas...starting immediately. Friday
we'll finalize the application and submit.
Towards the Fun,
Danese Cooper
CTO, Wikimedia Foundation