As shaihulub told me on irc, it seems the parser using tokenizer.php
have been disabled. It looks like we are now using an old parser instead.
Lot of the recent bug reports on sourceforge are related to this swicth:
lot of features and bugfixing are no more implemented / fixed :-(
It seems there is a performance issue on the tokenizer, so should we :
1/ get ride of the tokenizer (current situation) and fix bug /
2/ improve the tokenizer performances
Currently we get a lot of big gaps between paragraphs as well as at the
beginning and end of articles. The parser generates things like this:
whenever there is a Category or inter-wiki link on a line of its own.
Please could someone modify the parser so that, after taking out the
category and inter-wiki links, it removes (ignores) all whitespace at
the beginning and end of the article, and reduces sequences of paragraph
breaks to a single one?
Personally, I think this whole thing is silly. I don't think there
should be a Wikipedia in Klingon, or Ferengi, or Pig-Latin, or any
other pseudo-language. (In the immortal words of William Shatner,
"Get a life!") If a few Trekkies want to start up a Klingon wiki,
they can download the MediaWiki software themselves and install it on
their own server. That's how I started the Disinfopedia. I didn't
need to ask anyone else for permission, because I didn't use anyone
else's resources (other than the software, which I thank the
Wikipedia community for giving away free).
Having said this, though, perhaps a few folks here will enjoy taking
a look at Google's language tools, which enable users to "Use the
Google Interface in Your Language," with choices including "Elmer
Fudd," "Pig Latin," "Esperanto," "Hacker," "Klingon" (of course), and
something called "Bork, bork, bork!" The URL is:
Can we have a Wikipedia in Bork, bork, bork!, please? :-)
Please pardon me if this has been suggested before or if it already exists.
I am in love with Wikipedia. What I, as a fairly new Wikipedia user,
would find terribly helpful would be a Wikipedia Lookup Button for my
browser similar to what Merriam Webster has:
1) I'm reading something on the web that I would like to know more about
2) I highlight the phrase or word that I want to look up in Wikipedia
3) I press the Wikipedia browser button
4) Boom! I'm taken right to the correct Wikipedia page
On the danish Wikipedia we have already localized everything into danish on the new MediaWiki 1.3, but a couple of hours ago all the newly introduced translations disappeared.
Please, whoever deleted them, put them back ASAP.
As Med mentioned in a mail in reply to Magnus, he & I are working on a
project called 'wikirover' (exists on sourceforge, empty for now).
Not very advanced yet, but you never know :p
I think some people may be interested, so i'll try to explain what we
have in mind.
The aim will be, ultimately, to make an offline Wikipedia, that could
be distributed on CD/DVD/...
But not necessarily the whole encyclopedia, a subpart would do it
(sports? movies? you name it).
The core will be a C++ library. It'll have tools to store raw articles
(and history) in an sqlite database, parse'em to different formats
(html first, then maybe others). Also search capabilities.
Maybe classes to access articles from live Wikipedia, mysql server,
and/or database dump.
Then an application that'll actually display articles, from local
database. It'll use different raw articles sources, and merge that
(like: first download 'movie' database, then 'actors/actresses', it'll
merge & make correct links between topics). With also update
capability (from live Wikipedia? dump? why not :p), things like that
(basically, things you'd expect from a regular encyclopedia).
And, potentially, a tool to manage those theme-related bases.
We'll try to make it crossplatform (we can test on Linux & Windows).
Library will use platform independant code, applications probably
wxWidgets (depends, nothing is sure yet :p)
From: "Tim Starling"
> SIL seems to have little time for constructed languages, listing only
> three. ISO 639-2, on the other hand, has a policy allowing any language
> with more than 50 documents to obtain a code. Hence, Klingon is included
> in ISO's short list, but not in SIL's much longer one.
> My proposal is to automatically allow any language considered one of
> SIL's main languages, and to only seek community approval when it is not
> listed. I think we should largely ignore the ISO list.
The SIL ethnologue list is quite flawed.
In this respect the ISO codes are more
The Ethnologue lists
Esperanto, Europanto, and Interlingua.
It further mentions that Interlingua is
a language of France...
It also claims that Esperanto is a language
of France, and that it has "200 to 2,000 people who
speak it as first language". If so it would be a
natural and non-artificial language for them
wouldn't it, those French native speakers of
Esperanto.... Highly irregular!
The list is flawed, and the fact that they include
"Europanto" is quite a joke, no kidding,
Europanto was a joke language developed
by translators within the EU and only for
amusement. To exclude Volapük which
had at one time hundreds of thousands
of learners and users and still has a small
community of active users is just wrong
if one is going to include "Europanto"
which no one really uses as a community
except joking translators within the
EU Brussels, European Union buildings...
as Ethnologue points out.
The Ethnologue list is definitely flawed and
worse as a resource in this respect than the
use of ISO codes.
with the discussion clearly showing majority support for the Klingon
Wikipedia, please can we finally re-enable it now?
Please allow me to remind you again that people are waiting to ACTUALLY
CONTRIBUTE to the damn thing.