On Sun, 20 Feb 2005 22:54:42 +0100, James R. Johnson modean52@comcast.net wrote:
Hey,
Is there any way to alter the alphabetical order used to sort lists of articles? In the OE wiki, the letter æ (a and e together) should be alphabetized after a, so that "áetan, ániman, æfter" show up in that order, and ð and þ should be arranged after d and t, respectively. Accented vowels should also sort with their unaccented versions, so that and, ániman, and ánlíepig all appear under the letter "A".
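For illustration, the ordering requested above can be sketched as a sort key in Python. This is only a sketch under stated assumptions, not MediaWiki code: the alphabet string and the names OE_ALPHABET, fold, oe_sort_key and heading are made up for this example, and the alphabet is assumed to be the Latin letters with æ, ð and þ slotted in after a, d and t.

```python
import unicodedata

# Assumed Old English order: æ after a, ð after d, þ after t (hypothetical table).
OE_ALPHABET = "aæbcdðefghijklmnopqrstþuvwxyz"
OE_RANK = {ch: i for i, ch in enumerate(OE_ALPHABET)}

def fold(word):
    """Lowercase and strip combining accents, so that 'á' is treated as 'a'."""
    decomposed = unicodedata.normalize("NFD", word.lower())
    return "".join(ch for ch in decomposed if unicodedata.category(ch) != "Mn")

def oe_sort_key(word):
    """Rank each folded letter by its position in OE_ALPHABET."""
    # Characters outside the table sort after it, by code point.
    ranks = [OE_RANK.get(ch, len(OE_ALPHABET) + ord(ch)) for ch in fold(word)]
    # The original spelling breaks ties between words that differ only in accents.
    return (ranks, word)

def heading(word):
    """Letter under which the word would be listed."""
    return fold(word)[:1].upper()

print(sorted(["æfter", "ániman", "áetan"], key=oe_sort_key))
print([heading(w) for w in ["and", "ániman", "ánlíepig"]])
```

With this key, "áetan, ániman, æfter" come out in the requested order, and and, ániman and ánlíepig all get the heading "A".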
James
Wikipedia-l mailing list Wikipedia-l@Wikimedia.org http://mail.wikipedia.org/mailman/listinfo/wikipedia-l
The zh community have a more serious problem: they want different sorting orders for Traditional and Simplified Chinese...
So I was looking at the related code, and it seems not too hard to implement a specific (but fixed) sorting order within one language. However, I only have limited time to work on this right now, and I don't fully understand how the category thing works. So I put up a test site with the basic implementation at http://tinyurl.com/5l24b. The test site is in English, and the sorting order is altered so that x, y, z come first, followed by a, b, c, etc.
The categorylinks tables will have to be rebuilt if this is to be deployed at the live sites. Not sure how expensive that will be.
Interested parties please visit the test site and provide comments either at the site or this list. If this seems to be a reasonable solution I will check it into cvs. Test site is running 1.4 from cvs.
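A rough Python sketch of how a fixed order like the test site's could be obtained is given here. It is a guess at the general technique, not the code running on the test site: it assumes the sort key is stored alongside each category entry and compared as a plain string, and the names CUSTOM_ORDER, TRANSLATION and stored_sort_key are hypothetical.

```python
import string

# Hypothetical order mirroring the test site: x, y, z first, then a..w.
CUSTOM_ORDER = "xyz" + string.ascii_lowercase[:23]

# Map the i-th letter of CUSTOM_ORDER onto the i-th letter of the normal
# alphabet, so that ordinary string comparison yields the custom order.
TRANSLATION = str.maketrans(CUSTOM_ORDER, string.ascii_lowercase)

def stored_sort_key(title):
    """Key that would be saved with the entry and compared as a plain string."""
    return title.lower().translate(TRANSLATION)

titles = ["apple", "zebra", "xenon", "banana", "yak"]
print(sorted(titles, key=stored_sort_key))
# ['xenon', 'yak', 'zebra', 'apple', 'banana']
```

If keys are precomputed like this, changing the order changes every stored key, which would fit the remark that the categorylinks tables have to be rebuilt.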
On Sun, 6 Mar 2005 22:18:14 -0600, Richard Holton richholton@gmail.com wrote:
One question that comes to my mind is how to define $wgCharOrder_en_cat. Obviously for proof of principle, you have only implemented A-Z and a-z. But isn't UTF8 quite large? Will wgCharOrder_en_cat need to include every possible character code? This could get quite cumbersome.
Barring that, the idea seems quite elegant.
-- Rich Holton en.wikipedia:User:Rholton
I think each language will have a specific sorting order for its own alphabet, which should be of manageable size. (Obviously not true for Chinese...) That should cover most of the articles of that specific wikipedia.
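To make the "manageable size" point concrete, here is a small Python sketch in which each language lists only its own alphabet and everything else falls back to code point order, so the table never has to enumerate all of Unicode. The CHAR_ORDER dictionary and make_sort_key are hypothetical names for this example and are not how $wgCharOrder_en_cat is actually defined; the Swedish entry uses the usual Swedish alphabet, which ends in å, ä, ö.

```python
import string

# Hypothetical per-language tables; each lists only that language's own alphabet.
CHAR_ORDER = {
    "en": string.ascii_lowercase,
    "sv": string.ascii_lowercase + "åäö",
}

def make_sort_key(lang):
    """Build a key function for one language from its short alphabet table."""
    rank = {ch: i for i, ch in enumerate(CHAR_ORDER[lang])}

    def key(title):
        # Letters in the table sort first, by table position; any other
        # character sorts after them, by Unicode code point.
        return [(0, rank[ch]) if ch in rank else (1, ord(ch))
                for ch in title.lower()]

    return key

words = ["öl", "zebra", "ål", "apple", "中文", "äpple"]
print(sorted(words, key=make_sort_key("sv")))
# ['apple', 'zebra', 'ål', 'äpple', 'öl', '中文']
```

This only covers simple per-letter ordering; it does not attempt the multi-level comparison a full collation algorithm provides.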
Richard Holton wrote:
One question that comes to my mind is how to define $wgCharOrder_en_cat. Obviously for proof of principle, you have only implemented A-Z and a-z. But isn't UTF8 quite large? Will wgCharOrder_en_cat need to include every possible character code? This could get quite cumbersome.
Start here: http://www.unicode.org/reports/tr10/
-- brion vibber (brion @ pobox.com)
zhengzhu wrote:
Interested parties please visit the test site and provide comments either at the site or this list. If this seems to be a reasonable solution I will check it into cvs. Test site is running 1.4 from cvs.
Please don't put something large and experimental like this into the 1.4 release branch.
-- brion vibber (brion @ pobox.com)
wikitech-l@lists.wikimedia.org