Good morning!
I collected month names at http://hu.wikipedia.org/wiki/Szerkeszt%C5%91:Bin%C3%A1ris/Tartalomjegyz%C3%A... as shown in archivebot.py. I need it for writing and testing the TOCbothttp://hu.wikipedia.org/wiki/Szerkeszt%C5%91:Bin%C3%A1ris/TOCbot, which has to recognize short and long month names in archive name patterns. The relevant part of my code is: * text += '<br>'.join([Site.mediawiki_message(MonthsS[i]) for i in range(12)]) + '\n|' text += '<br>'.join([Site.mediawiki_message(MonthsL[i]) for i in range(12)]) *for each Site in site.validLanguageLinks().
If there is any Dutch here, please check the *nl* row, because I saw that archivebot.py handles it in a specific way that I don't understand, and cannot check because of different locale.
Now, some rows show irregular behavior which I don't understand.
1. In rows *ne* and *ti* a part of the short names seems to be native, and some of the short names English. (Apr, Oct in *ne* and from Jul to Dec in *ti*.) Is that perhaps a MediaWiki bug? 2. In the last row, *zu*, all the long names have a 'u' at the beginning, just as in Python Unicode strings. Is that correct or some misbehaviour? 3. In the row *ba* there are two short names (ғин, май) that don't fit in the line. Or is that the other ten? They are different and lower case, while the others upper case and identical to the long name. Is that a bug or a feature? If bug, is it in MediaWiki? 4. There are two rows, *ba* and *ky*, where there are alternative versions in parenthesis, the basic version being *almost* identical to Russian. Does it mean that any of the given versions may appear in that wiki as a month name?
Question #4 is obsolete, I checked both wikis, and they use the month names exactly as it is shown in the table.
2011/4/25 Bináris wikiposta@gmail.com
Good morning!
I collected month names at
http://hu.wikipedia.org/wiki/Szerkeszt%C5%91:Bin%C3%A1ris/Tartalomjegyz%C3%A... as shown in archivebot.py. I need it for writing and testing the TOCbothttp://hu.wikipedia.org/wiki/Szerkeszt%C5%91:Bin%C3%A1ris/TOCbot, which has to recognize short and long month names in archive name patterns. The relevant part of my code is:
- text += '<br>'.join([Site.mediawiki_message(MonthsS[i]) for i in
range(12)]) + '\n|' text += '<br>'.join([Site.mediawiki_message(MonthsL[i]) for i in range(12)]) *for each Site in site.validLanguageLinks().
If there is any Dutch here, please check the *nl* row, because I saw that archivebot.py handles it in a specific way that I don't understand, and cannot check because of different locale.
Now, some rows show irregular behavior which I don't understand.
- In rows *ne* and *ti* a part of the short names seems to be native,
and some of the short names English. (Apr, Oct in *ne* and from Jul to Dec in *ti*.) Is that perhaps a MediaWiki bug?
This is likely because the translators over at translatewiki.net haven't
got to those names yet, and MediaWiki therefore doesn't have a translation for it yet. When translating the messages are shown alphabeticaly (AFAIR), and since the month names don't have a prefix they are scattered all over the place.
- In the last row, *zu*, all the long names have a 'u' at the
beginning, just as in Python Unicode strings. Is that correct or some misbehaviour?
That is correct. u- is their noun class, probably class 11 in the Bantu
class system (see http://en.wikipedia.org/wiki/Noun_class#Bantu_languages).
- In the row *ba* there are two short names (ғин, май) that don't fit
in the line. Or is that the other ten? They are different and lower case, while the others upper case and identical to the long name. Is that a bug or a feature? If bug, is it in MediaWiki?
Don't know about this one.
- There are two rows, *ba* and *ky*, where there are alternative
versions in parenthesis, the basic version being *almost* identical to Russian. Does it mean that any of the given versions may appear in that wiki as a month name?
-- Bináris
Pywikipedia-l mailing list Pywikipedia-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/pywikipedia-l
Thank you, Jon, these were really useful answers. I think the script will try to get the month names from mediawiki, and there will be an option to overwrite them for a selected language.