Yipeee !
Thanks to Shaihulud, who was so kind to do the technical biz while on holiday, nl:wiktionay is now on UTF-8. I have been adding new words that contain special characters and it works like a dream.
The other good news it that the current content is also really as it should be. So all the information copied from Timwi's website is really looking good. It also means that I can now add new content to other wiktionaries without fear of mucking things up. On that note, there is content on en:wiktionary that is not UTF-8 (English - see the edit page..)
The word that I have done first is Nederlands. :)
http://nl.wiktionary.org/wiki/Nederlands
Thanks,
GerardMnt-Dispo
When I try to reach en:wiktionary.org or nl:, de:, I get an error message: redirection limit exceeded or cookie problems. So I tried Internet Explorer which does not load the page either, no error message though.
Can someone please find out what the problem is ??
Thanks,
GerardM
Gerard Meijssen wrote:
> Ray,
> My point today was that it would be good if all wiktionarie are
> converted to UTF-8. When you read what was said by Tim Starling, it
> refers to UTF-8. That you think that I winge is beside the point, the
> point is that UTF-8 will solve issues. My alphabet soup I acquired the
> hard way, I worked for it and it allows me to say things from
> experience about maintaining data in an IT environment..
>
> If you want to discuss anything meaningfull, talk about UTF-8. The
> rest is bullshit. Argument for: it eases programming and maintenance.
> It allows for the creation of articles with names like ჰოლანდიური or
> ភាសាហុលឍង់េដ or 네델란드어. The only drawback is that these characters
> will not be seen in an URL.
>
> Please be constructive and keep to what is at issue. If you think
> conversion to UTF-8 is not a good move, let us please know why not.
>
> Thanks,
> GerardM
Nobody is seriously arguing against UTF-8, so please don't make up a
story about my being against it. Just curb your impatience.
UTF-8 was not mentioned in Tim's comments. Do I need to qute his entire
comment for you to understand that? My complaint is about a discussion
on one subject which is hijacked to pretend to be something completely
different.
Ec
On the beer parlour of en:wiktionary, Tim Starling says that it would be helpfull if all wiktionaries work in the same way. The article was about UTF-8.
The opportunity now is to have all wiktionaries transfer to UTF-8. This is the only way to have articles with titles like ჰოლანდიური or ភាសាហុលឍង់េដ or 네델란드어. As the purpose of wiktionaries is to have articles of words in any language, it should be obvious that this is the way to go.
PS I would like these articles in nl:wiktionary :) GerardM
Ray Saintonge wrote:
> Gerard.Meijssen wrote:
>
>> On the beer parlour of en:wiktionary, Tim Starling says that it would be helpfull if all wiktionaries work in the same way. The article was about UTF-8.
>> The opportunity now is to have all wiktionaries transfer to UTF-8. This is the only way to have articles with titles like ჰოლანდიური or ភាសាហុលឍង់េដ or 네델란드어. As the purpose of wiktionaries is to have articles of words in any language, it should be obvious that this is the way to go.
>>
>> PS I would like these articles in nl:wiktionary :) GerardM
>>
>>
> English Wiktionary shifted to UTF-8 at a very early stage of its development. There are still a few minor problems left over from the transfer, but none important enough to make us rush to clean them up. nl ended up with something else because the software was copied from the nl Wikipedia in the interest of having something workable available quickly. I don't think that there is much opposition to having all the Wiktionaries on UTF-8.
>
> It is only speculation on my part to say that Tim would favour having every project on UTF-8 (he can speak best for himself), but the Beer Parlour context had to do with first letter capitalization. See http://en.wiktionary.org/wiki/Wiktionary:Beer_parlour/case-sensitivity_vote at the very end.
>
> Please don't confuse the two issues.
>
> Ec
>
> _______________________________________________
> Wiktionary-l mailing list
> Wiktionary-l(a)Wikipedia.org
> http://mail.wikipedia.org/mailman/listinfo/wiktionary-l
>
>
When you read the post, you will find that in the post about the case sensitivity vote, UTF-8 was mentioned. In that context quote "I'd really much prefer it if all the Wiktionaries were converted, or none of them." and later "... asked me to switch of first letter capitalisation for de.wiktionary.org, but I refused. ..."
Please read the article and you will see that the first part has to do with UTF-8 . Read the effing article..
>From a professional IT point of view having many environments all different is a disaster waiting to happen. To prevent them, all situations should be/have to be tested. This requires scarce resources. Silly backward compatability for no good reason makes the work of developpers and system management people needlessly complicated.
PS I am AS/400 certified CISSP, MCP, CNE, Tivoli certified and know little of this environment but basically the more different configurations the bigger the headache.
GerardM
When you read the post, you will find that in the post abou
On wikitech there is a discussion of the relative merits of having UTF-8 and monstrosities (for editors) like "è" or ᏲᏁᎦ/ᎩᎢᏏ
It seems that you can search wikipedia for èh or something. This will not find you èh. As wiktionary has many monstrosities, it would be best to convert them to something legible again.
I have been carefull not to add these things to en:wiktionary after I became aware of this problem, but I do know that many of these exist. In order to prevent them getting new ones, I think the best way of preventing this is by converting all wiktionaries to UTF-8.
I am still hoping that nl:wiktionary may soon be converted to UTF-8; there are many words that we cannot add to our dictionary because of this backwardness.
Thanks,
GerardMare many words that we cannot add to our dictionary