What is the solution for varied spellings which do not depend on an authority such as color vs. colour in English? What about when certain aspects of spelling are independent? colourise, colourize, colorize all exist in English, colorise seems not to exist.
What about the other orthographical variations I mentioned in an earlier post such as ASCII vs curved apostrophe in most languages or Hebrew and Arabic with varying degrees of pointing?
Andrew Dunbar (hippietrail)
On 7/30/05, Gerard Meijssen gerard.meijssen@gmail.com wrote:
Lars Aronsson wrote:
Gerard Meijssen wrote:
As the Dutch spelling will change in August 2006, we added things like spelling authorities.
Great, then you can introduce a new authority for every century of a language, so the 19th century Swedish can be told apart from 20th century Swedish. But how are spelling authorities different from languages? Couldn't "19th century Swedish" and "Dutch after 2006" be treated as languages of their own?
No, an authority is just that. When the Spelling is marked as depreciated, you change the record in ValidSpelling. The reason for this extra table is, that the word "paardenbloem" will be depreciated and the older version of "paardebloem" is to be appreciated again. The Spelling does give a date, that one is about when it was introduce. A spelling authority is an organisation that decides on a specific spelling. The NTU is such an organisation for the Dutch language, the change in 2006 follows the change of 1996.
As I said in a previous mail, this was created because I will have to accomodate this change in a contemporary language. When 19th century Swedish changed to a more modern version, the old spelling can be depreciated by adding the ValidUntil in a ValidSpelling record. The old spelling and the new Spelling are related by a Relation record.
So the design will be set for some time allowing us to learn what it is we have. It will be changed when we know what to change and why. The requirements for content will be minimal and improvements will happen because of a collective effort.
Have you started to populate the database yet? What cycle span do you plan for between evaluations and redesigns?
At this moment I am working on the database design, Erik is working on Wikidata and when he has finished his part, we will work on the implementation as described on Meta. As to data, there are several many that will be included into the Ultimate Wiktionary. The content of the Wiktionaries is important among these. We have a wordlist in Stellingwerfs of 18.865 waiting in the wings enough to rank it as the sixth biggest Wiktionary. I have been informed that another version will have some 20K+ articles .. We have some 222.930 correctly spelled Dutch words that we may use (only spelling with hypenations). We have the content of several glossaries and thesauri.
So at this stage questions on the database stuff are very welcome. We do not expect to start including data until somewhere September. But when we do, it will be available for everybody to play with it. Thanks, Gerard _______________________________________________ Wikitech-l mailing list Wikitech-l@wikimedia.org http://mail.wikipedia.org/mailman/listinfo/wikitech-l