Hi,
Le Tuesday 28 June 2005 19:20, Gerard Meijssen a écrit :
Andrew Dunbar wrote:
The two ways to do it are:
1) Parse the database. This is very difficult due to a myriad of article
formats and a very large number of articles in which the format is just
broken. I did however develop a parser just good enough to count articles
and translations by language for the non-broken examples. Sadly the hard
drive on which the code lived was destroyed in a grey-out or power surge.
2) Use of templates. On Wiktionary there is quite a bit of "anarchy" or
"democracy" at present so it's very difficult to introduce new features
and also to have such features used for their proposed purpose without
being extended. Also people losing interest in their new ideas, and the
fact that there are *so many* articles their to go back through and
classify after agreeing on a way to do so.
I think the best way would be to a) come to an agreement of how to make
a language-tracking template. b) Create a new parser that can find
language headings in their various variant forms. c) Use the data created
by the parser with a bot to tag all the existing articles. d) Make the new
tags compulsory.
Hoi,
When there is a decision that a particular template is to be used, it is
possible to use a bot to replace one pattern that indicates that a word
is in a language with this template. Many templates can be changes in
succession that will help us to implement the chosen template. I do
suggest that the use of the templates already in use on many of the
wiktionaries makes sense as it will help foster cooperation between the
different Wiktionaries as it helps us to share content. Important to
note is, that the content of these templates is a matter of choise for
the individual Wiktionary as long as the definition is shared by all.. I
will be happy to help in implementing one fixed set of templates on the
English wiktionary. When the known patterns have been replaced by the
selected templates, it will be possible to create a list with articles
that do not have the new templates. These have to be changed manually in
order to identify them as words in a particular language.
This was done on the French Wiktionary when it changed from plain wiki mark up
to template mark up. I did it with the help of a bot.
Thanks,
GerardM
Regards,
Yann
--
http://www.non-violence.org/ | Site collaboratif sur la non-violence
http://www.forget-me.net/ | Alternatives sur le Net
http://fr.wikipedia.org/ | Encyclopédie libre
http://www.forget-me.net/pro/ | Formations et services Linux