Hoi,
I am extremely happy that I can inform you on Boxing day that a tired
Erik has produced the first tangible result of the Wikidata / Ultimate
Wiktionary project. It shows that we want great content in many
languages, that we want to include thesaurus information and that we
are happy to include with gratitude content like the Gemet thesaurus.
:) I want to thank Erik for working really hard to make this happen :)
Thanks,
Gerard ..
This is the text of Erik's E-mail to the Wikitech mailinglist:
*****************************
Ho ho ho,
we now have our first read-only prototype of Ultimate Wiktionary /
Wikidata, using a subset of the final UW design. This subset is a
complex, versioned relational database that can model
- lexical items (words, short phrases) with multiple meanings
- synonyms and translations, on the meaning level
- other relationships between them, on the meaning level.
The prototype is at:
http://epov.org/wd-gemet/index.php/Main_Page
It contains over 70,000 words in 22 languages; many of them have
definitions. The definitions usually come in 4 languages. As I
understand it, we can have this data under the GFDL, but it's just one
of many building blocks we will use in seeding the UW.
There will be at least one significant upgrade to this protoype before
the end of the year. All the tables and fields for versioning complex
relations without ballooning up the database are already there. I'm not
sure if the model I have in mind for versioning works yet, and I hope to
test and demonstrate it soon. (Versioning, in my opinion, is the single
greatest challenge for Wikidata.) I also want to show how we try to "eat
our own dogfood" in Ultimate Wiktionary by localizing the user interface
using the content of the dictionary.
All the records are already hooked up to pages and revisions, so you can
use [[Special:Allpages]] and the like to navigate. When there are
identical words in different languages, all the translations and
definitions are shown on the same page.
Our goal with Ultimate Wiktionary is to provide an even more complex
application that will make this data collaboratively editable, to add
dynamic user-based views, APIs, and crucial features such as
inflections, etymologies, complex relations and attributes, and much
more. This will be a huge challenge. Fortunately, more funding seems to
be on the horizon, allowing us to put more developers on the job.
Ultimate Wiktionary is just one application of Wikidata, and we will try
to generalize as much functionality as possible, so that it will be
reasonably straightforward to build new Wikidata apps. In particular,
versioning and all basic relation types should be handled on the
Wikidata level. There are thousands of possible new applications for
Wikimedia and other MediaWiki users if we get this right.
Please take a look at the prototype. The view component is a quick and
dirty hack, but the backend is approaching some stability. There are
some small inconsistencies in the data here and there, some of them
inherited from GEMET. Due to time constraints, I also had to stop the
import at about 80% leading to a few red links; I'll try to import the
remaining terms in the next few days.
Finally, expect a paper explaining some of the key ideas of Wikidata and
UW, showing the first user interface prototypes, and defining future
development milestones and applications. I will also try to describe
some of the forthcoming changes to the MediaWiki core that come with the
need of Wikidata to handle multiple languages in one instalation; these
changes will benefit multi-language projects like Meta and Commons.
I will be at the 22C3 on December 30 to demonstrate this prototype as
well as the completed namespace manager, and to answer questions.
Best,
a very tired Erik
_______________________________________________
Wikitech-l mailing list
Wikitech-l(a)wikimedia.org
http://mail.wikipedia.org/mailman/listinfo/wikitech-l
Hoi
An e-mail awaits I was extremely happy to sen awaits approval for
further distribution.. Its problem is that it has too many
recipients..
Could someone please oblige me :)
Thanks,
GerardM
Sirs,
I am now asking a wiktionary to be initalized for the already existing lombard wikipedia: http://lmo.wikipedia.org.
Many thanks.
Sincerely yours,
Claudio Meneghini
---------------------------------
Nouveau : téléphonez moins cher avec Yahoo! Messenger ! Découvez les tarifs exceptionnels pour appeler la France et l'international.Téléchargez la version beta.
Hoi,
As the suggested logo was not included by the mail server, I did upload
for you to Commons.. It can be found here
http://commons.wikimedia.org/wiki/Image:Wiktionary2.gif
Thanks,
GerardM
Hoi,
There are some people that do not like the name "Ultimate Wiktionary"
for the Ultimate Project. The name is used for the project. This does
not mean that we have to use ultimate.wiktionary.org as its URL. Sue
Ellen came up with the name Wiktionary2, Arle came up with this design.
I am sure there will be more suggestions to choose from.. We just have
to come up with them :)
Thanks,
GerardM
Just for fun:
Same font as the Wikipedia logo (Hoefler Text, which demonstrates
someone in the early stages of Wikipedia was a Mac user since Hoefler
Text is an Apple font).
Take this as something tossed out: even if the name Wiktionary^2 is
used, I won't be hurt if this isn't used...
-Arle
Hoi,
There are some people that do not like the name "Ultimate Wiktionary"
for the Ultimate Project. The name is used for the project. This does
not mean that we have to use ultimate.wiktionary.org as its URL. Sue
Ellen came up with the name Wiktionary2, Arle came up with this design.
I am sure there will be more suggestions to choose from.. We just have
to come up with them :)
Thanks,
GerardM
Just for fun:
Same font as the Wikipedia logo (Hoefler Text, which demonstrates
someone in the early stages of Wikipedia was a Mac user since Hoefler
Text is an Apple font).
Take this as something tossed out: even if the name Wiktionary^2 is
used, I won't be hurt if this isn't used...
-Arle
Hoi,
On the Dutch Wiktionary user:Marcel had the good sense to add the word
"donatie". I have included a pronunciation and the missing translations
from the English Wiktionary (I have my doubts about the translation in
Yiddish). Subsequently I copied the data to the Italian Wiktionary.
Now that we need a lot of money to give our favourite resource a
brilliant future, could we not make sure that we get a REALLY big amount
of translations to this word.. The trick is that in order to do this, we
have to collaborate :) It would be one way of showing that we care :)
Thanks,
GerardM
Hi, by chance I had a look at uz.wiktionary.org - it is being vandalised
(or better spammed with urls) regularly, but up to now has no obvious
contents. wouldn't it make sense to block it from being edited? Or at
least find someone who looks after it once a month to do the clean-up
(having admin rights)?
Thanks,
Sabine
___________________________________
Yahoo! Messenger: chiamate gratuite in tutto il mondo
http://it.messenger.yahoo.com
Hoi,
I found a nice update on the ISO 639-3 website ..
http://www.sil.org/iso639-3/default.asp is where you can even find a
nice introductory page on the next version of this standard. For the
hardcore language nuts among us there is even a nice download available
including instructions on how to create an SQL table.
In the first public genuine Wikidata outing, we will show you the GEMET
data in a true Wikidata environment. For those who do not know, GEMET is
a thesaurus with ecological content produced for/by the European Union.
Our aim is to be able to have this on line before Christmas, it will be
a read only implementation.
We will also include the ISO-639-3 codes. As you may know, Wiktionary
has the explicit aim to include all words of all languages. Ultimate
Wiktionary (UW) shares this aim and shares the practice with the many
wiktionary that we explicitly intent to include all lexicological
content. As UW intends to eat its own dog food, we want to have
localised labels for the languages chosen by the user for the User
Interface. When localisation for a term is not available, we will have
English as the lingua franca of this day and age.
The consequence is that there will be a clear difference between the
user interface of Ultimate Wiktionary and the user interface of
Mediawiki. UW does not intent to endorse a language for new projects,
but it is likely that people will be stimulated to work on the Mediawiki
user interface in order to have a user interface that is completely
localised. I expect that people appreciate this difference.
In the ISO-639-3 codes there will be languages that have not been
recognised. Having ISO-639 recognition is not necessary for inclusion in
Ultimate Wiktionary.. There is only one thing that we will insist on,
the acceptance of a code for this language, dialect or orthography that
is acceptable for potential projects within the Wikimedia Foundation.
Again, it is not to be seen as an endorsement for a language to have a
project, it is intended to make sure that such a code has been
"future-proofed".
Thanks,
GerardM