Dear All,
We would like to open a Wiktionary for the Amis language. Amis is an
Austronesian language used by the Amis -- one of the Austronesian
people in Taiwan. The Amis account for about one third of Taiwan's
indigenous population (i.e. 130 thousand), however, many of the younger
generation do not speak the language and till now there is still no
exact ststistic report of the number of real speakers.
There are two prevailing writing systems of this language: the
Presbyterian Church system and the International Phonetic Symbol
system, the later is used only within the academic circle. We would
like to edit a Wiktionary which basically employs the Presbyterian
Church system but also notes the IPS so that readers can also know the
exact pronounciations.
At this moment there are at least three people who are willing to
commit to this project. They are Nakao Eki, Afah Lisin, and Tai-ni
Tsou. The former two are native speakers and the later two as
ethnographers are also familiar with the academic Amis writing system.
Pektiong Tan (zh-min-nan:pektiong) will help those people to get
familiar with the wikipedia system.
We would prefer to set the default interface language to be Mandarin
with traditional character used in Taiwan (zh-tw) since most of the
potential user of this wiktionary can read Mandarin with traditional
character.
Proposed domain name:
http://amis.wiktionary.org/
Language tag for Amis:
ISO 639-2: N.A.
ISO 639-3 (Draft): Ami
SIL: ALV
RFC-3066: i-ami
Sincerely,
Nakao Eki,
Afah Lisin,
Tai-ni Tsou,
Pektiong Tan
Some info about Taiwanese aborigine:
http://en.wikipedia.org/wiki/Taiwanese_aborigine
Stefan Jensen wrote:
>Good morning Sabine and everybody,
>Great to see more people joining into the communication. I am not a frequent user of the Wicki discussion tools which Gerhard proposes to use for this but I'll try to move things over to there. (And I see Bernard is already using it).
>
Dear Stefan,
really nice to "cybermeet" ;o) you and the others in this Wiki/EEA ring...
The merit or the blame - you'll see later... - is to be put on that
German Vesuvio ;o) who's definitely SabiNet...
We recently got in touch in a list of discussion - LANGIT - I quitted
very fast, for some reasons, after launching a project, RL - RAPTVS
LINGVÆ, which she was the only replier to.
As soon as Sabine has got some time to explain to me where and how to
effectively start to work in/for Wiki world with the support of EEA
(even if at present I'm already doing/studying really several things,
and would need paid work, I'll be glad to play actually my part, also
owing to its being a good reference on the whole and it will let me learn.
My mother tongue is Italian, which I studied also at the university;
moreover I worked as a professional proofreader and editor for ten years
and am an author.
As for Chinese at least - I can't assure you nothing - but I'm in high
touch with some Chinese linguists (two women) who know English and are
interested in Italian too. They have a sort of translation agency on the
net and I created and moderate an Itlaian subforum on their Chinese
website.
Enough & maybe too much?, for the time being...
ALBatro.
Carme diem,
ALBatro.
__________________________________________________________
Alberto L. Beretta, FIRCA
(Freelance Italian Relations & Communications Assistant).
Italian proofreading - editing - composition - voiceovers - translations
escorting - research - other.
(www.carmediem.it, under refurbishment until 27 Aug.) - info(a)carmediem.it.
Email a.l.beretta(a)virgilio.it, fax on request,
(+39) 02 57501442, home office/study (+39) 338 3524079 (mobile),
Via Pavese 137, 20089 Rozzano (Milan), Italy (EU).
Hoi,
I did send this message using the wrong mail-ID so I do it again using a
forward. This is my request to speak at Wikimania.
Thanks,
GerardM
-------- Original Message --------
Subject: Wiktionary presentation
Date: Sat, 30 Apr 2005 14:28:34 +0200
From: Gerard Meijssen <gerardm(a)myrealbox.com>
To: cfp(a)wikimedia.org
CC: wiktionary-l(a)Wikipedia.org
Hoi,
Several times I have tried to make sure that we would have a wiktionary
presentation. I was told that this could not be as I already am slotted
to speak. That workshop I would gladly do away with in order to be able
to speak about Wiktionary. Wiktionary will become very relevan for many
people and as I say in my presentation this will be increasingly true.
The one thing that Ultimate Wiktionary does is synthesising one
lexicilogical resource potentially containing all words of all languages.
I believe that with the intial deployment of the Ultimate Wiktionary, we
will continue on a learning curve that will result in something
extraordinary.
Thanks,
GerardM
---------------------------------------
The excitement of Wiktionary
A paper for a presentation for Wikimania 2005
Wiktionary is the biggest project after Wikipedia. As there will be big
changes in the way Wiktionary will work, there is a need to explain what
these changes are about. These changes will be quite fundamental; they
will merge wiktionaries in one Ultimate Wiktionarie and consequently the
communities will merge as well. Technically the changes are as profound,
the software for Wiktionary and Wikipedia are the same; with the
inclusion of Wikidata in the Mediawiki software it will be possible to
host structured data within a Mediawiki project. This will have a
profound impact on the way information will be added.
The current wiktionaries are all separate projects; they all do things
in their own way. Some have adopted a set of shared templates that allow
for the easy transfer of data from one Wiktionary to another. This
manual transfer allows for over 80% of the data to be the same in each
Wiktionary. The drawback of this system is that changes in one project
are not easily shared. It is not explicitly clear who added new content
and it can therefore be argued that the rules of the GNU-FDL are not
complied with.
As the different wiktionaries will be merged into the structured
“Ultimate Wiktionary” all changes will be possible to all users never
mind what primary language is used. It means that the rich Neapolitan,
Sicilian, Papiamento resources will be available to all users. The
Kurdish content will have much more links to other languages, it will be
a richer resource based on what we already have.
In the implementation of UW, we will incorporate logic that is
associated with thesauri. It will be possible to rank terminology, to
say things like relations, inclusiveness etc. This logic is particularly
important, as we want to include thesauri like the GEMET thesaurus.
GEMET is one thesaurus of the European Community; inside UW it will be
possible to add translations to other non-EU languages. This in turn
will facilitate trade from an to the EU.
One reason why we want to include this kind of information is, because
it will make UW relevant. As we will include more and more lexicological
information more people will turn to UW for this type of information. As
UW will have the “edit” button people are invited to add content and
improve content.
One other thing we envision doing with UW, is to allow for the import
and export of XML data. We hope to get formal cooperation with
organizations like the EU. This way we enable the EU to keep their
quality eye on their thesaurus.
It is incredible that the funding by Kennisnet for the initial
development of UW makes it possible to entertain all these
possibilities. In my mind it is just a matter of time, hard work and
enthusiasm that will make all this come true.
I have given presentations about Wiktionary before, you can find them on
META:Presentations.
Hoi,
I learned that there are two ways of writing Papiamento. The Aruban
way and the Antilian way. In order to distinguish these two versions,
I have added pap-ar and pap-an as the two ways in which you can write
Papiamento..
The content we received from FrankC is Antilliaans.
Thanks,
GerardM
This is a brief report from the FLOSS Conference in South Africa that
Erik and I attended this week. A more detailed version is on Meta at
<http://meta.wikimedia.org/wiki/Conference_reports/FLOSS%2C_South_Africa_2005>;
so please read that one instead if you have time.
I was invited to give a presentation about the Wikimedia projects at
the international "Free/Libre and Open Source Software" (FLOSS) and
Free Knowledge workshop in Pretoria, South Africa. Erik was given the
opportunity to hold a workshop there about wiki technology. The byline
for the conference was "Knowledge for all, Education for all", so the
Wikimedia projects fitted in perfectly.
The first day was made up of formal presentations. A list of these is
on Meta. My talk was part of a "Digital Commons" panel. Much of the
second day was divided into two workshops, including Erik's. The theme
of Free Knowledge Communities was discussed on day 3, and there were
many areas in which Wikimedia projects could collaborate with existing
initiatives, and new ideas for using Wikimedia content:
* Spoken Wikipedia by cell phone. Many areas of Africa have high
cell phone coverage with access to SMS. Teemu Leinonen of the
University of Art and Design Helsinki is working on a project to allow
a user to send an SMS with the article title to a phone number. A few
seconds later, they get a call on their cell phone with a (usually
machine-generated) spoken version of the article they requested.
* Wikipedia in schools. Static HTML dumps on DVD, offline
applications that allow editing, and update feeds like rsync to
maintain offline copies, were all requested by people working on
getting Wikipedia into schools. Where people were interested in print
projects, they wanted to focus on printing out particular topics,
rather than having a copy of the entire encyclopedia.
* Wiktionary. There is a need for a repository of legal
terminology in the 11 official languages of South Africa since courts
often rely on untrained interpreters who need a reference guide for
dealing with unfamiliar terminology from any of the languages they
were not native speakers of.
* Wikibooks/Wikiversity/E-learning. With the price of textbooks
much higher in South Africa than in developed countriesfree textbooks
are of extreme importance, and Wikibooks could provide the content
needed for initiatives to deliver this. We discussed our existing and
potential future projects at length and talked to proponents of
various e-learning initiatives.
Again, see http://meta.wikimedia.org/wiki/Conference_reports/FLOSS%2C_South_Africa_2005
for details on any of these issues.
==Meetup==
On the evening of the third day, the first African meetup as held in
"Cafe 41". Four Wikipedians from South Africa participated: Laurens,
Alias, Renier Maritz and Andy Rabagliati. Renier's wife also joined
us, along with some people from the conference. We discussed ways to
promote the Afrikaans Wikipedia, methods to distribute Wikipedia to
Africa, localization of the interface, and possibilities for
e-learning.
==Future conferences==
Several upcoming conferences were mentioned as being of possible
interest to Wikimedia. Most notable of these are WSIS
(<http://www.itu.int/wsis/>;), which I believe Jimmy and Yann may be
attending, and the World Conference on Computers in Education
(<http://www.sbs.co.za/wcce2005/>;), for which no Wikimedia attendance
is currently planned.
Unfortunately, we did not see much of South Africa beyond the
conference centre. Nevertheless, the visit was very productive and led
to many new contacts and insights. We aim to follow up on the
discussions, and turn some of the ideas above into reality soon.
Angela.
--
http://en.wikipedia.org/wiki/User:Angela
Yann:
> I don't understand why there are important differences for the numbers of
> articles for some Wiktionaries with the manual count (from
> Special:Statistics) .....
> There are also curious figures for database size
> http://en.wikipedia.org/wikistats/wiktionary/EN/TablesDatabaseSize.htm
I already anwered Yann a few days ago privately but didn't realize the
question had been posted here, so here is the answer.
Erik Zachte
----------------------------------------------------------------------------
----
== 1 ==
For database size I checked sk:
The dump is full of spam!, which has been reverted since, see for example
http://sk.wiktionary.org/w/index.php?title=Wiki&oldid=1725
== 2 ==
For article counts I checked Serbian dump.
There has clearly been a massive cleanup (either delete or move) since last
dump.
Here is part of the namespace 0 data that I can make sense of:
One article per line, format title|content
Sample1: There are numerous articles (see unicode gibberish in sample 1)
that only contain one or two category tags, no data.
Sample2: These data have very probably been deleted or moved.
So expect a much lower article count after next run.
Appendix: samples from Serbian dump
Sample1:
ÐилоÑ|[[Category:СÑпÑко мÑÑко име]]
ÐаÑко|[[Category:СÑпÑко мÑÑко име]]
ÐоÑÑе|[[Category:СÑпÑко мÑÑко име]]
РаÑко|[[Category:СÑпÑко мÑÑко име]]
ÐлекÑандаÑ|[[Category:СÑпÑко мÑÑко име]]
ÐемаÑа|[[Category:СÑпÑко мÑÑко име]]
Ðелена|[[Category:СÑпÑко женÑко име]]\n\n[[Category:У
обÑади]]
ÐилиÑа|[[Category:СÑпÑко женÑко име]]\n\n[[Category:У
обÑади]]
ТиÑана|[[Category:СÑпÑко женÑко име]]\n\n[[Category:У
обÑади]]
Sample2:
Accesskey-addsection|+
Accesskey-anontalk|n
Accesskey-anonuserpage|.
Accesskey-article|a
Accesskey-contributions|<accesskey-contributions>
Accesskey-currentevents|<accesskey-currentevents>
Accesskey-delete|d
Accesskey-edit|e
Accesskey-emailuser|<accesskey-emailuser>
Accesskey-help|<accesskey-help>
Accesskey-history|h
Accesskey-login|o
Accesskey-logout|o
Accesskey-mainpage|z
Accesskey-move|m
Accesskey-mycontris|y
Accesskey-mytalk|n
Accesskey-portal|<accesskey-portal>
Accesskey-preferences|<accesskey-preferences>
Accesskey-protect|=
Accesskey-randompage|x
Accesskey-recentchanges|r
Accesskey-recentchangeslinked|c
Accesskey-sitesupport|<accesskey-sitesupport>
Accesskey-specialpage|<accesskey-specialpage>
Accesskey-specialpages|q
Accesskey-talk|t
Accesskey-undelete|d
Accesskey-unwatch|w
Accesskey-upload|u
Accesskey-userpage|.
Accesskey-viewsource|e
Accesskey-watch|w
Accesskey-watchlist|l
Accesskey-whatlinkshere|b
Tooltip-addsection|Add a comment to this page. [alt-+]
Tooltip-anontalk|Discussion about edits from this ip address [alt-n]
Tooltip-anonuserpage|The user page for the ip you#*$@re editing as [alt-.]
Tooltip-article|View the content page [alt-a]
Tooltip-atom|Atom feed for this page
Tooltip-contributions|View the list of contributions of this user
Tooltip-currentevents|Find background information on current events
Tooltip-delete|Delete this page [alt-d]
Tooltip-edit|You can edit this page. Please use the preview button before
saving. [alt-e]
Tooltip-emailuser|Send a mail to this user
Tooltip-help|The place to find out.
Tooltip-history|Past versions of this page, [alt-h]
Tooltip-login|You are encouraged to log in, it is not mandatory however.
[alt-o]
Tooltip-logout|Log out [alt-o]
Tooltip-mainpage|Visit the Main Page [alt-z]
Tooltip-move|Move this page [alt-m]
Tooltip-mycontris|List of my contributions [alt-y]
Tooltip-mytalk|My talk page [alt-n]
Tooltip-nomove|You don#*$@t have the permissions to move this page
Tooltip-portal|About the project, what you can do, where to find things
Tooltip-preferences|My preferences
Tooltip-protect|Protect this page [alt-=]
Tooltip-randompage|Load a random page [alt-x]
Tooltip-recentchanges|The list of recent changes in the wiki. [alt-r]
Tooltip-recentchangeslinked|Recent changes in pages linking to this page
[alt-c]
Tooltip-rss|RSS feed for this page
Tooltip-sitesupport|Support Wiktionary
Tooltip-specialpage|This is a special page, you can#*$@t edit the page
itself.
Tooltip-specialpages|List of all special pages [alt-q]
Tooltip-talk|Discussion about the content page [alt-t]
Tooltip-undelete|Restore the $1 edits done to this page before it was
deleted [alt-d]
Tooltip-unwatch|Remove this page from your watchlist [alt-w]
Tooltip-upload|Upload images or media files [alt-u]
Tooltip-userpage|My user page [alt-.]
Tooltip-viewsource|This page is protected. You can view its source. [alt-e]
Tooltip-watch|Add this page to your watchlist [alt-w]
Tooltip-watchlist|The list of pages you#*$@re monitoring for changes.
[alt-l]
Tooltip-whatlinkshere|List of all wiki pages that link here [alt-b]
Hoi,
I just reailised that with ultimate wiktionary, there will be a need to
have an URL where the Ultimate Wiktionary will be located. I do not
think it is acceptable to host the Ultimate Wiktionary as wiktionary.org
as this is the current portal page. Therefore I think something like
*ultimate.wiktionary.org* would be as good a choise as any.
When a Wiktionary decides to merge into the Ultimate wiktionary, this
domain will be a redirect to the Ultimate Wiktionary. Technically there
will be a need to redirect references in Wikipedia when a Wiktionary is
migrated. Linking from the Ultimate Wiktionary to Wikipedia or any other
project should remain the same. It should be obvious that the Ultimate
Wiktionary can use all resources of all Wiktionaries. It is as obvious
that any Wiktionary can use the resources of the Ultimate Wiktionary.
Thanks,
GerardM