I have been on this list awhile, when i originally joined i was
interesting in the possibility of exporting the wiktionary data as
.dict format. Now that the newest version of OSX 10.4 has a built-in
dictionary that uses the dict:// to look-up words i was interested to
see if anyone on the technicaly side would like to explore the
possibility of either exporting the Wiktionary database as .dict
format, or run a dictionary daemon that would access the wiktionary
database server and return dict entries. It would be read-only, but it
would be another interesting way to access the wiktionary besides the
web interface.
Does anyone on the tech list know if this is even possible? I'm not
asking you to do it (i can write the export), i was wondering if there
is some sort of database schema available to extract the data into
dict format, or are the entries too fragmented to even attempt an
export?
-brian
--
brian suda
http://suda.co.uk
Dear All,
We would like to open a Wiktionary for the Amis language. Amis is an
Austronesian language used by the Amis -- one of the Austronesian
people in Taiwan. The Amis account for about one third of Taiwan's
indigenous population (i.e. 130 thousand), however, many of the younger
generation do not speak the language and till now there is still no
exact ststistic report of the number of real speakers.
There are two prevailing writing systems of this language: the
Presbyterian Church system and the International Phonetic Symbol
system, the later is used only within the academic circle. We would
like to edit a Wiktionary which basically employs the Presbyterian
Church system but also notes the IPS so that readers can also know the
exact pronounciations.
At this moment there are at least three people who are willing to
commit to this project. They are Nakao Eki, Afah Lisin, and Tai-ni
Tsou. The former two are native speakers and the later two as
ethnographers are also familiar with the academic Amis writing system.
Pektiong Tan (zh-min-nan:pektiong) will help those people to get
familiar with the wikipedia system.
We would prefer to set the default interface language to be Mandarin
with traditional character used in Taiwan (zh-tw) since most of the
potential user of this wiktionary can read Mandarin with traditional
character.
Proposed domain name:
http://amis.wiktionary.org/
Language tag for Amis:
ISO 639-2: N.A.
ISO 639-3 (Draft): Ami
SIL: ALV
RFC-3066: i-ami
Sincerely,
Nakao Eki,
Afah Lisin,
Tai-ni Tsou,
Pektiong Tan
Some info about Taiwanese aborigine:
http://en.wikipedia.org/wiki/Taiwanese_aborigine
Stefan Jensen wrote:
>Good morning Sabine and everybody,
>Great to see more people joining into the communication. I am not a frequent user of the Wicki discussion tools which Gerhard proposes to use for this but I'll try to move things over to there. (And I see Bernard is already using it).
>
Dear Stefan,
really nice to "cybermeet" ;o) you and the others in this Wiki/EEA ring...
The merit or the blame - you'll see later... - is to be put on that
German Vesuvio ;o) who's definitely SabiNet...
We recently got in touch in a list of discussion - LANGIT - I quitted
very fast, for some reasons, after launching a project, RL - RAPTVS
LINGVÆ, which she was the only replier to.
As soon as Sabine has got some time to explain to me where and how to
effectively start to work in/for Wiki world with the support of EEA
(even if at present I'm already doing/studying really several things,
and would need paid work, I'll be glad to play actually my part, also
owing to its being a good reference on the whole and it will let me learn.
My mother tongue is Italian, which I studied also at the university;
moreover I worked as a professional proofreader and editor for ten years
and am an author.
As for Chinese at least - I can't assure you nothing - but I'm in high
touch with some Chinese linguists (two women) who know English and are
interested in Italian too. They have a sort of translation agency on the
net and I created and moderate an Itlaian subforum on their Chinese
website.
Enough & maybe too much?, for the time being...
ALBatro.
Carme diem,
ALBatro.
__________________________________________________________
Alberto L. Beretta, FIRCA
(Freelance Italian Relations & Communications Assistant).
Italian proofreading - editing - composition - voiceovers - translations
escorting - research - other.
(www.carmediem.it, under refurbishment until 27 Aug.) - info(a)carmediem.it.
Email a.l.beretta(a)virgilio.it, fax on request,
(+39) 02 57501442, home office/study (+39) 338 3524079 (mobile),
Via Pavese 137, 20089 Rozzano (Milan), Italy (EU).
Hi,
I'm not able to create a new account in the Italian page.
Though I used the same "user name" into your English page
and for registering in Wikipedia, then on the Italian page I always
get an "error". Even if I use different "user name" the site doesn't
accept it !
Since I'm an Italian people I only should work on the Italian page,
so please help me replying to:
_feeling(a)tin.it
Hi,
Congratulations !
BTW, this should be sent to the Wiktionary list.
I updated the portal and the statistics.
Subject: [Wikipedia-l] Wiktionary www page
Date: Wednesday 18 May 2005 07:20
From: "James R. Johnson"
To: wikipedia-l(a)wikimedia.org
Hello,
I just wanted to inform the list that the Anglo-Saxon wiktionary
has over 100 entries!
Later,
James
--
http://www.non-violence.org/ | Site collaboratif sur la non-violence
http://www.forget-me.net/ | Alternatives sur le Net
http://fr.wikipedia.org/ | Encyclopédie libre
http://www.forget-me.net/pro/ | Formations et services Linux
I think that GNU FDL is perfectly fine for Ultimate Wiktionary and there
is no need to change the license. The license is perfectly compatible
with the .DICT format, so there should be no problems at all.
Thinking about how to import data from wiktionary in the ultimate
wiktionary may pose a few puzzles with respect to FDL compliance, but I
don't see any significant problems. The import script should keep track
of who contributed to a chunk of data and take note of that fact. The
history may be a little more problematic, and I think we will want to
get advice on exactly how to do it.
But changing the license to something else would require throwing away
all existing work in wiktionary, which seems quite unwise to me.
--Jimbo
Milos Rancic wrote:
>(To correct myself...)
>
>I think that there is no need to avoid GNU FDL terms. Just put
>gzipped/bzip2ed history inside of package and in all of articles
>put reference (that contribution history can be found in that file).
>
>
Hoi,
Apparantly you do not understand that there are several issues that make
the GNU-FDL not practical.
*Wiktionary data cannot be imported properly into Ultimate Wiktionary.
UW has no room for gzipped or bzipped history. It is a server side
database and nobody is going to see the information in this way.
*Ultimate Wiktionary will import data from many Wiktionaries, the first
one could be the nl:wiktionary. Many articles have been copied to and
from the it:wiktionary. Suppose an article is shared, it arrived first
from the nl:wikipedia so that one rules.. right ? Now what history
should we have with the article ?? From a GNU-FDL point of view it is
unforseen, crazy.
*When we keep all these histories, who can say it is "my" work? I
contributed to it ??
*When we export content to the .dict or RFC 2229 format, this is a
subset of the data that we have on a word, a concept. We have the UW
history and all these Wiktionary histories. Histories for each word.
Histories for possibly a file with a few fields like: "Word"
"Description" "Translation" "Original source". The amount of bagage that
we should carry according to the GNU-FDL is unforseen and crazy. It just
does not make sense. It is also data that has no stucture. Who will ever
look at it ??
My conclusion is that the current GNU-FDL does not funtion for atomic
information like we will have in Ultimate Wiktionary. When it prevents
the implementation of new use for the data that we have, it becomes a
hindrance. The goal of the Wikimedia Foundation is to make Free
information available. When a license like the GNU-FDL only allows for
server side information that has a static structure, I am sure that even
Richard Stallman will find the arguments to ammend the GNU-FDL compelling.
One crucial thing in all this is that Free information should stay Free
and be accessible. The current Wiktionary data is as closed as any
proprietary datacollection. This is because of its lack of structure. It
cannot be used for anything but server side information. Ultimate
Wiktionary intends to combine the strength of the information that we
have in all our wiktionaries, it will be structured. It does allow
accessibility and new innovative uses. By being Free, accessible and
innovative, we will gain a much wider public, these will not only be
users of our data but also providers of data. This is what we aim for.
In the current nl:wiktionary we have people and organisations like
FrankC and www.ziekenhuis.nl who contributed big time to the content of
Wiktionary. We do need to recognise their contributions. They donated
important body of works but we also have people like MARCEL and S.V.E.T
who added content on a regular basis, it is important that we recognise
their hard work and their contributions. They make and made it the
success it is. So if anything, we should find a way to honour the
members of our Wiktionary community as we move forward to an Ultimate
Wiktionary.
Thanks,
GerardM
>>>> I guess "old-school" is probably a good term. I have very little
involvement on the technical side of things. I do find it a chore to insert
a picture or a table, but I figure it out when I have to. When templates
appear in an article that I am editing, I need to make extra effort just to
track where some of them come from or what they mean. If I, as a person who
has been here for over three years, am having trouble with this, it must be
worse for a non-technical person who just wants to indulge his love of
words.
I too am alarmed at the proposal, as much as I have seen it. I am
technical(well, used to be) 28 years in the computer industry, starting at
the most technical levels of system programming, but over the years
migrating to the role of helping general business users get the best out of
systems, as business analyst, system designer, project manager.
One thing I learned was - never let a technician design a system. It will be
great for the technician's personal uses, but a huge chance it will be
useless to the general users.
Like Ec I find even the present level of codification annoyingly
complicated. Requiring any significant level of codification from general
users would just knock out 99%of the population form being contributors,
asituation we cannot go towards.
>From what little I have learned so far of this idea, including the complete
lack of communication with the user community, the idea of introducing a new
more technical Ultimate Wiktionary sounds like a disaster waiting to happen.
Richardb
rb_wiktionary(a)boult.mailshell.com
This mail message is not intended to receive a reply to the address used to
send it. The address used for sending it will not accept replies.
The content may reveal an address you can use.
>>>Are there objections from old-school wiktionarians to the ultimate
wiktionary plans? Is it possible that we could address these objections
through code?
I've been an active administrator on Wiktionary for some months now. But
only recently subscribed to the mailing list - only just noticing that such
a thing exists.
So this discussion of the Ultimate Wiktionary comes as a complete surprise
to me.
Which is even more alarming as somewhere in Wiki-.org land there is clearly
a policy that all changes etc must be notified and approved through the web
site. The mailing list is only an ancillary means of communication.
So I would suggest whoever is managing the idea of change needs to lift
their game in keeping the active Wiktionary community informed.
Richardb
rb_wiktionary(a)boult.mailshell.com
This mail message is not intended to receive a reply to the address used to
send it. The address used for sending it will not accept replies.
The content may reveal an address you can use.
>>>Are there objections from old-school wiktionarians to the ultimate
wiktionary plans? Is it possible that we could address these objections
through code?
I've been an active administrator on Wiktionary for some months now. But
only recently subscribed to the mailing list - only just noticing that such
a thing exists.
So this discussion of the Ultimate Wiktionary comes as a complete surprise
to me.
Which is even more alarming as somewhere in Wiki-.org land there is clearly
a policy that all changes etc must be notified and approved through the web
site. The mailing list is only an ancillary means of communication.
So I would suggest whoever is managing the idea of change needs to lift
their game in keeping the active Wiktionary community informed.
Richardb
rb_wiktionary(a)boult.mailshell.com
This mail message is not intended to receive a reply to the address used to
send it. The address used for sending it will not accept replies.
The content may reveal an address you can use.