I thank you for your answer. I thank you for your clarifications. 44262 lexemes are a
limited output. That is why I asked if we can merge public domain ontologies in Wikidata
to let the process more efficient. In fact, if we integrate the public domain ontology for
Portuguese into LexData, we can have more than 40000 Portuguese lexemes added into
Wikidata. Then, portuguese users can work to enrich the input.
Houcemeddine Turki (he/him)
Medical Student, Faculty of Medicine of Sfax, University of Sfax, Tunisia
Undergraduate Researcher, UR12SP36
GLAM and Education Coordinator, Wikimedia TN User Group
Member, WikiResearch Tunisia
Member, Wiki Project Med
Member, WikiIndaba Steering Committee
Member, Wikimedia and Library User Group Steering Committee
Co-Founder, WikiLingua Maghreb
-------- Message d'origine --------
De : Nicolas VIGNERON <vigneron.nicolas(a)gmail.com>
Date : 2019/04/14 22:37 (GMT+01:00)
À : Discussion list for the Wikidata project <wikidata(a)lists.wikimedia.org>
Objet : Re: [Wikidata] Wikidata and Portuguese wordnets
Currently, there are nearly 8000 lexemes in Wikidata.
There is, right now, 44262 lexemes entity (query: http://tinyurl.com/y692kszu
42 in Portuguese (query: http://tinyurl.com/y2f3636e
) and in a total of 321 languages (it
already is one of the biggest database if you count the number of languages).
In total, there is 120 391 forms for these 44k+ lexemes (query:http://tinyurl.com/y5xudhwg
far from the biggest databases but already an impressive number as almost everything has
been done by hand and in less than a year!).
If there is not a person who can work to change this
situation, it will take years to let LexData represent the main ten languages including
There is already a lot of person working on Lexemes in Wikidata (I'm actually working
But true, there is also a lot to do!
You're welcome to join us, the project page is here: