Dear Ms. de Paiva,

Thank you for your answer. This is very interesting. The choice of wordnets to integrate into Wikidata Lexicographical Data does not matter. The most important point is that we should begin to enrich LexData. It would be excellent if you adopted this project as your own. Currently, there are nearly 8000 lexemes in Wikidata. If there is not a person who can work to change this situation, it will take years to let LexData represent the main ten languages including Portuguese.

Yours sincerely,
Houcemeddine Turki (he/him)
Medical Student, Faculty of Medicine of Sfax, University of Sfax, Tunisia
Undergraduate Researcher, UR12SP36
GLAM and Education Coordinator, Wikimedia TN User Group
Member, WikiResearch Tunisia
Member, Wiki Project Med
Member, WikiIndaba Steering Committee
Member, Wikimedia and Library User Group Steering Committee
Co-Founder, WikiLingua Maghreb
Founder, TunSci
____________________
+21629499418
-------- Original message --------
From: Valeria de Paiva <valeria.depaiva@gmail.com>
Date: 2019/04/14 16:21 (GMT+01:00)
To: wikidata@lists.wikimedia.org
Subject: [Wikidata] Wikidata and Portuguese wordnets
Hi,

I would love to see more lexical mappings between Wikidata and multiple open-source wordnets, but care is required. In particular, for Portuguese, I think anyone attempting this mapping should first check https://www.researchgate.net/publication/298045179_An_overview_of_Portuguese...

Not surprisingly, I prefer my own version of the Portuguese wordnet, http://openwordnet-pt.org, but so do Bond's Open Multilingual Wordnet (http://compling.hss.ntu.edu.sg/omw/summx.html), BabelNet, and Google Translate (http://translate.google.com/about/intl/en_ALL/license.html): ours is the representative of the open Portuguese wordnets used by these projects.

Best,
Valeria

On 4/13/19 15:06, Houcemeddine A. Turki wrote:
Dear all,

Thank you for your efforts. Thanks to my discussion with Dr. David Abian, I came across a public domain ontology for Portuguese. This ontology is available at http://ontopt.dei.uc.pt/. Currently, Lexicographical Data does not support Portuguese. I would like to ask whether someone can build a bot to automatically integrate this ontology into Wikidata. Onto.PT can be downloaded from http://ontopt.dei.uc.pt/index.php?sec=download_ontopt. You can verify that it is a public domain ontology at https://www.researchgate.net/publication/259935306_OntoPT_Recent_development...

Yours sincerely,
Houcemeddine Turki
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata
-- Valeria de Paiva http://vcvpaiva.github.io/ http://www.cs.bham.ac.uk/~vdp/
> Currently, there are nearly 8000 lexemes in Wikidata.
There are, right now, 44,262 lexeme entities (query: http://tinyurl.com/y692kszu), including 42 in Portuguese (query: http://tinyurl.com/y2f3636e), across a total of 321 languages (it is already one of the biggest databases if you count the number of languages). In total, there are 120,391 forms for these 44k+ lexemes (query: http://tinyurl.com/y5xudhwg). That is far from the biggest databases, but already an impressive number, as almost everything has been done by hand and in less than a year!
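For readers who want to reproduce counts like these, here is a minimal sketch of how such queries might be written against the Wikidata Query Service. It is not the content of the shortened links above. The helper names are hypothetical, and Q5146 is assumed to be the Wikidata item for the Portuguese language. The block only builds the query strings and a request URL, so it runs without network access.

```python
from urllib.parse import urlencode

# Public SPARQL endpoint of the Wikidata Query Service.
WDQS_ENDPOINT = "https://query.wikidata.org/sparql"

# Prefixes written out explicitly so the queries are self-describing.
PREFIXES = (
    "PREFIX ontolex: <http://www.w3.org/ns/lemon/ontolex#>\n"
    "PREFIX dct: <http://purl.org/dc/terms/>\n"
    "PREFIX wd: <http://www.wikidata.org/entity/>\n"
)


def count_lexemes_query(language_qid=None):
    """Build a query counting lexemes, optionally limited to one language item."""
    language_filter = (
        f"\n  ?lexeme dct:language wd:{language_qid} ." if language_qid else ""
    )
    return (
        PREFIXES
        + "SELECT (COUNT(?lexeme) AS ?count) WHERE {\n"
        + "  ?lexeme a ontolex:LexicalEntry ."
        + language_filter
        + "\n}"
    )


def count_forms_query():
    """Build a query counting all forms attached to lexemes."""
    return (
        PREFIXES
        + "SELECT (COUNT(?form) AS ?count) WHERE {\n"
        + "  ?lexeme ontolex:lexicalForm ?form .\n"
        + "}"
    )


def request_url(query):
    """Return a GET URL asking the endpoint for JSON results."""
    return WDQS_ENDPOINT + "?" + urlencode({"query": query, "format": "json"})


if __name__ == "__main__":
    # Assumption: Q5146 is the item for Portuguese.
    print(count_lexemes_query("Q5146"))
    print(request_url(count_forms_query()))
```

The same text can be pasted into the query service web interface; current results will differ from the figures quoted above, since the lexeme count keeps changing.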
> If there is not a person who can work to change this situation, it will take years to let LexData represent the main ten languages including Portuguese.
There are already a lot of people working on Lexemes in Wikidata (I'm actually working on Breton). But true, there is also a lot to do! You're welcome to join us; the project page is here: https://www.wikidata.org/wiki/Wikidata:Lexicographical_data
Cheers, ~nicolas