Hello Wikimedians,
My name is Sophie and I am the Project and Communications Manager at
Wikimedia Community Ireland.
I am reaching out to draw your attention to an interesting Natural Language
Processing Project that uses Wikidata to generate content in different
languages, at DCU here in Ireland. We have been collaborating with Simon
Mille from the Adapt Centre <https://www.adaptcentre.ie/> recently and I
thought it might be good to make some connections with the wider Wiki
Community who are specifically interested in or involved with AI.
Please let me introduce the project below and if you would like to learn
more or connect with Simon I would be delighted to introduce you.
Kind regards,
Sophie Fitzpatrick
*Project description:* At DCU-NLG <https://dcu-nlg.github.io/>, one of our
main research topics is the automatic generation of text from structured
data. We work with structured repositories such as DBpedia and Wikidata (among
other resources), which contain millions of triples that can be used to
generate texts about targeted entities in a particular language. A lot of
techniques exist for generating text from triple sets, the most famous (and
probably best) one being prompting a GPT model. However, closed-source
models such as the GPT series have some important drawbacks: they are very
much resource-hungry, they are not easily controllable, and they do not
give researchers access to their code. At DCU-NLG, we develop open-source
systems that aim to address these issues in the domain of Natural Language
Generation. We build (i) generators based on Large Language Models (LLMs),
which can achieve very high-quality results but still require a large
amount if energy to work, (ii) fully rule-based systems, which are
extremely energy-efficient but struggle to get to the quality level of
LLMs, and (iii) hybrid systems, which aim at combining the strengths of
LLMs, rule-based systems and neural systems. We are also interested in the
real-world use of these systems, and are currently making a tool that could
help people write Wikipedia articles: we are designing an interface that,
given an entity and a language, returns small seed texts generated using
several techniques mentioned above, always using DBpedia or Wikidata
information to ensure the traceability of the source. People can then use
these seed texts as a starting point for editing a new Wikipedia page.
*Some resources:*
- RTE brainstorm article
<https://www.rte.ie/brainstorm/2023/1206/1420417-gaeilge-irish-translation-a…>
(by
the way it's funny how they use the word "translate" in their title,
knowing the time I spend talking about how NLG is not translation xD)
- Papers from our group about using GPT
<https://aclanthology.org/2023.mmnlg-1.9/> and a rule-based system
<https://aclanthology.org/2023.pandl-1.4/> for the generation of Irish
text from DBpedia.
- The GEM shared task <https://gem-benchmark.com/shared_task> about
generation from DBpedia and WIkidata, which I co-organise.
--
Sophie Fitzpatrick
*Project and Communications Manager*
[image: Wikimedia Community Ireland Bi-Lingual Logo]
Pobal Wikimedia na hÉireann | Wikimedia Community Ireland
https://wikimedia.ie/
Dear Languages list,
On the theme of rediscovered Medieval languages, a new book is available
that translates a fascinating and harrowing account of the victims of a
volcanic eruption in the Mediterranean Sea in 1444, who were saved by the
nuns of Ischia. The book is titled *The Medieval Map and the Mercy Mission.
A Complete Translation of the Voynich Manuscript Map*. ISBN:9781399954990
It is available as a print-on-demand book at *The Great British Bookshop*:
https://www.thegreatbritishbookshop.co.uk/products/the-medieval-map-and-the…
Alternatively available as an ebook at *Google Books*:
https://play.google.com/store/books/details?id=YaG9EAAAQBAJ&rdid=book-YaG9E…
Kindest regards,
Dr. Gerard Cheshire.
Research Associate.
University of Bristol.
Hi Folks,
We encourage each and everyone of you to create a program submission. You can submit an interactive workshop or panel, a lecture, a short lighting talk or a poster for our dedicated poster session. Submissions are catered to both onsite and online (live or pre-recorded) or a hybrid combination. We at the Core Organizing Team would like to see a program submission related to indigenous languages, and small language Wikimedia projects!
The theme for this year's Wikimania is Diversity, Collaboration, Future. Topics that strengthen collaboration on improving the Wikimedia Incubator, increased participation, content contribution on small language Wikimedia projects are topics we like to see this year. It is great way to see participants collaborate with each other. There is also a spacious Wikimania Expo space where interactive activities like tutorials and demonstrations can be performed.
For participants joining virtually, we also have allocated online hours for you too to allow you to participate & present.
Session submissions for Wikimania 2023 are open until 28 March.
Visit the following links for further info:
Wiki page: https://wikimania.wikimedia.org/wiki/2023:Program/Submissions
Diff post: https://diff.wikimedia.org/2023/02/28/be-part-of-the-wikimania-2023-program/
Program Submission Form: https://pretalx.com/wm2023/cfp
Kind regards,
Butch Bustria
Chair, Program Subcommittee
Event lead, ESEAP Wikimania 2023 Core Organizing Team
Sincerely yours, We live in an age where we are increasingly
interconnected with everything, yet there are still too many communication
problems, such as proverbs and idioms that could be made even more
eloquent. For example, the word "complinsult" that is a compliment and an
insult in the same sentence or paragraph. "Sidecars" Have Existed Since The
1940s But Wouldn't It Be Better To Call Them "Sidebike" Or "Sidecycle"?
Then If We Consider A Latin American Proverb, "The Fish Stinks After 3
Days", Then Wouldn't It Be Better To Say, "The Fish Already Stinks The
First Day"? Then If We Talk About Sexist Proverbs Like: "Behind A Great Man
There Is Always A Great Woman, Wouldn't It Be Better To Say Then:" Next To
Great People There Are Equally Great People "? Or Silver Tray ", It would
be better to say:" Served all on a Golden Plate or Tray ". Speech. As for
"Fortune Cookies", Maybe It Would Be Better To Call Them "Cookies For The
Future." And Then If You Say, "A Real Man Has Only One Word", Maybe It
Would Be Better To Say, "Real People Have A Word." Just Word. ”These are
simple suggestions, but please be sure to send them to
First and Second American Dictionary, First and Second British Dictionary,
First and Second Italian Dictionary, First and Second Ecuadorian and
Mexican Dictionary, First and Second Dictionary of Mandarin and Cantonese,
and First and Second Japanese Dictionary. To Finish Wikipedia IN All These
Languages. Again Yours sincerely to YOU.
Distinti Saluti A Voi , Viviamo IN UN ' Epoca IN Cui Si è Sempre Più
Interconnessi A Tutto, Eppure Esistono Ancora Troppi problemi Di
Comunicazione , Ad esempio proverbi E Modi Di Dire Che SI Potrebbero
Rendere Ancora PIù Eloquenti . Ad Esempio La Parola "Complinsulto" Cioè Un
Complimento Ed Un insulto Nella Stessa Frase O ParagrafO . I " Sidecar "
Esistono Dagli Anni '40 Però Non Sarebbe Meglio Chiamarli " Sidebike "
Oppure " Sidecycle " ? Poi Se Consideriamo Un Proverbio Dell' America
Latina , " Il Pesce Puzza Dopo 3 Giorni " , Allora Non sarebbe Meglio Dire
: " Il Pesce Puzza Già Il Primo GiornO " ? Poi Se parliamo Di Proverbi
Sessisti Come : " Dietro Un Grande uomo C' è Sempre Una Grande Donna , Non
Sarebbe Meglio Dire Allora : " Accanto Alle Grandi Persone Ci Sono Persone
Altrettanto Grandi " ? . Invece Che Dire : " Servito Tutto SU UN Piatto
O Vassoio D' Argento " , Sarebbe Meglio Dire : " Servito Tutto SU UN
Piatto O Vassoio DoratO " . E Cosa Dite Dell' Espressione " Vera Verità "
? Come : " Tagliare La Testa Dal Principio " Per Indicare Di Troncare
Subito Una Conversazione O Discorso . Per Quanto Riguarda I " Biscotti
Della Fortuna " , Forse Sarebbe Meglio Chiamarli " Biscotti Per Il Futuro "
. E Poi Se Si Dice : " UN Vero Uomo Ha Una SOla Parola " , Forse Sarebbe
Meglio Dire : " Le Vere Persone Hanno una Sola Parola " . Questi Sono Dei
Semplici Suggerimenti Però Di SIcuro LI Invierò Al
Primo e Secondo Dizionario Americano , Il Primo E Secondo Dizionario
Britannico , Il Primo E Secondo Dizionario Italiano , IL Primo e Secondo
Dizionario Equadoriano E Messicano , IL Primo e Secondo Dizionario Di
Mandarino E Cantonese E IL Primo E Secondo Dizionario Giapponese . Per
Finire Wikipedia IN Tutte Queste Lingue . Di Nuovo Distinti Saluti A VOI .
Atentamente, Vivimos en una época en la que estamos cada vez más
interconectados con todo, pero todavía hay demasiados problemas de
comunicación, como proverbios y modismos que podrían hacerse aún más
elocuentes. Por ejemplo, la palabra "cumplir" que es un cumplido y un
insulto en la misma oración o párrafo. Los "Sidecars" existen desde la
década de 1940, pero ¿no sería mejor llamarlos "Sidebike" o "Sidecycle"?
Entonces, si consideramos un proverbio latinoamericano, "El pescado apesta
después de 3 días", ¿no sería mejor decir: "El pescado ya apesta el primer
día"? Entonces si hablamos de proverbios sexistas como: "Detrás de un gran
hombre siempre hay una gran mujer, ¿no sería mejor decir entonces:" Junto a
grandes personas hay igualmente grandes personas "? O bandeja de plata",
sería Sería mejor decir: "Servido todo en un plato o bandeja de oro".
Discurso. En cuanto a las "Galletas de la fortuna", tal vez sería mejor
llamarlas "Galletas para el futuro". Y luego si dices: "Un hombre de verdad
Solo tiene una palabra", Tal vez sería mejor decir: "La gente real tiene
una palabra". Solo una palabra. ”Estas son sugerencias simples, pero me
aseguraré de enviarlas a
Primer y Segundo Diccionario Americano, Primer y Segundo Diccionario
Británico, Primer y Segundo Diccionario Italiano, Primer y Segundo
Diccionario Ecuatoriano y Mexicano, Primer y Segundo Diccionario de
Mandarín y Cantonés, y Primer y Segundo Diccionario Japonés. Para Terminar
Wikipedia EN Todos Estos Idiomas. De nuevo, atentamente para USTED.
此致,
我们生活在一个与万物联系越来越紧密的时代,但仍有太多的沟通问题,比如谚语和成语,可以变得更有说服力。例如,“complinsult”这个词在同一个句子或段落中既是恭维又是侮辱。
“Sidecars”自 1940
年代就已经存在,但称它们为“Sidebike”或“Sidecycle”不是更好吗?那么如果我们考虑一句拉丁美洲谚语,“鱼三天后臭”,那么说“鱼第一天就臭了”不是更好吗?那么如果我们谈论性别歧视的谚语,比如:“一个伟大的男人背后总有一个伟大的女人,那不是更好吗:”伟大的人旁边有同样伟大的人“?或者银盘”,它会最好说:“盛在金盘或托盘上”。演讲。至于“幸运饼干”,也许最好称它们为“未来的饼干”。然后如果你说,“一个真正的男人只有一个词”,也许最好说,“真实的人有一个词。”只是一个词。“这些是简单的建议,但我一定会发送给
第一和第二美国字典,第一和第二英国字典,第一和第二意大利字典,第一和第二厄瓜多尔和墨西哥字典,第一和第二普通话和粤语字典,第一和第二日本字典。用所有这些语言完成维基百科。再次真诚地向你致敬。
此致,
我們生活在一個與萬物聯繫越來越緊密的時代,但仍有太多的溝通問題,比如諺語和成語,可以變得更有說服力。例如,“complinsult”一詞在同一個句子或段落中既是恭維又是侮辱。
“Sidecars”自 1940
年代就已經存在,但稱它們為“Sidebike”或“Sidecycle”不是更好嗎?那麼如果我們考慮一句拉丁美洲諺語,“魚三天后臭”,那麼說“魚第一天就臭了”不是更好嗎?那麼如果我們談論性別歧視的諺語,比如:“一個偉大的男人背後總有一個偉大的女人,那不是更好嗎:”偉大的人旁邊有同樣偉大的人“?或者銀盤”,它會最好說:“盛在金盤或託盤上”。演講。至於“幸運餅乾”,也許最好稱它們為“未來的餅乾”。然後如果你說,“一個真正的男人只有一個詞”,也許最好說,“真實的人有一個詞。”只是一個詞。“這些是簡單的建議,但我一定會發送給
第一和第二美國字典,第一和第二英國字典,第一和第二意大利字典,第一和第二厄瓜多爾和墨西哥字典,第一和第二普通話和粵語字典,第一和第二日本字典。用所有這些語言完成維基百科。再次真誠地向你致敬。
敬具
私たちはあらゆるものとの相互接続がますます進んでいる時代に生きていますが、ことわざやイディオムなど、より雄弁に表現できるコミュニケーションの問題がまだ多すぎます。たとえば、「complinsult」という単語は、同じ文または段落で褒め言葉と侮辱を意味します。
「サイドカー」は 1940
年代から存在していましたが、「サイドバイク」または「サイドサイクル」と呼んだ方がよいのではないでしょうか?では、ラテンアメリカのことわざである「魚は
3 日後に臭くなる」を考えると、「魚は 1
日目から臭くなる」と言ったほうがよいのではないでしょうか。次に、性差別的なことわざについて話すと、「偉大な男の背後には常に偉大な女性がいます。それなら、「偉大な人々の隣には等しく偉大な人々がいます」と言う方が良いのではないでしょうか」または「銀のトレイ」、それは「フォーチュン
クッキー」については、「クッキー フォー ザ フューチャー」と呼んだ方がいいかもしれません。言葉は 1
つしかありません」、「実際の人は言葉を持っている」と言ったほうがよいかもしれません。言葉だけです。」これらは簡単な提案ですが、必ず送信します
第一・第二アメリカ辞典、第一・第二イギリス辞典、第一・第二イタリア語辞典、第一・第二エクアドル・メキシコ辞典、第一・第二北京語・広東語辞典、第一・第二日本語辞典。これらすべての言語でウィキペディアを完成させる。改めまして、あなたに心からの敬意を表します。
Subject: Can I have a Wikipedia article written about me?
Good day from Singapore,
Can I have a Wikipedia article written about me?
I am looking forward to your reply.
Thank you.
Regards,
Mr. Turritopsis Dohrnii Teo En Ming
Targeted Individual in Singapore
27 July 2022 Wed
Blogs:
https://tdtemcerts.blogspot.comhttps://tdtemcerts.wordpress.com
Tl;dr
European project for Language Equality // Invitation to fill in a survey
about Language Equality and Language Technology: https://t1p.de/5voc //
Bringing the perspective of the Wikimedia movement to the European level
Dear everyone,
Today I’m writing to you all to ask for your help with a project concerning
language technologies, the Wikimedia movement and under-resourced European
languages.
Wikimedia Deutschland is supporting an EC-funded project called “European
Language Equality” (https://european-language-equality.eu/). The vision
that this project is working towards is to achieve full digital language
equality for all European languages - including all 24 official languages,
but also minority and regional languages (
https://european-language-equality.eu/languages/) - by 2030.
In order to prepare for this ambitious goal a partner consortium of 52
organisations (language associations, universities, language technology
providers etc.) will be working on creating a roadmap for European
commission programs and activities in the next year.
This roadmap and those policies should also include the view of the
Wikimedia movement and our perspective, needs and challenges on so-called
language technologies. A brief explanation of language technologies can be
found here (
https://en.wikipedia.org/wiki/Language_and_Communication_Technologies#Today)
and here (https://en.wikipedia.org/wiki/Language_technology).
*Why am I writing this to you?*
I’m currently reaching out to people in the movement whose voices should be
heard in this process. This includes (but is not limited to) editors of
small language Wikipedias, editors of European language Wikipedias,
Wikidata editors, Wiktionary editors, editors of Lexemes, Lingua Libre
volunteers, edit-a-thon organizers etc. etc. - In short, everyone in our
movement who is interested in languages and keeping languages thriving. At
the Arctic Knot (the Wikimedia Language conference), I had the chance to
talk about this project (
https://commons.wikimedia.org/wiki/File:(ELE)_Presentation_at_Arctic_Knot.p…)
and the ways of participating in it.
*What am I asking of you exactly?*
One of the project leaders (Dublin City University) has created a survey
for users of language technology and their demands as well as predictions
of the future of European languages. With the answers of this survey, the
partners (one of them being Wikimedia Deutschland) will write a report on
our perspective on the future of European languages and language
technologies.
I hope to include as many people from the movement as possible in this
consultation process and invite you to fill in the survey. Also, if you
want to spread the survey throughout your community - please do so!
------ Filling in the survey will take about 20 minutes, the data
protection guidelines are stated at the beginning of the survey. The survey
will be open until September 10th. You can get to the survey here:
https://ec.europa.eu/eusurvey/runner/LTusers-consumers-wiki. ----
Thank you for reading this mail, please let me know if you have any
questions or remarks or further suggestions! If you don’t feel comfortable
filling in the survey but still want your perspective to be included, feel
free to write me an e-mail.
Cheers from Berlin,
Maria
--
Maria Heuschkel
Projektmanagerin
Softwareentwicklung
Wikimedia Deutschland e. V. | Tempelhofer Ufer 23-24 | 10963 Berlin
Tel. (030) 219 158 26-0
https://wikimedia.de
Unsere Vision ist eine Welt, in der alle Menschen am Wissen der Menschheit
teilhaben, es nutzen und mehren können. Helfen Sie uns dabei!
https://spenden.wikimedia.de
Wikimedia Deutschland — Gesellschaft zur Förderung Freien Wissens e. V.
Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg unter
der Nummer 23855 B. Als gemeinnützig anerkannt durch das Finanzamt für
Körperschaften I Berlin, Steuernummer 27/029/42207.