Hello all,
As you may know, ORES is a tool analyzing edits to detect vandalism,
providing a score per edit. You can see the result on Recent Changes, you
can also let us know when you find something wrong
<https://www.wikidata.org/wiki/Wikidata:ORES/Report_mistakes/2018>.
But do you know that you can also directly help ORES to improve? We just
launched a new labeling campaign
<https://labels.wmflabs.org/ui/wikidatawiki/>: after authorizing your
account with OAuth, you will see some real edits, and you will be asked if
you find them damaging or not, good faith or bad faith. Completing a set
will take you around 10 minutes.
The last time we run this campaign was in 2015. Since then, the way of
editing Wikidata changed, some vandalism patterns as well (for example,
there are more vandalism on companies). So, if you're familiar with the
Wikidata rules and you would be willing to give a bit of time to help
fighting against vandalism, please participate
<https://labels.wmflabs.org/ui/wikidatawiki/> :)
If you encounter any problem or have question about the tool, feel free to
contact Ladsgroup <https://www.wikidata.org/wiki/User:Ladsgroup>.
Cheers,
--
Léa Lacroix
Project Manager Community Communication for Wikidata
Wikimedia Deutschland e.V.
Tempelhofer Ufer 23-24
10963 Berlin
www.wikimedia.de
Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.
Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg unter
der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das Finanzamt für
Körperschaften I Berlin, Steuernummer 27/029/42207.
Hi,
I have a couple of questions regarding the Wiki Page ID. Does it always
stay unique for the page, where the page itself is just a placeholder for
any kind of information that might change over time?
Consider the following cases:
1. The first time someone creates page "Moon" it is assigned ID=1. If at
some point the page is renamed to "The_Moon", the ID=1 remains intact. Is
this correct?
2. What if we have page "Moon" with ID=1. Someone creates a second-page
"The_Moon" with ID=2. Is it possible that page "Moon" is transformed into a
redirect? Then, "Moon" would be redirecting to page "The_Moon"?
3. Is it possible for page "Moon" to become a category "Category:Moon" with
the same ID=1?
Thanks,
Gintas
Hello everyone,
I'd like to ask if Wikidata could please offer a HDT [1] dump along with the already available Turtle dump [2]. HDT is a binary format to store RDF data, which is pretty useful because it can be queried from command line, it can be used as a Jena/Fuseki source, and it also uses orders-of-magnitude less space to store the same data. The problem is that it's very impractical to generate a HDT, because the current implementation requires a lot of RAM processing to convert a file. For Wikidata it will probably require a machine with 100-200GB of RAM. This is unfeasible for me because I don't have such a machine, but if you guys have one to share, I can help setup the rdf2hdt software required to convert Wikidata Turtle to HDT.
Thank you.
[1] http://www.rdfhdt.org/
[2] https://dumps.wikimedia.org/wikidatawiki/entities/
Dear Wikidata community,
We're working on a project called Wikibabel to machine-translate parts of
Wikipedia into underserved languages, starting with Swahili.
In hopes that some of our ideas can be helpful to machine translation
projects, we wrote a blogpost about how we prioritized which pages to
translate, and what categories need a human in the loop:
https://medium.com/@oirzak/wikibabel-equalizing-information-access-on-a-bud…
Rumor has it that the Wikidata community has thought deeply about
information access. We'd love your feedback on our work. Please let us know
about past / ongoing machine translation related projects so we can learn
from & collaborate with them.
Best regards,
Olya & the Wikibabel crew
Hello all,
As previously announced, the sixth birthday of Wikidata will happen around
October 29th, all around the world: local groups, communities, can organize
their own event around Wikidata. Meetup or workshop, talk or editathon, now
is the good time to start thinking about what you want to do.
Wikidata's sixth birthday
<https://www.wikidata.org/wiki/Wikidata:Sixth_Birthday> and its talk page
is the main place where you can find information about organization,
funding, and discuss with others.
As mentioned in the past, WMDE can help with providing advice on
organization and communication, and can send Wikidata swag to your group
before the event. If we want to get all of this ready for you before the
end of October, we need some buffer time. That's why I'd like to inform you
about two things:
1. A *call for all organizers* or people thinking about organizing a
birthday celebration, will take place on August 28th. This is the
opportunity to chat with me, but also people from other local groups, about
your ideas, your questions, your needs.
- Date: *August 28th*, 18:00 to 19:00 (UTC+2, Berlin time)
- Meeting link: Google Meet <https://meet.google.com/bxb-kgsa-tfu>
(yes, it's Google. If you're deeply unhappy about this, you can
write to me
and we'll find another solution)
2. The *deadline for requesting Wikidata swag is September 7th*. After
this date, we can't promise that we'll be able to prepare the swag for you.
- Before this date, you can write me an email, listing everything you
need. It can be items that we already have (T-shirts, hoodies, socks,
stickers, bags, notebooks, pens...) but also other things, if
you have nice
ideas on what to print.
- I'll reply to you shortly and we'll discuss about the best options
to get the swag: print it from our usual partners in Germany, print it
locally, etc. I'll support you during the whole process.
If you have any question, suggestion, feel free to write to me at any time.
I'm excited about the plans we're going to build together!
Cheers,
--
Léa Lacroix
Project Manager Community Communication for Wikidata
Wikimedia Deutschland e.V.
Tempelhofer Ufer 23-24
10963 Berlin
www.wikimedia.de
Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.
Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg unter
der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das Finanzamt für
Körperschaften I Berlin, Steuernummer 27/029/42207.
Hi!
Today we are indexing in ElasticSearch almost all string properties
(except a few) and select item properties (P31 and P279). We've been
asked to extend this set and index more item properties
(https://phabricator.wikimedia.org/T199884). We did not do it from the
start because we did not want to add too much data to the index at once,
and wanted to see how the index behaves. To evaluate what this change
would mean, some statistics:
All usage of item properties in statements is about 231 million uses
(according to sqid tool database). Of those, about 50M uses are
"instance of" which we are already indexing. Another 98M uses belong to
two properties - published in (P1433) and cites (P2860). Leaving about
86M for the rest of the properties.
So, if we index all the item properties except P2860 and P1433, we'll be
a little more than doubling the amount of data we're storing for this
field, which seems OK. But if we index those too, we'll be essentially
quadrupling it - which may be OK too, but is bigger jump and one that
may potentially cause some issues.
So, we have two questions:
1. Do we want to enable indexing for all item properties? Note that if
you just want to find items with certain statement values, Wikidata
Query Service matches this use case best. It's only in combination with
actual fulltext search where on-wiki search is better.
2. Do we need to index P2860 and P1433 at all, and if so, would it be ok
if we omit indexing for now?
Would be glad to hear thoughts on the matter.
Thanks,
--
Stas Malyshev
smalyshev(a)wikimedia.org
Hello all,
Thanks to Lucas who filled the necessary requirements, Wikidata now appears
in the LOD cloud graph: http://lod-cloud.net
Currently, the graph doesn't display all the actual connections of
Wikidata. The only connections that show up are the properties that link to
other projects or databases, and having a specific statement on them to
link to an RDF endpoint.
If you see something missing, you can contribute by adding the statement
“formatter URI for RDF resource” on properties where the resource supports
RDF (example <https://www.wikidata.org/wiki/Property:P214#P1921>).
You can learn more about the procedure to update the graph and a list of
the existing and missing datasets here
<https://www.wikidata.org/wiki/User:Lucas_Werkmeister_(WMDE)/LOD_Cloud>,
Thanks to Lucas and John for making this happening!
--
Léa Lacroix
Project Manager Community Communication for Wikidata
Wikimedia Deutschland e.V.
Tempelhofer Ufer 23-24
10963 Berlin
www.wikimedia.de
Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.
Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg unter
der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das Finanzamt für
Körperschaften I Berlin, Steuernummer 27/029/42207.
Hello esteemed team !
Someone noted on Property Talk about
https://www.wikidata.org/wiki/Property_talk:P1748
that an external id should be used ?
I added just a few codes to test what is happening, and the CODE on the
left does populate, like this one for Death
https://www.wikidata.org/wiki/Q1931388 it shows my newly added C28554 on
the left under External properties, but yeah, it is not a clickable link.
The Formatter URL does indeed work when populated with that C28554
code...so...
What could be the issue with the CODE not displaying with the Formatter URL
and being clickable ?
-Thad
*Hello all,starting this week the Technical Advice IRC Meeting will take
place not only every Wednesday at 3 pm UTC, but also every first Wednesday
of the month at 11 pm UTC. This will allow people from other timezones to
attend.The Technical Advice IRC Meeting (TAIM) is a weekly support event
for volunteer developers. Every Wednesday, two full-time developers are
available to help you with all your questions about Mediawiki, gadgets,
tools and more! This can be anything from "how to get started" over "who
would be the best contact for X" to specific questions on your project.The
support meeting has been initiated by WMDE’s tech team about a year ago and
will be continued as a cooperation between WMF and WMDE staff members from
August 1st on.Technical Advice IRC meeting this week: **Wednesday, August
1st at 3-4 pm UTC and 11-12 pm UTC** on #wikimedia-tech.If you know already
what you would like to discuss or ask, please add your topic to the next
meeting: https://www.mediawiki.org/wiki/Technical_Advice_IRC_Meeting
<https://www.mediawiki.org/wiki/Technical_Advice_IRC_Meeting>Hope to see
you there!Michi (for the Technical Advice IRC Meeting crew)*
--
Michael F. Schönitzer
Wikimedia Deutschland e.V. | Tempelhofer Ufer 23-24 | 10963 Berlin
Tel. (030) 219 158 26-0
http://wikimedia.de
Stellen Sie sich eine Welt vor, in der jeder Mensch an der Menge allen
Wissens frei teilhaben kann. Helfen Sie uns dabei!
http://spenden.wikimedia.de/
Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e.V.
Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg unter
der Nummer 23855 B. Als gemeinnützig anerkannt durch das Finanzamt für
Körperschaften I Berlin, Steuernummer 27/681/51985.