Hey everyone,
After a few hick-ups ordering, ranks and a table of content for item
pages are now live on wikidata.org together with a load of other small
improvements. I hope you enjoy them. There are still a few usability
issues with ordering as well as the problem of ordering changes not
showing up in diffs. If you find any other issues related to this
deployment that do not go away after reloading/purging(add
?action=purge to the URL) the page please let me know.
Cheers
Lydia
--
Lydia Pintscher - http://about.me/lydia.pintscher
Product Manager for Wikidata
Wikimedia Deutschland e.V.
Tempelhofer Ufer 23-24
10963 Berlin
www.wikimedia.de
Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.
Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg
unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das
Finanzamt für Körperschaften I Berlin, Steuernummer 27/681/51985.
(cross-posting Sebastiano’s post from the analytics list, this may be of interest to both the wikidata and wiki-research-l communities)
Begin forwarded message:
> From: Sebastiano Vigna <vigna(a)di.unimi.it>
> Subject: [Analytics] Distributing an official graph
> Date: December 9, 2013 at 10:09:31 PM PST
>
> [Reposted from private discussion after Dario's request]
>
> My problem is that of exploring the graph structure of Wikipedia
>
> 1) easily;
> 2) reproducibly;
> 3) in a way that does not depend on parsing artifacts.
>
> Presently, when people wants to do this they either do their own parsing of the dumps, or they use the SQL data, or they download a dataset like
>
> http://law.di.unimi.it/webdata/enwiki-2013/
>
> which has everything "cooked up".
>
> My frustration in the last few days was when trying to add the category links. I didn't realize (well, it's not very documented) that bliki extracts all links and render them in HTML *except* for the category links, that are instead accessible programmatically. Once I got there, I was able to make some progress.
>
> Nonetheless, I think that the graph of Wikipedia connections (hyperlinks and category links) is really a mine of information and it is a pity that a lot of huffing and puffing is necessary to do something as simple as a reverse visit of the category links from "People" to get, actually, all people pages (this is a bit more complicated--there are many false positives, but after a couple of fixes worked quite well).
>
> Moreover, one has continuously this feeling of walking on eggshells: a small change in bliki, a small change in the XML format and everything might stop working is such a subtle manner that you realize it only after a long time.
>
> I was wondering if Wikimedia would be interested in distributing in compressed form the Wikipedia graph. That would be the "official" Wikipedia graph--the benefits, in particular for people working on leveraging semantic information from Wikipedia, would be really significant.
>
> I would (obviously) propose to use our Java framework, WebGraph, which is actually quite standard in distributing large (well, actually much larger) graphs, such as ClueWeb09 http://lemurproject.org/clueweb09/, ClueWeb12 http://lemurproject.org/clueweb12/ and the recent Common Web Crawl http://webdatacommons.org/hyperlinkgraph/index.html. But any format is OK, even a pair of integers per line. The advantage of a binary compressed form is reduced network utilization, instantaneous availability of the information, etc.
>
> Probably it would be useful to actually distribute several graphs with the same dataset--e.g., the category links, the content link, etc. It is immediate, using WebGraph, to build a union (i.e., a superposition) of any set of such graphs and use it transparently as a single graph.
>
> In my mind the distributed graph should have a contiguous ID space, say, induced by the lexicographical order of the titles (possibly placing template pages at the start or at the end of the ID space). We should provide graphs, and a bidirectional node<->title map. All such information would use about 300M of space for the current English Wikipedia. People could then associate pages to nodes using the title as a key.
>
> But this last part is just rambling. :)
>
> Let me know if you people are interested. We can of course take care of the process of cooking up the information once it is out of the SQL database.
>
> Ciao,
>
> seba
>
>
> _______________________________________________
> Analytics mailing list
> Analytics(a)lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/analytics
All,
We've just commissioned the latest edition of the Linked Open Data (LOD)
Cloud cache that we maintain [1]. As part of this effort, we included
Wikidata's RDF dumps and Freebase cross references. Thus far, I am not
convinced that we've actually loaded all of data out there, so I am
seeking some help from those associated with the production of the
following:
1. Wikidata dumps -- basic
2. Wikidata dumps -- with cross references to Freebase.
To understand my concerns, here are some sample links:
[1] http://bit.ly/1hDhexC -- random Wikidata entity description of one
"Stephen Fry"
[2] http://bit.ly/1aMwpvU -- another description of the same entity
(this time from the Freebase dump which cross references Wikidata).
Links:
[1] http://lod.openlinksw.com -- home page.
[2] http://lod.openlinksw.com/sparql -- SPARQL endpoint.
--
Regards,
Kingsley Idehen
Founder & CEO
OpenLink Software
Company Web: http://www.openlinksw.com
Personal Weblog: http://www.openlinksw.com/blog/~kidehen
Twitter Profile: https://twitter.com/kidehen
Google+ Profile: https://plus.google.com/+KingsleyIdehen/about
LinkedIn Profile: http://www.linkedin.com/in/kidehen
Dear WikidataLovers,
In the process of organizing, the 2014 edition of XML Prague, I thought it
might be interesting to cross bridges between Wikidata and XML Prague
As an organizers of the conference, I wanted to see the opportunities of
proposing interesting XML challenge to the XML communities via Wikidata
Also, we are also interetested in advanced use of XML from WikidataLovers,
that could lead to a presentation, a poster or an intervention in a hands
on session.
Prague is a lovely city and we will do our best to make it confortable. We
don't have much money (conference fees are very cheap) but we can invite
people and help them finding accomodation.
The conference will be held on 14-16th February 2014
Best regards,
Mohamed ZERGAOUI
on behalf of XML Prague Organizers
http://xmlprague.cz
--
Innovimax SARL
Consulting, Training & XML Development
9, impasse des Orteaux
75020 Paris
Tel : +33 9 52 475787
Fax : +33 1 4356 1746
http://www.innovimax.fr
RCS Paris 488.018.631
SARL au capital de 10.000 €
Hello all,
This week has been a bit quiet however this week does not disappoint. Today
(see email 2 hours ago) a few features were enabled on testwikidata for
testing, the Italian and Polish Wikipedia decided to include Wikidata in
their search results and we make another modification to the summaries in
that notable requests for permissions are now listed. Enjoy the summary
this week at:
https://meta.wikimedia.org/wiki/Wikidata/Status_updates/2013_12_06
John
Hey everyone!
We deployed a bunch of new code to test.wikidata.org for you to test.
Based on your feedback for the initial version of quantities I have
decided to not release them to wikidata.org next week. We have been
working on a good part of the feedback and most fixes are now also
live on test.wikidata.org. This means quantities are now scheduled to
come to Wikidata in the middle of January. I'm sorry for the delay but
your feedback was very valuable.
In addition to that we have rolled out a new batch of features to
test.wikidata.org for you to test and give feedback on.
The first one is ordering. You can now change the order of statements
and statement groups. There are still a few usability issues with it
that we need to solve but I want to get this out to you now.
The second one is ranks. Ranks are there to indicate one or more
statements are preferred or deprecated. This is especially useful for
things like indicating that someone was the major of a city many years
ago.
The third one is a table of contents that was developed by Bene*. This
should come in handy especially on large items.
The forth one is that the data type of a property is now also added to
the JSON and API output.
And last but not least we've worked on improving performance so things
run a bit faster for you.
These should go live on wikidata.org on Tuesday next week. So how's
that for a Christmas present, people? ;-)
Cheers
Lydia
--
Lydia Pintscher - http://about.me/lydia.pintscher
Product Manager for Wikidata
Wikimedia Deutschland e.V.
Tempelhofer Ufer 23-24
10963 Berlin
www.wikimedia.de
Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.
Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg
unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das
Finanzamt für Körperschaften I Berlin, Steuernummer 27/681/51985.
Hoi,
The Italian Wikipedia is the first project where people who use search will
find results added from Wikidata. As you may know, Wikidata knows has more
items with a label in a language than a Wikipedia has articles. With the
Wikidata based functionality people will gain several functionalities that
are new to them
- a link to Commons categories for a subject
- a link to Wikipedia articles in other languages
- a link to the Wikidata item
- visualisation care of the "Reasonator"
When there are multiple items found in the search request, disambiguation
will be provided based on the statements available on the items. Obviously
as more labels are available in a language for statements, the experience
will improve.
This is a really exciting new development and I want to thank Magnus and
Nemo for making it possible. I hope and expect that many Wikipedias will
follow the example of the Italian Wikipedia. Particularly the smaller
Wikipedias have much to gain from this new functionality.
Thanks,
GerardM
Dear all,
I am exploring options of interaction between Wikidata and scholarly
resources. While my focus here is on biodiversity research (and I'd be
very happy to hear about your activities along these lines), a more
concrete route for collaboration now becomes visible with PubChem, as
mentioned at
https://www.wikidata.org/wiki/Wikidata_talk:WikiProject_Chemistry#Collabora…
.
Cheers,
Daniel