Is there anyone that has considered how to import data from external
sources, especially those that do not have any prepared an
well-defined API?
A rather simple example from the website for Statistics Norway is an
article on a website like this
http://www.ssb.no/fobstud/
and a table like this
http://www.ssb.no/fobstud/tab-2002-11-21-02.html
In that example you must follow a link to a new page which you then
must monitor for changes. Inside that page you can use Xpath to to
extract a field, and then optionally use something like a regexp to
identify and split fields. As an alternate solution you might use XLT
to transform the whole page.
Anyhow, this can quite easily be formulated both as a parser function
and a tag function.
At the same site there is something called "Statistikkbanken"
(http://statbank.ssb.no/statistikkbanken/) where you can (must) log on
and then iterate through a sequence of pages.
Similar data as in the previous example can be found in
http://statbank.ssb.no/statistikkbanken/selectvarval/Define.asp?MainTable=F…
But it is very difficult to formulate a kind of click-sequence inside that page.
Any idea? Some kind of click-sequence recording?
Statistics Norway publish statistics about Norway for free reuse as
long as they are credited as appropriate.
http://www.ssb.no/english/help/
John
On Sun, Mar 18, 2012 at 4:33 PM, JFC Morfin <jefsey(a)jefsey.com> wrote:
> Dear Lydia,
>
> I joined this list from a comment on the Wikimedia France site, but I would
> like to know where I can find a road map, a charter or a page for the
> project you engage on behalf ow WMDE.
> Thank you !
> jfc
Hi!
The important things are all linked on
http://meta.wikimedia.org/wiki/Wikidata. If you want to get a quick
overview I suggest the timeline in the FAQ linked there. If you are
looking for a detailed technical description then the Technical
Proposal linked there is right for you. It has a more detailed
timeline as well.
We are working on publishing more there as soon as we can. Stuff
starts for real in 2 weeks \o/
Cheers
Lydia
--
Lydia Pintscher - http://about.me/lydia.pintscher
Community Communications for Wikidata
Wikimedia Deutschland e.V.
Eisenacher Straße 2
10777 Berlin
www.wikimedia.de
Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.
Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg
unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das
Finanzamt für Körperschaften I Berlin, Steuernummer 27/681/51985.
At 10:49 18/03/2012, =?ISO-8859-2?Q?Jan_Ku=E8era?= wrote:
>Sorry for biting, but life taught me WMF is a big black hole for
>money, ideas and latedy also for participation... not giving a shi* at
>all about its editors... I simply do not trust anything new since some
>time ago. May this project be an exception, since it actually is not
>run directly by WMF but by you guys in WMDE... it really is not
>surprising that WMF was not able to come up whit this project for
>about 7 years, even though there surely was demand...
Dear Lydia,
I joined this list from a comment on the Wikimedia France site, but I
would like to know where I can find a road map, a charter or a page
for the project you engage on behalf ow WMDE.
Thank you !
jfc
Hoi,
The WikiData project is of real interest to me. It is very much the second
coming of what is still a great idea. Providing an environment where in one
central environment data is maintained. In essence data can be expressed in
triples and with such a statement you get firmly into semantic web and into
language technology.
The original Wikidata was about "solving" the issue that Wiktionary is
doing the same thing over and over again. Consider, a word like "travel" is
linked on 37 wiktionaries and in essence they all say that it is an English
word, a verb and has a particular meaning with translations ... on a high
level all this data is the same.
When you look at a word like "Nederland" at OmegaWiki, you will agree that
Amsterdam is the capital of my country and this can be expressed as a
"triple" and all the two other elements in this triple can be translated.
Check out the word, check it out in other languages (like Arabic or Russian
or Dutch) and you will find the kind of functionality that will be a
challenge for this new project.
Data projects, software projects have one big problem. They typically do
not consider their use in other languages. They find it surprising that you
can not really add all this language stuff in at a later time.
Wikidata will be used by Wikipedia and Wikipedia is at this moment only
some 283 languages. Not one, not two and not fifty. There are over 7000
languages in the ISO-639-3...
<grin> I have been asked to be an advisor to this project and I accepted
</grin> An advisor advises and my advise is to make WikiData II a tools for
all our projects from the start. At OmegaWiki we learned a lot and we LOVE
to share our knowledge with you.
Thanks,
GerardM
http://en.wiktionary.org/wiki/travelhttp://www.omegawiki.org/Expression:Nederland
Hi
I'd also like to briefly introduce myself, although not actively working
on Wikidata or at Wikimedia Germany. I was in the board of the German
chapter the first years and active especially in German Wikipedia - my
main interest is topics in knowledge organization, library science,
computer science, which I both studied in Berlin. Now I live in
Göttingen (by the way that's were the first digital data transmission
over wire took place in 1833), working on research and development for
the GBV library union network. I hope to finish my PhD on foundations of
data structuring this summer [1]. I sometimes write in my weblog at
http://jakoblog.de/.
Cheers
Jakob
[1] http://aboutdata.org/bibliography
--
Verbundzentrale des GBV (VZG)
Digitale Bibliothek - Jakob Voß
Platz der Goettinger Sieben 1
37073 Goettingen - Germany
+49 (0)551 39-10242
http://www.gbv.de
jakob.voss(a)gbv.de
Hi! I'm Sumana Harihareswara and I coordinate volunteer software
development for Wikimedia projects. I imagine I'll mostly be lurking
here but I do want to say hi and thank WMDE for leading such an
important and exciting project!
--
Sumana Harihareswara
Volunteer Development Coordinator
Wikimedia Foundation
http://domainincite.com/ntia-says-icann-does-not-meet-the-requirements-for-…
This information will most probably affect the New gTLD Project and
open new possibilities to Google+ and to the emergence of the
Internet+. This may help the experimentation of ".wiki" and lead to
an opendata general approach to be tested (new architecture, users
cloud, naming plan, etc.).
jfc