Hi Chinmay,Great to hear that. Github makes it very easy to contribute (fork, modify, send pull request, and have us merge your code). So technically, you are all set.
On 21/02/14 11:24, Chinmay Naik wrote:
Hello Markus,
Glad to see the wikidata toolkit project has kicked off.
As a previous user of wikidata and pywikipedia(knowing its
shortcomings), i fully support the project.
I would like to get involved and contribute to the toolkit. I have
forked on github and i am currently setting up, working my way around.
Would ping back to u if i get stuck.
Hoping for a positive reply.
The more important question is what you want to work on. It is probably best to look into functionality that is of interest to you. The things we are currently working on are the assigned issues in our issue tracker. Mainly, Fredo and I are working to get dump file download and parsing done ASAP, Julian is working on basic data structures that should help with indexing later on, and Michael is looking at implementing initial MediaWiki API bindings.
There are many open questions regarding the MW API implementation, so maybe this could be a good place to contribute if you are already experienced with that. Initially, we only want to support read access, but write access would also be nice at some point. We probably will organize the API code on a per-module basis for low-level access, and then provide some higher level wrappers for more convenient access. Due to the modular nature of the API, this work is easy to split.
Other open issues that are not worked on yet are related to serialisation in useful dump format (RDF, "official" JSON, MediaWiki dump XML [useful to dump part of the data in a file that can be re-imported into another MediaWiki]).
But maybe you have other ideas too.
Cheers,
Markus
<markus.kroetzsch@tu-dresden.de <mailto:markus.kroetzsch@tu-dresden.de>>
--Chinmay
On Wed, Feb 19, 2014 at 7:15 PM, Markus Kroetzsch* Homepage: https://www.mediawiki.org/__wiki/Wikidata_Toolkit
wrote:
Hi all,
The new Wikidata Toolkit project has started this week, so here is a
brief update. You are invited to participate and to comment. Early
users will enjoy a special first-level support: if you have an
interesting use-case for our project, we will help you get started
and prioritize your requirements.
Wikidata Toolkit is going to be a Java library to work with Wikidata
(and Wikibase in general). It will allow you to load, query, and
analyse the data, and to export it to a variety of formats.
Key facts:
<https://www.mediawiki.org/wiki/Wikidata_Toolkit>
* Code: https://github.com/Wikidata/__Wikidata-Toolkit https://github.com/Wikidata/__Wikidata-Toolkit/issues/__milestones https://meta.wikimedia.org/__wiki/Grants:IEG/Wikidata___Toolkit_________________________________________________
<https://meta.wikimedia.org/wiki/Grants:IEG/Wikidata_Toolkit>
Our current team consists of four people: Fredo Erxleben, Julian
Mendez, Michael Günther, and myself. Development is supported by the
Wikimedia Foundation and by the German Research Foundation. The
project is initially planned for six months.
You are invited to contribute code/feature requests/use cases. If
you have a concrete task that you would like to solve, please let us
know, and we will see what we can do. Our first release 0.1 should
be able to create data exports in RDF and maybe in other formats
from the internal Wikidata dump files. If you need Wikidata in other
formats, please let us know.
Our later releases will be able to load data and add query
capabilities that allow you to analyse the data. Again, your
requirements are welcome.
Cheers,
Markus
--
Markus Kroetzsch
Faculty of Computer Science
Technische Universität Dresden
+49 351 463 38486
http://korrekt.org/
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org <mailto:Wikidata-l@lists.wikimedia.org>
https://lists.wikimedia.org/__mailman/listinfo/wikidata-l
<https://lists.wikimedia.org/mailman/listinfo/wikidata-l>
_______________________________________________
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l
_______________________________________________
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l