Hi all,
The new Wikidata Toolkit project has started this week, so here is a brief update. You are invited to participate and to comment. Early users will enjoy a special first-level support: if you have an interesting use-case for our project, we will help you get started and prioritize your requirements.
Wikidata Toolkit is going to be a Java library to work with Wikidata (and Wikibase in general). It will allow you to load, query, and analyse the data, and to export it to a variety of formats.
Key facts:
* Homepage: https://www.mediawiki.org/wiki/Wikidata_Toolkit * Code: https://github.com/Wikidata/Wikidata-Toolkit * Planned milestones: https://github.com/Wikidata/Wikidata-Toolkit/issues/milestones * Project plan (high level): https://meta.wikimedia.org/wiki/Grants:IEG/Wikidata_Toolkit
Our current team consists of four people: Fredo Erxleben, Julian Mendez, Michael Günther, and myself. Development is supported by the Wikimedia Foundation and by the German Research Foundation. The project is initially planned for six months.
You are invited to contribute code/feature requests/use cases. If you have a concrete task that you would like to solve, please let us know, and we will see what we can do. Our first release 0.1 should be able to create data exports in RDF and maybe in other formats from the internal Wikidata dump files. If you need Wikidata in other formats, please let us know.
Our later releases will be able to load data and add query capabilities that allow you to analyse the data. Again, your requirements are welcome.
Cheers,
Markus
Hello Markus,
Glad to see the wikidata toolkit project has kicked off.
As a previous user of wikidata and pywikipedia(knowing its shortcomings), i fully support the project.
I would like to get involved and contribute to the toolkit. I have forked on github and i am currently setting up, working my way around. Would ping back to u if i get stuck.
Hoping for a positive reply.
--Chinmay
On Wed, Feb 19, 2014 at 7:15 PM, Markus Kroetzsch < markus.kroetzsch@tu-dresden.de> wrote:
Hi all,
The new Wikidata Toolkit project has started this week, so here is a brief update. You are invited to participate and to comment. Early users will enjoy a special first-level support: if you have an interesting use-case for our project, we will help you get started and prioritize your requirements.
Wikidata Toolkit is going to be a Java library to work with Wikidata (and Wikibase in general). It will allow you to load, query, and analyse the data, and to export it to a variety of formats.
Key facts:
- Homepage: https://www.mediawiki.org/wiki/Wikidata_Toolkit
- Code: https://github.com/Wikidata/Wikidata-Toolkit
- Planned milestones: https://github.com/Wikidata/Wikidata-Toolkit/issues/
milestones
- Project plan (high level): https://meta.wikimedia.org/
wiki/Grants:IEG/Wikidata_Toolkit
Our current team consists of four people: Fredo Erxleben, Julian Mendez, Michael Günther, and myself. Development is supported by the Wikimedia Foundation and by the German Research Foundation. The project is initially planned for six months.
You are invited to contribute code/feature requests/use cases. If you have a concrete task that you would like to solve, please let us know, and we will see what we can do. Our first release 0.1 should be able to create data exports in RDF and maybe in other formats from the internal Wikidata dump files. If you need Wikidata in other formats, please let us know.
Our later releases will be able to load data and add query capabilities that allow you to analyse the data. Again, your requirements are welcome.
Cheers,
Markus
-- Markus Kroetzsch Faculty of Computer Science Technische Universität Dresden +49 351 463 38486 http://korrekt.org/
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l
Hi Chinmay,
On 21/02/14 11:24, Chinmay Naik wrote:
Hello Markus,
Glad to see the wikidata toolkit project has kicked off.
As a previous user of wikidata and pywikipedia(knowing its shortcomings), i fully support the project.
I would like to get involved and contribute to the toolkit. I have forked on github and i am currently setting up, working my way around. Would ping back to u if i get stuck.
Hoping for a positive reply.
Great to hear that. Github makes it very easy to contribute (fork, modify, send pull request, and have us merge your code). So technically, you are all set.
The more important question is what you want to work on. It is probably best to look into functionality that is of interest to you. The things we are currently working on are the assigned issues in our issue tracker. Mainly, Fredo and I are working to get dump file download and parsing done ASAP, Julian is working on basic data structures that should help with indexing later on, and Michael is looking at implementing initial MediaWiki API bindings.
There are many open questions regarding the MW API implementation, so maybe this could be a good place to contribute if you are already experienced with that. Initially, we only want to support read access, but write access would also be nice at some point. We probably will organize the API code on a per-module basis for low-level access, and then provide some higher level wrappers for more convenient access. Due to the modular nature of the API, this work is easy to split.
Other open issues that are not worked on yet are related to serialisation in useful dump format (RDF, "official" JSON, MediaWiki dump XML [useful to dump part of the data in a file that can be re-imported into another MediaWiki]).
But maybe you have other ideas too.
Cheers,
Markus
--Chinmay
On Wed, Feb 19, 2014 at 7:15 PM, Markus Kroetzsch <markus.kroetzsch@tu-dresden.de mailto:markus.kroetzsch@tu-dresden.de> wrote:
Hi all, The new Wikidata Toolkit project has started this week, so here is a brief update. You are invited to participate and to comment. Early users will enjoy a special first-level support: if you have an interesting use-case for our project, we will help you get started and prioritize your requirements. Wikidata Toolkit is going to be a Java library to work with Wikidata (and Wikibase in general). It will allow you to load, query, and analyse the data, and to export it to a variety of formats. Key facts: * Homepage: https://www.mediawiki.org/__wiki/Wikidata_Toolkit <https://www.mediawiki.org/wiki/Wikidata_Toolkit> * Code: https://github.com/Wikidata/__Wikidata-Toolkit <https://github.com/Wikidata/Wikidata-Toolkit> * Planned milestones: https://github.com/Wikidata/__Wikidata-Toolkit/issues/__milestones <https://github.com/Wikidata/Wikidata-Toolkit/issues/milestones> * Project plan (high level): https://meta.wikimedia.org/__wiki/Grants:IEG/Wikidata___Toolkit <https://meta.wikimedia.org/wiki/Grants:IEG/Wikidata_Toolkit> Our current team consists of four people: Fredo Erxleben, Julian Mendez, Michael Günther, and myself. Development is supported by the Wikimedia Foundation and by the German Research Foundation. The project is initially planned for six months. You are invited to contribute code/feature requests/use cases. If you have a concrete task that you would like to solve, please let us know, and we will see what we can do. Our first release 0.1 should be able to create data exports in RDF and maybe in other formats from the internal Wikidata dump files. If you need Wikidata in other formats, please let us know. Our later releases will be able to load data and add query capabilities that allow you to analyse the data. Again, your requirements are welcome. Cheers, Markus -- Markus Kroetzsch Faculty of Computer Science Technische Universität Dresden +49 351 463 38486 http://korrekt.org/ _________________________________________________ Wikidata-l mailing list Wikidata-l@lists.wikimedia.org <mailto:Wikidata-l@lists.wikimedia.org> https://lists.wikimedia.org/__mailman/listinfo/wikidata-l <https://lists.wikimedia.org/mailman/listinfo/wikidata-l>
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l
Hello Markus, Fredo, Julian, Michael and Lydia,
I'm writing to follow up on the following, Markus -
"You are invited to contribute code/feature requests/use cases. If you have a concrete task that you would like to solve, please let us know, and we will see what we can do " -
vis-a-vis WUaS.
I'll email you off list about this (but I don't have all your email addresses).
Hoping, too, for a positive reply.
best, Scott
On Fri, Feb 21, 2014 at 5:22 AM, Markus Krötzsch < markus@semantic-mediawiki.org> wrote:
Hi Chinmay,
On 21/02/14 11:24, Chinmay Naik wrote:
Hello Markus,
Glad to see the wikidata toolkit project has kicked off.
As a previous user of wikidata and pywikipedia(knowing its shortcomings), i fully support the project.
I would like to get involved and contribute to the toolkit. I have forked on github and i am currently setting up, working my way around. Would ping back to u if i get stuck.
Hoping for a positive reply.
Great to hear that. Github makes it very easy to contribute (fork, modify, send pull request, and have us merge your code). So technically, you are all set.
The more important question is what you want to work on. It is probably best to look into functionality that is of interest to you. The things we are currently working on are the assigned issues in our issue tracker. Mainly, Fredo and I are working to get dump file download and parsing done ASAP, Julian is working on basic data structures that should help with indexing later on, and Michael is looking at implementing initial MediaWiki API bindings.
There are many open questions regarding the MW API implementation, so maybe this could be a good place to contribute if you are already experienced with that. Initially, we only want to support read access, but write access would also be nice at some point. We probably will organize the API code on a per-module basis for low-level access, and then provide some higher level wrappers for more convenient access. Due to the modular nature of the API, this work is easy to split.
Other open issues that are not worked on yet are related to serialisation in useful dump format (RDF, "official" JSON, MediaWiki dump XML [useful to dump part of the data in a file that can be re-imported into another MediaWiki]).
But maybe you have other ideas too.
Cheers,
Markus
--Chinmay
On Wed, Feb 19, 2014 at 7:15 PM, Markus Kroetzsch <markus.kroetzsch@tu-dresden.de mailto:markus.kroetzsch@tu-dresden.de>
wrote:
Hi all, The new Wikidata Toolkit project has started this week, so here is a brief update. You are invited to participate and to comment. Early users will enjoy a special first-level support: if you have an interesting use-case for our project, we will help you get started and prioritize your requirements. Wikidata Toolkit is going to be a Java library to work with Wikidata (and Wikibase in general). It will allow you to load, query, and analyse the data, and to export it to a variety of formats. Key facts: * Homepage: https://www.mediawiki.org/__wiki/Wikidata_Toolkit <https://www.mediawiki.org/wiki/Wikidata_Toolkit> * Code: https://github.com/Wikidata/__Wikidata-Toolkit <https://github.com/Wikidata/Wikidata-Toolkit> * Planned milestones: https://github.com/Wikidata/__Wikidata-Toolkit/issues/__milestones <https://github.com/Wikidata/Wikidata-Toolkit/issues/milestones> * Project plan (high level): https://meta.wikimedia.org/__wiki/Grants:IEG/Wikidata___Toolkit <https://meta.wikimedia.org/wiki/Grants:IEG/Wikidata_Toolkit> Our current team consists of four people: Fredo Erxleben, Julian Mendez, Michael Günther, and myself. Development is supported by the Wikimedia Foundation and by the German Research Foundation. The project is initially planned for six months. You are invited to contribute code/feature requests/use cases. If you have a concrete task that you would like to solve, please let us know, and we will see what we can do. Our first release 0.1 should be able to create data exports in RDF and maybe in other formats from the internal Wikidata dump files. If you need Wikidata in other formats, please let us know. Our later releases will be able to load data and add query capabilities that allow you to analyse the data. Again, your requirements are welcome. Cheers, Markus -- Markus Kroetzsch Faculty of Computer Science Technische Universität Dresden +49 351 463 38486 http://korrekt.org/ _________________________________________________ Wikidata-l mailing list Wikidata-l@lists.wikimedia.org <mailto:Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/__mailman/listinfo/wikidata-l <https://lists.wikimedia.org/mailman/listinfo/wikidata-l>
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l