There is an equal size of data on Belgian enterprises available. with the same objective to enrich wikidata with enterprise data I recently proposed the following property: https://www.wikidata.org/wiki/Wikidata:Property_proposal/NACE_code
However, after some talks with others in the Wikidata community, I recently have some second thoughts on whether or not a full dump of these type of datasets are valuable enrichments of Wikidata. Adding 2 million items with additional statement per item would be quite an enlargement of Wikidata. If we would bot add all business of both Belgium and Germany, we would have 4 million of new items, which currently would count for 10% of all of Wikidata. I am not sure what this would mean in term scalability and if it would cause any scalability issues.
Maybe a use-case driven approach here would be more appropriate. We could think of a bot that would source both the trade registers of the different countries when a specific use case would vouch for the inclusion of trade data.
Just my 2cts
Cheers,
Andra
On Mon, Oct 16, 2017 at 9:48 AM, Sebastian Hellmann < hellmann@informatik.uni-leipzig.de> wrote:
Thanks, done.
https://www.wikidata.org/wiki/Wikidata:Project_chat#Handelsregister
On 15.10.2017 22:10, Yaroslav Blanter wrote:
Hi Sebastian,
I would say the best way is to file a request for the permissions for the bot
https://www.wikidata.org/wiki/Wikidata:Requests_for_permissions/Bot
and possibly leave a message on the Project Chat
https://www.wikidata.org/wiki/Wikidata:Project_chat
Cheers Yaroslav
On Sun, Oct 15, 2017 at 9:44 AM, Sebastian Hellmann < hellmann@informatik.uni-leipzig.de> wrote:
Hi all,
the German business registry contains roughly 2.2 million organisations. Some information is paid, but other is public, i.e. the info you are searching for at and clicking on UT (see example below):
https://www.handelsregister.de/rp_web/mask.do?Typ=e
I would like to add this to Wikidata, either by crawling or by raising money to use crowdsourcing concepts like crowdflour or amazon turk.
It should meet notability criteria 2: https://www.wikidata.org/wiki/ Wikidata:Notability
- It refers to an instance of a *clearly identifiable conceptual or
material entity*. The entity must be notable, in the sense that it *can be described using serious and publicly available references*. If there is no item about you yet, you are probably not notable.
The reference is the official German business registry, which is serious and public. Orgs are also per definition clearly identifiable legal entities.
How can I get clearance to proceed on this?
All the best, Sebastian
Entity data Saxony District court *Leipzig HRB 32853 *– A&A Dienstleistungsgesellschaft mbH Legal status: Gesellschaft mit beschränkter Haftung Capital: 25.000,00 EUR Date of entry: 29/08/2016 (When entering date of entry, wrong data input can occur due to system failures!) Date of removal: - Balance sheet available: - Address (subject to correction): A&A Dienstleistungsgesellschaft mbH Prager Straße 38-40 04317 Leipzig
-- All the best, Sebastian Hellmann
Director of Knowledge Integration and Linked Data Technologies (KILT) Competence Center at the Institute for Applied Informatics (InfAI) at Leipzig University Executive Director of the DBpedia Association Projects: http://dbpedia.org, http://nlp2rdf.org, http://linguistics.okfn.org, https://www.w3.org/community/ld4lt http://www.w3.org/community/ld4lt Homepage: http://aksw.org/SebastianHellmann Research Group: http://aksw.org
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Wikidata mailing listWikidata@lists.wikimedia.orghttps://lists.wikimedia.org/mailman/listinfo/wikidata
-- All the best, Sebastian Hellmann
Director of Knowledge Integration and Linked Data Technologies (KILT) Competence Center at the Institute for Applied Informatics (InfAI) at Leipzig University Executive Director of the DBpedia Association Projects: http://dbpedia.org, http://nlp2rdf.org, http://linguistics.okfn.org, https://www.w3.org/community/ld4lt http://www.w3.org/community/ld4lt Homepage: http://aksw.org/SebastianHellmann Research Group: http://aksw.org
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata