A dataset of similar size is available on Belgian enterprises. With the same objective of enriching Wikidata with enterprise data, I recently proposed the following property: https://www.wikidata.org/wiki/Wikidata:Property_proposal/NACE_code

However, after some talks with others in the Wikidata community, I have recently had second thoughts about whether a full dump of this type of dataset is a valuable enrichment of Wikidata. Adding 2 million items with additional statements per item would be quite an enlargement of Wikidata. If we bot-added all businesses of both Belgium and Germany, we would have 4 million new items, which would currently account for 10% of all of Wikidata. I am not sure what this would mean in terms of scalability.

Maybe a use-case-driven approach would be more appropriate here. We could think of a bot that draws on the trade registers of the different countries whenever a specific use case calls for the inclusion of trade data.

Just my 2cts

Cheers, 

Andra

On Mon, Oct 16, 2017 at 9:48 AM, Sebastian Hellmann <hellmann@informatik.uni-leipzig.de> wrote:

Thanks, done.

https://www.wikidata.org/wiki/Wikidata:Project_chat#Handelsregister


On 15.10.2017 22:10, Yaroslav Blanter wrote:
Hi Sebastian,

I would say the best way is to file a request for bot permissions

https://www.wikidata.org/wiki/Wikidata:Requests_for_permissions/Bot

and possibly leave a message on the Project Chat

https://www.wikidata.org/wiki/Wikidata:Project_chat

Cheers
Yaroslav

On Sun, Oct 15, 2017 at 9:44 AM, Sebastian Hellmann <hellmann@informatik.uni-leipzig.de> wrote:

Hi all,

the German business registry contains roughly 2.2 million organisations. Some information is paid, but other information is public, i.e. the information you get by searching at the following address and clicking on UT (see example below):

https://www.handelsregister.de/rp_web/mask.do?Typ=e


I would like to add this to Wikidata, either by crawling or by raising money to use crowdsourcing platforms like CrowdFlower or Amazon Mechanical Turk.


It should meet notability criteria 2: https://www.wikidata.org/wiki/Wikidata:Notability

2. It refers to an instance of a clearly identifiable conceptual or material entity. The entity must be notable, in the sense that it can be described using serious and publicly available references. If there is no item about you yet, you are probably not notable.


The reference is the official German business registry, which is serious and public. Organisations are also by definition clearly identifiable legal entities.

How can I get clearance to proceed on this?

All the best,
Sebastian



Entity data


Saxony District court Leipzig HRB 32853 – A&A Dienstleistungsgesellschaft mbH
Legal status: Gesellschaft mit beschränkter Haftung  
Capital: 25.000,00 EUR
Date of entry: 29/08/2016
(When entering date of entry, wrong data input can occur due to system failures!)
Date of removal: -
Balance sheet available: -
Address (subject to correction): A&A Dienstleistungsgesellschaft mbH
Prager Straße 38-40
04317 Leipzig
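
To illustrate what a bot would have to do with a record like the one above, here is a minimal Python sketch that turns the plain-text entry into structured fields. It is not an existing tool: the names RegisterEntry, parse_entry, parse_amount and parse_date are made up for this example, and it assumes the layout shown above (a header line with an en dash before the company name, then "key: value" lines, German number formatting for the capital, and "-" for empty fields).

```python
from dataclasses import dataclass
from datetime import date, datetime
from decimal import Decimal
from typing import Optional

# Sample text copied from the entry above (address lines left out).
RECORD = """\
Saxony District court Leipzig HRB 32853 – A&A Dienstleistungsgesellschaft mbH
Legal status: Gesellschaft mit beschränkter Haftung
Capital: 25.000,00 EUR
Date of entry: 29/08/2016
Date of removal: -
"""


@dataclass
class RegisterEntry:
    """Hypothetical container for one register record."""
    register_id: str
    name: str
    legal_status: str
    capital_eur: Optional[Decimal]
    date_of_entry: Optional[date]
    date_of_removal: Optional[date]


def parse_amount(text: str) -> Optional[Decimal]:
    """Convert '25.000,00 EUR' (German formatting) to a Decimal."""
    if text == "-":
        return None
    number = text.replace("EUR", "").strip()
    return Decimal(number.replace(".", "").replace(",", "."))


def parse_date(text: str) -> Optional[date]:
    """Convert 'DD/MM/YYYY' to a date; '-' means the field is empty."""
    if text == "-":
        return None
    return datetime.strptime(text, "%d/%m/%Y").date()


def parse_entry(record: str) -> RegisterEntry:
    lines = [line.strip() for line in record.splitlines() if line.strip()]
    # Assumed header layout: "<court and register number> – <company name>"
    register_id, name = lines[0].split(" – ", 1)
    fields = {}
    for line in lines[1:]:
        key, sep, value = line.partition(":")
        if sep:
            fields[key.strip()] = value.strip()
    return RegisterEntry(
        register_id=register_id.strip(),
        name=name.strip(),
        legal_status=fields.get("Legal status", ""),
        capital_eur=parse_amount(fields.get("Capital", "-")),
        date_of_entry=parse_date(fields.get("Date of entry", "-")),
        date_of_removal=parse_date(fields.get("Date of removal", "-")),
    )


entry = parse_entry(RECORD)
print(entry)
```

Mapping these fields to Wikidata properties and attaching the registry as a reference would be a separate step that is not covered here.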


--
All the best,
Sebastian Hellmann

Director of Knowledge Integration and Linked Data Technologies (KILT) Competence Center
at the Institute for Applied Informatics (InfAI) at Leipzig University
Executive Director of the DBpedia Association
Projects: http://dbpedia.org, http://nlp2rdf.org, http://linguistics.okfn.org, https://www.w3.org/community/ld4lt
Homepage: http://aksw.org/SebastianHellmann
Research Group: http://aksw.org

_______________________________________________
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata



