Hi, it's first of July and I would like to introduce you a quarterly goal that the Engineering Community team has committed to:
Establish a framework to engage with data engineers and open data organizations https://phabricator.wikimedia.org/T101950
We are missing a community framework allowing Wikidata content and tech contributors, data engineers, and open data organizations to collaborate effectively. Imagine GLAM applied to data.
If all goes well, by the end of September we would like to have basic documentation and community processes for open data engineers and organizations willing to contribute to Wikidata, and ongoing projects with one open data org.
If you are interested, get involved! We are looking for
* Wikidata contributors with good institutional memory * people that has been in touch with organizations willing to contribute their open data * developers willing to help improving our software and programming missing pieces * also contributors familiar with the GLAM model(s), what works and what didn't work
This goal has been created after some conversations with Lydia Pintscher (Wikidata team) and Sylvia Ventura (Strategic Partnerships). Both are on board, Lydia assuring that this work fits into what is technically effective, and Sylvia checking our work against real open data organizations willing to get involved.
This email effectively starts the bootstrapping of this project. I will start creating subtasks under that goal based on your feedback and common sense.
Hello Quim,
We have in Belgium (as Wikimedia Belgium) a partner organisation who is together with us working with cultural institutions to get open datasets to be used in Wikidata.
So yes, we are interested.
Greetings, Romaine
2015-07-01 17:31 GMT+02:00 Quim Gil qgil@wikimedia.org:
Hi, it's first of July and I would like to introduce you a quarterly goal that the Engineering Community team has committed to:
Establish a framework to engage with data engineers and open data organizations https://phabricator.wikimedia.org/T101950
We are missing a community framework allowing Wikidata content and tech contributors, data engineers, and open data organizations to collaborate effectively. Imagine GLAM applied to data.
If all goes well, by the end of September we would like to have basic documentation and community processes for open data engineers and organizations willing to contribute to Wikidata, and ongoing projects with one open data org.
If you are interested, get involved! We are looking for
- Wikidata contributors with good institutional memory
- people that has been in touch with organizations willing to contribute
their open data
- developers willing to help improving our software and programming
missing pieces
- also contributors familiar with the GLAM model(s), what works and what
didn't work
This goal has been created after some conversations with Lydia Pintscher (Wikidata team) and Sylvia Ventura (Strategic Partnerships). Both are on board, Lydia assuring that this work fits into what is technically effective, and Sylvia checking our work against real open data organizations willing to get involved.
This email effectively starts the bootstrapping of this project. I will start creating subtasks under that goal based on your feedback and common sense.
-- Quim Gil Engineering Community Manager @ Wikimedia Foundation http://www.mediawiki.org/wiki/User:Qgil
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Hello Quim,
There was always the issue of where to publish datasets from partner organisations like a http://datahub.io/
Is that being considered in this new iteration?
Cheers, Micru
On Wed, Jul 1, 2015 at 6:19 PM, Romaine Wiki romaine.wiki@gmail.com wrote:
Hello Quim,
We have in Belgium (as Wikimedia Belgium) a partner organisation who is together with us working with cultural institutions to get open datasets to be used in Wikidata.
So yes, we are interested.
Greetings, Romaine
2015-07-01 17:31 GMT+02:00 Quim Gil qgil@wikimedia.org:
Hi, it's first of July and I would like to introduce you a quarterly goal that the Engineering Community team has committed to:
Establish a framework to engage with data engineers and open data organizations https://phabricator.wikimedia.org/T101950
We are missing a community framework allowing Wikidata content and tech contributors, data engineers, and open data organizations to collaborate effectively. Imagine GLAM applied to data.
If all goes well, by the end of September we would like to have basic documentation and community processes for open data engineers and organizations willing to contribute to Wikidata, and ongoing projects with one open data org.
If you are interested, get involved! We are looking for
- Wikidata contributors with good institutional memory
- people that has been in touch with organizations willing to contribute
their open data
- developers willing to help improving our software and programming
missing pieces
- also contributors familiar with the GLAM model(s), what works and what
didn't work
This goal has been created after some conversations with Lydia Pintscher (Wikidata team) and Sylvia Ventura (Strategic Partnerships). Both are on board, Lydia assuring that this work fits into what is technically effective, and Sylvia checking our work against real open data organizations willing to get involved.
This email effectively starts the bootstrapping of this project. I will start creating subtasks under that goal based on your feedback and common sense.
-- Quim Gil Engineering Community Manager @ Wikimedia Foundation http://www.mediawiki.org/wiki/User:Qgil
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Quim,
I'm not familiar with GLAM or what you are really asking for here. Could you elaborate a little? Our group is actively engaged in writing bots for populating wikidata with trusted biomedical information and for using that information to drive applications such as Wikipedia. Processes for making this easier would be most welcome. A lot of what we are doing and hoping to do is described on this bot page: https://www.wikidata.org/wiki/User:ProteinBoxBot
? -Ben
On Wed, Jul 1, 2015 at 9:23 AM, David Cuenca Tudela dacuetu@gmail.com wrote:
Hello Quim,
There was always the issue of where to publish datasets from partner organisations like a http://datahub.io/
Is that being considered in this new iteration?
Cheers, Micru
On Wed, Jul 1, 2015 at 6:19 PM, Romaine Wiki romaine.wiki@gmail.com wrote:
Hello Quim,
We have in Belgium (as Wikimedia Belgium) a partner organisation who is together with us working with cultural institutions to get open datasets to be used in Wikidata.
So yes, we are interested.
Greetings, Romaine
2015-07-01 17:31 GMT+02:00 Quim Gil qgil@wikimedia.org:
Hi, it's first of July and I would like to introduce you a quarterly goal that the Engineering Community team has committed to:
Establish a framework to engage with data engineers and open data organizations https://phabricator.wikimedia.org/T101950
We are missing a community framework allowing Wikidata content and tech contributors, data engineers, and open data organizations to collaborate effectively. Imagine GLAM applied to data.
If all goes well, by the end of September we would like to have basic documentation and community processes for open data engineers and organizations willing to contribute to Wikidata, and ongoing projects with one open data org.
If you are interested, get involved! We are looking for
- Wikidata contributors with good institutional memory
- people that has been in touch with organizations willing to contribute
their open data
- developers willing to help improving our software and programming
missing pieces
- also contributors familiar with the GLAM model(s), what works and what
didn't work
This goal has been created after some conversations with Lydia Pintscher (Wikidata team) and Sylvia Ventura (Strategic Partnerships). Both are on board, Lydia assuring that this work fits into what is technically effective, and Sylvia checking our work against real open data organizations willing to get involved.
This email effectively starts the bootstrapping of this project. I will start creating subtasks under that goal based on your feedback and common sense.
-- Quim Gil Engineering Community Manager @ Wikimedia Foundation http://www.mediawiki.org/wiki/User:Qgil
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
-- Etiamsi omnes, ego non
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Hi Quim, Sylvia, Lydia and Wikidatans,
In terms of strategic partnerships with CC Wikidata, and this Wikidata Engineering Community project looking for -
"* people that have been in touch with organizations willing to contribute their open data" -
in what ways could CC World University and School contribute our early CC Nation State Universities, e.g. Mexico World University and School - http://worlduniversity.wikia.com/wiki/Mexico - (with only the CC MIT OCW in Spanish so far, and not yet in Spanish otherwise)? See also, for example, the Nation States' wiki subject page at WUaS - http://worlduniversity.wikia.com/wiki/Nation_States. Each wiki Nation-State University at CC WUaS, seeks to accredit in each country in their main languags to offer online CC baccalaureate, Ph.D., Law and M.D. degrees, and also I.B. high school degrees, if possible, based on CC MIT OCW in 7 languages and CC Yale OYC, to begin.
In addition, CC World University would like to facilitate wiki / Wikidata schools for open teaching and learning in each of all 7,929+ languages. As planned interlingual wiki schools for highest quality universal education, CC World University and School would like to explore developing in Wikidata, and become a growth story also for Wikidata. World University would thus like to explore contributing our open data in these regards.
In what ways might WUaS best contribute in terms of a strategic partnerships re this Wikidata Engineering Community project ? Perhaps World University could become a part of the"community framework allowing Wikidata content and tech contributors, data engineers, and open data organizations to collaborate effectively." Thank you.
Best, Info (Scott)
On Wed, Jul 1, 2015 at 9:48 AM, Benjamin Good ben.mcgee.good@gmail.com wrote:
Quim,
I'm not familiar with GLAM or what you are really asking for here. Could you elaborate a little? Our group is actively engaged in writing bots for populating wikidata with trusted biomedical information and for using that information to drive applications such as Wikipedia. Processes for making this easier would be most welcome. A lot of what we are doing and hoping to do is described on this bot page: https://www.wikidata.org/wiki/User:ProteinBoxBot
? -Ben
On Wed, Jul 1, 2015 at 9:23 AM, David Cuenca Tudela dacuetu@gmail.com wrote:
Hello Quim,
There was always the issue of where to publish datasets from partner organisations like a http://datahub.io/
Is that being considered in this new iteration?
Cheers, Micru
On Wed, Jul 1, 2015 at 6:19 PM, Romaine Wiki romaine.wiki@gmail.com wrote:
Hello Quim,
We have in Belgium (as Wikimedia Belgium) a partner organisation who is together with us working with cultural institutions to get open datasets to be used in Wikidata.
So yes, we are interested.
Greetings, Romaine
2015-07-01 17:31 GMT+02:00 Quim Gil qgil@wikimedia.org:
Hi, it's first of July and I would like to introduce you a quarterly goal that the Engineering Community team has committed to:
Establish a framework to engage with data engineers and open data organizations https://phabricator.wikimedia.org/T101950
We are missing a community framework allowing Wikidata content and tech contributors, data engineers, and open data organizations to collaborate effectively. Imagine GLAM applied to data.
If all goes well, by the end of September we would like to have basic documentation and community processes for open data engineers and organizations willing to contribute to Wikidata, and ongoing projects with one open data org.
If you are interested, get involved! We are looking for
- Wikidata contributors with good institutional memory
- people that has been in touch with organizations willing to
contribute their open data
- developers willing to help improving our software and programming
missing pieces
- also contributors familiar with the GLAM model(s), what works and
what didn't work
This goal has been created after some conversations with Lydia Pintscher (Wikidata team) and Sylvia Ventura (Strategic Partnerships). Both are on board, Lydia assuring that this work fits into what is technically effective, and Sylvia checking our work against real open data organizations willing to get involved.
This email effectively starts the bootstrapping of this project. I will start creating subtasks under that goal based on your feedback and common sense.
-- Quim Gil Engineering Community Manager @ Wikimedia Foundation http://www.mediawiki.org/wiki/User:Qgil
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
-- Etiamsi omnes, ego non
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Thank you very much for this quick and very diverse wave of feedback. I'm trying to keep the description of https://phabricator.wikimedia.org/T101950 up to date, and you are welcome to edit too.
The deliverable of this goal is basically documentation, so the first question is where should that documentation live. Somewhere under https://www.wikidata.org/wiki/Wikidata: ? In Meta? In Outreach? Elsewhere? As soon as we have an answer we can start documenting on-wiki, business as usual.
We need a name for this "framework". It can be a working name used for practical reasons. I'm starting to use Wiki Loves Open Data / WLOD, but only in my brain (now it's the first time that I write it down). Please propose better alternatives if you have them.
For now we will keep the discussion here, but people willing to get involved as a contributor / stakeholder should get ready to follow the work in Phabricator. We will start with subtasks blocking T101950, jumping to an own project only if needed.
Some replies to feedback received:
* If you are involved or in touch with an open data organization, you are encouraged to add yourselves to "Organizations interested" at https://phabricator.wikimedia.org/T101950.
* Having real cases with real problems is the best way to focus on what really matters. There is a section "The problem" where we should list the problems that are blocking or making difficult fruitful contributions to Wikidata.
* Where to publish entire datasets... Something tells me that this is not the most urgent and important problem that we have, but the community definitely knows better, so correct me if I'm wrong. I think our main use case are organizations having certain types of data that would be great to have in Wikipedia via Wikidata. Whatever is the most urgent problem, we need to be very selective in the scope we want to address in this quarterly goal.
* The experience of maintainers of current projects like ProteinBoxBot is very very useful. Please do not hesitate getting into details, creating subtasks to address specific problems, etc.
* Scott, I have the impression that we need to define this framework a bit before projects like yours can jump in and find their ways for contributing. I don't have a good reply to you right now, and this goal is precisely about offering a framework useful to you, useful to Wikimedia.
Looking forward to your feedback and to start editing on a wiki page soon.
On Wed, Jul 1, 2015 at 8:09 PM, Info WorldUniversity < info@worlduniversityandschool.org> wrote:
Hi Quim, Sylvia, Lydia and Wikidatans,
In terms of strategic partnerships with CC Wikidata, and this Wikidata Engineering Community project looking for -
"* people that have been in touch with organizations willing to contribute their open data" -
in what ways could CC World University and School contribute our early CC Nation State Universities, e.g. Mexico World University and School - http://worlduniversity.wikia.com/wiki/Mexico - (with only the CC MIT OCW in Spanish so far, and not yet in Spanish otherwise)? See also, for example, the Nation States' wiki subject page at WUaS - http://worlduniversity.wikia.com/wiki/Nation_States. Each wiki Nation-State University at CC WUaS, seeks to accredit in each country in their main languags to offer online CC baccalaureate, Ph.D., Law and M.D. degrees, and also I.B. high school degrees, if possible, based on CC MIT OCW in 7 languages and CC Yale OYC, to begin.
In addition, CC World University would like to facilitate wiki / Wikidata schools for open teaching and learning in each of all 7,929+ languages. As planned interlingual wiki schools for highest quality universal education, CC World University and School would like to explore developing in Wikidata, and become a growth story also for Wikidata. World University would thus like to explore contributing our open data in these regards.
In what ways might WUaS best contribute in terms of a strategic partnerships re this Wikidata Engineering Community project ? Perhaps World University could become a part of the"community framework allowing Wikidata content and tech contributors, data engineers, and open data organizations to collaborate effectively." Thank you.
Best, Info (Scott)
On Wed, Jul 1, 2015 at 9:48 AM, Benjamin Good ben.mcgee.good@gmail.com wrote:
Quim,
I'm not familiar with GLAM or what you are really asking for here. Could you elaborate a little? Our group is actively engaged in writing bots for populating wikidata with trusted biomedical information and for using that information to drive applications such as Wikipedia. Processes for making this easier would be most welcome. A lot of what we are doing and hoping to do is described on this bot page: https://www.wikidata.org/wiki/User:ProteinBoxBot
? -Ben
On Wed, Jul 1, 2015 at 9:23 AM, David Cuenca Tudela dacuetu@gmail.com wrote:
Hello Quim,
There was always the issue of where to publish datasets from partner organisations like a http://datahub.io/
Is that being considered in this new iteration?
Cheers, Micru
On Wed, Jul 1, 2015 at 6:19 PM, Romaine Wiki romaine.wiki@gmail.com wrote:
Hello Quim,
We have in Belgium (as Wikimedia Belgium) a partner organisation who is together with us working with cultural institutions to get open datasets to be used in Wikidata.
So yes, we are interested.
Greetings, Romaine
2015-07-01 17:31 GMT+02:00 Quim Gil qgil@wikimedia.org:
Hi, it's first of July and I would like to introduce you a quarterly goal that the Engineering Community team has committed to:
Establish a framework to engage with data engineers and open data organizations https://phabricator.wikimedia.org/T101950
We are missing a community framework allowing Wikidata content and tech contributors, data engineers, and open data organizations to collaborate effectively. Imagine GLAM applied to data.
If all goes well, by the end of September we would like to have basic documentation and community processes for open data engineers and organizations willing to contribute to Wikidata, and ongoing projects with one open data org.
If you are interested, get involved! We are looking for
- Wikidata contributors with good institutional memory
- people that has been in touch with organizations willing to
contribute their open data
- developers willing to help improving our software and programming
missing pieces
- also contributors familiar with the GLAM model(s), what works and
what didn't work
This goal has been created after some conversations with Lydia Pintscher (Wikidata team) and Sylvia Ventura (Strategic Partnerships). Both are on board, Lydia assuring that this work fits into what is technically effective, and Sylvia checking our work against real open data organizations willing to get involved.
This email effectively starts the bootstrapping of this project. I will start creating subtasks under that goal based on your feedback and common sense.
-- Quim Gil Engineering Community Manager @ Wikimedia Foundation http://www.mediawiki.org/wiki/User:Qgil
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
-- Etiamsi omnes, ego non
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
--
Scott MacLeod - Founder & President
415 480 4577
PO Box 442, (86 Ridgecrest Road), Canyon, CA 94516
World University and School - like Wikipedia with best STEM-centric
OpenCourseWare - incorporated as a nonprofit university and school in California, and is a U.S. 501 (c) (3) tax-exempt educational organization, both effective April 2010.
World University and School is sending you this because of your interest in free, online, higher education. If you don't want to receive these, please reply with 'unsubscribe' in the body of the email, leaving the subject line intact. Thank you.
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Hi Quim, I think a name like "Linked Open Wikidata" would be better, since that is what I think you are talking about - linked open data with Wikidata items at the hub from which all other research questions should begin. Something like "Sum of all Data" in the sense that we already have a mini-framework set up for the "Wikidata:WikiProject sum of all paintings": https://www.wikidata.org/wiki/Wikidata:WikiProject_sum_of_all_paintings The Wikiverse is always overly ambitious, and Wikidata is no exception. It would be nice to have all oil paintings from the world's top museums and national collections in there, as well as all paintings from the world's greatest artists. Jane
On Wed, Jul 1, 2015 at 11:51 PM, Quim Gil qgil@wikimedia.org wrote:
Thank you very much for this quick and very diverse wave of feedback. I'm trying to keep the description of https://phabricator.wikimedia.org/T101950 up to date, and you are welcome to edit too.
The deliverable of this goal is basically documentation, so the first question is where should that documentation live. Somewhere under https://www.wikidata.org/wiki/Wikidata: ? In Meta? In Outreach? Elsewhere? As soon as we have an answer we can start documenting on-wiki, business as usual.
We need a name for this "framework". It can be a working name used for practical reasons. I'm starting to use Wiki Loves Open Data / WLOD, but only in my brain (now it's the first time that I write it down). Please propose better alternatives if you have them.
For now we will keep the discussion here, but people willing to get involved as a contributor / stakeholder should get ready to follow the work in Phabricator. We will start with subtasks blocking T101950, jumping to an own project only if needed.
Some replies to feedback received:
- If you are involved or in touch with an open data organization, you are
encouraged to add yourselves to "Organizations interested" at https://phabricator.wikimedia.org/T101950.
- Having real cases with real problems is the best way to focus on what
really matters. There is a section "The problem" where we should list the problems that are blocking or making difficult fruitful contributions to Wikidata.
- Where to publish entire datasets... Something tells me that this is not
the most urgent and important problem that we have, but the community definitely knows better, so correct me if I'm wrong. I think our main use case are organizations having certain types of data that would be great to have in Wikipedia via Wikidata. Whatever is the most urgent problem, we need to be very selective in the scope we want to address in this quarterly goal.
- The experience of maintainers of current projects like ProteinBoxBot is
very very useful. Please do not hesitate getting into details, creating subtasks to address specific problems, etc.
- Scott, I have the impression that we need to define this framework a bit
before projects like yours can jump in and find their ways for contributing. I don't have a good reply to you right now, and this goal is precisely about offering a framework useful to you, useful to Wikimedia.
Looking forward to your feedback and to start editing on a wiki page soon.
On Wed, Jul 1, 2015 at 8:09 PM, Info WorldUniversity < info@worlduniversityandschool.org> wrote:
Hi Quim, Sylvia, Lydia and Wikidatans,
In terms of strategic partnerships with CC Wikidata, and this Wikidata Engineering Community project looking for -
"* people that have been in touch with organizations willing to contribute their open data" -
in what ways could CC World University and School contribute our early CC Nation State Universities, e.g. Mexico World University and School - http://worlduniversity.wikia.com/wiki/Mexico - (with only the CC MIT OCW in Spanish so far, and not yet in Spanish otherwise)? See also, for example, the Nation States' wiki subject page at WUaS - http://worlduniversity.wikia.com/wiki/Nation_States. Each wiki Nation-State University at CC WUaS, seeks to accredit in each country in their main languags to offer online CC baccalaureate, Ph.D., Law and M.D. degrees, and also I.B. high school degrees, if possible, based on CC MIT OCW in 7 languages and CC Yale OYC, to begin.
In addition, CC World University would like to facilitate wiki / Wikidata schools for open teaching and learning in each of all 7,929+ languages. As planned interlingual wiki schools for highest quality universal education, CC World University and School would like to explore developing in Wikidata, and become a growth story also for Wikidata. World University would thus like to explore contributing our open data in these regards.
In what ways might WUaS best contribute in terms of a strategic partnerships re this Wikidata Engineering Community project ? Perhaps World University could become a part of the"community framework allowing Wikidata content and tech contributors, data engineers, and open data organizations to collaborate effectively." Thank you.
Best, Info (Scott)
On Wed, Jul 1, 2015 at 9:48 AM, Benjamin Good ben.mcgee.good@gmail.com wrote:
Quim,
I'm not familiar with GLAM or what you are really asking for here. Could you elaborate a little? Our group is actively engaged in writing bots for populating wikidata with trusted biomedical information and for using that information to drive applications such as Wikipedia. Processes for making this easier would be most welcome. A lot of what we are doing and hoping to do is described on this bot page: https://www.wikidata.org/wiki/User:ProteinBoxBot
? -Ben
On Wed, Jul 1, 2015 at 9:23 AM, David Cuenca Tudela dacuetu@gmail.com wrote:
Hello Quim,
There was always the issue of where to publish datasets from partner organisations like a http://datahub.io/
Is that being considered in this new iteration?
Cheers, Micru
On Wed, Jul 1, 2015 at 6:19 PM, Romaine Wiki romaine.wiki@gmail.com wrote:
Hello Quim,
We have in Belgium (as Wikimedia Belgium) a partner organisation who is together with us working with cultural institutions to get open datasets to be used in Wikidata.
So yes, we are interested.
Greetings, Romaine
2015-07-01 17:31 GMT+02:00 Quim Gil qgil@wikimedia.org:
Hi, it's first of July and I would like to introduce you a quarterly goal that the Engineering Community team has committed to:
Establish a framework to engage with data engineers and open data organizations https://phabricator.wikimedia.org/T101950
We are missing a community framework allowing Wikidata content and tech contributors, data engineers, and open data organizations to collaborate effectively. Imagine GLAM applied to data.
If all goes well, by the end of September we would like to have basic documentation and community processes for open data engineers and organizations willing to contribute to Wikidata, and ongoing projects with one open data org.
If you are interested, get involved! We are looking for
- Wikidata contributors with good institutional memory
- people that has been in touch with organizations willing to
contribute their open data
- developers willing to help improving our software and programming
missing pieces
- also contributors familiar with the GLAM model(s), what works and
what didn't work
This goal has been created after some conversations with Lydia Pintscher (Wikidata team) and Sylvia Ventura (Strategic Partnerships). Both are on board, Lydia assuring that this work fits into what is technically effective, and Sylvia checking our work against real open data organizations willing to get involved.
This email effectively starts the bootstrapping of this project. I will start creating subtasks under that goal based on your feedback and common sense.
-- Quim Gil Engineering Community Manager @ Wikimedia Foundation http://www.mediawiki.org/wiki/User:Qgil
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
-- Etiamsi omnes, ego non
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
--
Scott MacLeod - Founder & President
415 480 4577
PO Box 442, (86 Ridgecrest Road), Canyon, CA 94516
World University and School - like Wikipedia with best STEM-centric
OpenCourseWare - incorporated as a nonprofit university and school in California, and is a U.S. 501 (c) (3) tax-exempt educational organization, both effective April 2010.
World University and School is sending you this because of your interest in free, online, higher education. If you don't want to receive these, please reply with 'unsubscribe' in the body of the email, leaving the subject line intact. Thank you.
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
-- Quim Gil Engineering Community Manager @ Wikimedia Foundation http://www.mediawiki.org/wiki/User:Qgil
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
On Thu, Jul 2, 2015 at 10:47 AM, Jane Darnell jane023@gmail.com wrote:
I think a name like "Linked Open Wikidata" would be better, since that is what I think you are talking about - linked open data with Wikidata items at the hub from which all other research questions should begin. Something like "Sum of all Data" in the sense that we already have a mini-framework set up for the "Wikidata:WikiProject sum of all paintings": https://www.wikidata.org/wiki/Wikidata:WikiProject_sum_of_all_paintings
I think we are still talking about different things, so I will try to define what I believe is our primary use case here:
* Open data organization has a subset of interesting data that could be used to improve Wikimedia wikis after being added to Wikidata.
Use cases that we are NOT pursuing in this quarterly goal:
* Open data organization has a humongous amount of data to be injected entirely to Wikidata. * Wikidata contributors start compiling a directory with the Sum of all Data. * Wikidata/Commons contributors start building a Wikimedia version of http://datahub.io/
I also want to clarify the intended meaning of "Imagine GLAM https://outreach.wikimedia.org/wiki/GLAM applied to data." Over the years, GLAM has created documentation, campaigns, tools, success stories, a network of of volunteers and cultural organizations, and even some new jobs. Today, a Wikimedian living in a place with an interesting gallery/library/archive/museum (or someone working in any of these institutions) has a framework and a support network to learn how to establish a first contact and organize a first activity. We don't have an equivalent for open data. We can wait all this to be created organically as it happened with GLAM, but we can also help pushing the process reusing good lessons from GLAM and other initiatives.
The Wikiverse is always overly ambitious, and Wikidata is no exception. It would be nice to have all oil paintings from the world's top museums and national collections in there, as well as all paintings from the world's greatest artists. Jane
On Wed, Jul 1, 2015 at 11:51 PM, Quim Gil qgil@wikimedia.org wrote:
Thank you very much for this quick and very diverse wave of feedback. I'm trying to keep the description of https://phabricator.wikimedia.org/T101950 up to date, and you are welcome to edit too.
The deliverable of this goal is basically documentation, so the first question is where should that documentation live. Somewhere under https://www.wikidata.org/wiki/Wikidata: ? In Meta? In Outreach? Elsewhere? As soon as we have an answer we can start documenting on-wiki, business as usual.
We need a name for this "framework". It can be a working name used for practical reasons. I'm starting to use Wiki Loves Open Data / WLOD, but only in my brain (now it's the first time that I write it down). Please propose better alternatives if you have them.
For now we will keep the discussion here, but people willing to get involved as a contributor / stakeholder should get ready to follow the work in Phabricator. We will start with subtasks blocking T101950, jumping to an own project only if needed.
Some replies to feedback received:
- If you are involved or in touch with an open data organization, you are
encouraged to add yourselves to "Organizations interested" at https://phabricator.wikimedia.org/T101950.
- Having real cases with real problems is the best way to focus on what
really matters. There is a section "The problem" where we should list the problems that are blocking or making difficult fruitful contributions to Wikidata.
- Where to publish entire datasets... Something tells me that this is not
the most urgent and important problem that we have, but the community definitely knows better, so correct me if I'm wrong. I think our main use case are organizations having certain types of data that would be great to have in Wikipedia via Wikidata. Whatever is the most urgent problem, we need to be very selective in the scope we want to address in this quarterly goal.
- The experience of maintainers of current projects like ProteinBoxBot is
very very useful. Please do not hesitate getting into details, creating subtasks to address specific problems, etc.
- Scott, I have the impression that we need to define this framework a
bit before projects like yours can jump in and find their ways for contributing. I don't have a good reply to you right now, and this goal is precisely about offering a framework useful to you, useful to Wikimedia.
Looking forward to your feedback and to start editing on a wiki page soon.
On Wed, Jul 1, 2015 at 8:09 PM, Info WorldUniversity < info@worlduniversityandschool.org> wrote:
Hi Quim, Sylvia, Lydia and Wikidatans,
In terms of strategic partnerships with CC Wikidata, and this Wikidata Engineering Community project looking for -
"* people that have been in touch with organizations willing to contribute their open data" -
in what ways could CC World University and School contribute our early CC Nation State Universities, e.g. Mexico World University and School - http://worlduniversity.wikia.com/wiki/Mexico - (with only the CC MIT OCW in Spanish so far, and not yet in Spanish otherwise)? See also, for example, the Nation States' wiki subject page at WUaS - http://worlduniversity.wikia.com/wiki/Nation_States. Each wiki Nation-State University at CC WUaS, seeks to accredit in each country in their main languags to offer online CC baccalaureate, Ph.D., Law and M.D. degrees, and also I.B. high school degrees, if possible, based on CC MIT OCW in 7 languages and CC Yale OYC, to begin.
In addition, CC World University would like to facilitate wiki / Wikidata schools for open teaching and learning in each of all 7,929+ languages. As planned interlingual wiki schools for highest quality universal education, CC World University and School would like to explore developing in Wikidata, and become a growth story also for Wikidata. World University would thus like to explore contributing our open data in these regards.
In what ways might WUaS best contribute in terms of a strategic partnerships re this Wikidata Engineering Community project ? Perhaps World University could become a part of the"community framework allowing Wikidata content and tech contributors, data engineers, and open data organizations to collaborate effectively." Thank you.
Best, Info (Scott)
On Wed, Jul 1, 2015 at 9:48 AM, Benjamin Good ben.mcgee.good@gmail.com wrote:
Quim,
I'm not familiar with GLAM or what you are really asking for here. Could you elaborate a little? Our group is actively engaged in writing bots for populating wikidata with trusted biomedical information and for using that information to drive applications such as Wikipedia. Processes for making this easier would be most welcome. A lot of what we are doing and hoping to do is described on this bot page: https://www.wikidata.org/wiki/User:ProteinBoxBot
? -Ben
On Wed, Jul 1, 2015 at 9:23 AM, David Cuenca Tudela dacuetu@gmail.com wrote:
Hello Quim,
There was always the issue of where to publish datasets from partner organisations like a http://datahub.io/
Is that being considered in this new iteration?
Cheers, Micru
On Wed, Jul 1, 2015 at 6:19 PM, Romaine Wiki romaine.wiki@gmail.com wrote:
Hello Quim,
We have in Belgium (as Wikimedia Belgium) a partner organisation who is together with us working with cultural institutions to get open datasets to be used in Wikidata.
So yes, we are interested.
Greetings, Romaine
2015-07-01 17:31 GMT+02:00 Quim Gil qgil@wikimedia.org:
> Hi, it's first of July and I would like to introduce you a quarterly > goal that the Engineering Community team has committed to: > > Establish a framework to engage with data engineers and open data > organizations > https://phabricator.wikimedia.org/T101950 > > We are missing a community framework allowing Wikidata content and > tech contributors, data engineers, and open data organizations to > collaborate effectively. Imagine GLAM applied to data. > > If all goes well, by the end of September we would like to have > basic documentation and community processes for open data engineers and > organizations willing to contribute to Wikidata, and ongoing > projects with one open data org. > > If you are interested, get involved! We are looking for > > * Wikidata contributors with good institutional memory > * people that has been in touch with organizations willing to > contribute their open data > * developers willing to help improving our software and programming > missing pieces > * also contributors familiar with the GLAM model(s), what works and > what didn't work > > This goal has been created after some conversations with Lydia > Pintscher (Wikidata team) and Sylvia Ventura (Strategic Partnerships). Both > are on board, Lydia assuring that this work fits into what is technically > effective, and Sylvia checking our work against real open data > organizations willing to get involved. > > This email effectively starts the bootstrapping of this project. I > will start creating subtasks under that goal based on your feedback and > common sense. > > -- > Quim Gil > Engineering Community Manager @ Wikimedia Foundation > http://www.mediawiki.org/wiki/User:Qgil > > _______________________________________________ > Wikidata mailing list > Wikidata@lists.wikimedia.org > https://lists.wikimedia.org/mailman/listinfo/wikidata > >
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
-- Etiamsi omnes, ego non
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
--
Scott MacLeod - Founder & President
415 480 4577
PO Box 442, (86 Ridgecrest Road), Canyon, CA 94516
World University and School - like Wikipedia with best STEM-centric
OpenCourseWare - incorporated as a nonprofit university and school in California, and is a U.S. 501 (c) (3) tax-exempt educational organization, both effective April 2010.
World University and School is sending you this because of your interest in free, online, higher education. If you don't want to receive these, please reply with 'unsubscribe' in the body of the email, leaving the subject line intact. Thank you.
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
-- Quim Gil Engineering Community Manager @ Wikimedia Foundation http://www.mediawiki.org/wiki/User:Qgil
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Quim, Jane and Wikidatans,
Appreciating your focus on museums and paintings, Jane, re Markus's Wikipedias' mapping project (and a hypothetical Wikidata virtual earth for these and STEM research, building on Markus's mapping), where Museums' paintings and art in all 288 Wikipedia/Wikidata languages could similarly be beautifully mapped database-wise.
And appreciating your focus too, Jane, on open linked data re developing inter-lingual Wikidata with machine learning and artificial intelligence ahead. In these regards, and among about ~10 main CC World University foci/areas - http://scott-macleod.blogspot.com/2011/01/natural-platform-world-univ-sch-wo... - which could be used as a contributory organizing framework in the future, Quim, World University has a main Museums' wiki Subject and is planning to grow open museums in all 7,929 languages online - including in a virtual earth (with paintings much like virtual books "in the stacks" of digital libraries in a virtual earth).
Quim, since Wikia (e.g. WUaS is in the open Wikia wiki http://worlduniversity.wikia.com/wiki/Subjects) is also a Wikimedia project, - e.g. Jimmy Wales is the president of Wikia and Chairman of Wikimedia - I wonder concerning your focus this quarter with this email thread ...
* Open data organization has a subset of interesting data that could be used to improve Wikimedia wikis after being added to Wikidata. ...
whether these ~10 main World University foci/areas could become part of framework for the contributory data planning for what you have in mind.
Cheers, Scott
On Thu, Jul 2, 2015 at 6:37 AM, Quim Gil qgil@wikimedia.org wrote:
On Thu, Jul 2, 2015 at 10:47 AM, Jane Darnell jane023@gmail.com wrote:
I think a name like "Linked Open Wikidata" would be better, since that is what I think you are talking about - linked open data with Wikidata items at the hub from which all other research questions should begin. Something like "Sum of all Data" in the sense that we already have a mini-framework set up for the "Wikidata:WikiProject sum of all paintings": https://www.wikidata.org/wiki/Wikidata:WikiProject_sum_of_all_paintings
I think we are still talking about different things, so I will try to define what I believe is our primary use case here:
- Open data organization has a subset of interesting data that could be
used to improve Wikimedia wikis after being added to Wikidata.
Use cases that we are NOT pursuing in this quarterly goal:
- Open data organization has a humongous amount of data to be injected
entirely to Wikidata.
- Wikidata contributors start compiling a directory with the Sum of all
Data.
- Wikidata/Commons contributors start building a Wikimedia version of
I also want to clarify the intended meaning of "Imagine GLAM https://outreach.wikimedia.org/wiki/GLAM applied to data." Over the years, GLAM has created documentation, campaigns, tools, success stories, a network of of volunteers and cultural organizations, and even some new jobs. Today, a Wikimedian living in a place with an interesting gallery/library/archive/museum (or someone working in any of these institutions) has a framework and a support network to learn how to establish a first contact and organize a first activity. We don't have an equivalent for open data. We can wait all this to be created organically as it happened with GLAM, but we can also help pushing the process reusing good lessons from GLAM and other initiatives.
The Wikiverse is always overly ambitious, and Wikidata is no exception. It would be nice to have all oil paintings from the world's top museums and national collections in there, as well as all paintings from the world's greatest artists. Jane
On Wed, Jul 1, 2015 at 11:51 PM, Quim Gil qgil@wikimedia.org wrote:
Thank you very much for this quick and very diverse wave of feedback. I'm trying to keep the description of https://phabricator.wikimedia.org/T101950 up to date, and you are welcome to edit too.
The deliverable of this goal is basically documentation, so the first question is where should that documentation live. Somewhere under https://www.wikidata.org/wiki/Wikidata: ? In Meta? In Outreach? Elsewhere? As soon as we have an answer we can start documenting on-wiki, business as usual.
We need a name for this "framework". It can be a working name used for practical reasons. I'm starting to use Wiki Loves Open Data / WLOD, but only in my brain (now it's the first time that I write it down). Please propose better alternatives if you have them.
For now we will keep the discussion here, but people willing to get involved as a contributor / stakeholder should get ready to follow the work in Phabricator. We will start with subtasks blocking T101950, jumping to an own project only if needed.
Some replies to feedback received:
- If you are involved or in touch with an open data organization, you
are encouraged to add yourselves to "Organizations interested" at https://phabricator.wikimedia.org/T101950.
- Having real cases with real problems is the best way to focus on what
really matters. There is a section "The problem" where we should list the problems that are blocking or making difficult fruitful contributions to Wikidata.
- Where to publish entire datasets... Something tells me that this is
not the most urgent and important problem that we have, but the community definitely knows better, so correct me if I'm wrong. I think our main use case are organizations having certain types of data that would be great to have in Wikipedia via Wikidata. Whatever is the most urgent problem, we need to be very selective in the scope we want to address in this quarterly goal.
- The experience of maintainers of current projects like ProteinBoxBot
is very very useful. Please do not hesitate getting into details, creating subtasks to address specific problems, etc.
- Scott, I have the impression that we need to define this framework a
bit before projects like yours can jump in and find their ways for contributing. I don't have a good reply to you right now, and this goal is precisely about offering a framework useful to you, useful to Wikimedia.
Looking forward to your feedback and to start editing on a wiki page soon.
On Wed, Jul 1, 2015 at 8:09 PM, Info WorldUniversity < info@worlduniversityandschool.org> wrote:
Hi Quim, Sylvia, Lydia and Wikidatans,
In terms of strategic partnerships with CC Wikidata, and this Wikidata Engineering Community project looking for -
"* people that have been in touch with organizations willing to contribute their open data" -
in what ways could CC World University and School contribute our early CC Nation State Universities, e.g. Mexico World University and School - http://worlduniversity.wikia.com/wiki/Mexico - (with only the CC MIT OCW in Spanish so far, and not yet in Spanish otherwise)? See also, for example, the Nation States' wiki subject page at WUaS - http://worlduniversity.wikia.com/wiki/Nation_States. Each wiki Nation-State University at CC WUaS, seeks to accredit in each country in their main languags to offer online CC baccalaureate, Ph.D., Law and M.D. degrees, and also I.B. high school degrees, if possible, based on CC MIT OCW in 7 languages and CC Yale OYC, to begin.
In addition, CC World University would like to facilitate wiki / Wikidata schools for open teaching and learning in each of all 7,929+ languages. As planned interlingual wiki schools for highest quality universal education, CC World University and School would like to explore developing in Wikidata, and become a growth story also for Wikidata. World University would thus like to explore contributing our open data in these regards.
In what ways might WUaS best contribute in terms of a strategic partnerships re this Wikidata Engineering Community project ? Perhaps World University could become a part of the"community framework allowing Wikidata content and tech contributors, data engineers, and open data organizations to collaborate effectively." Thank you.
Best, Info (Scott)
On Wed, Jul 1, 2015 at 9:48 AM, Benjamin Good <ben.mcgee.good@gmail.com
wrote:
Quim,
I'm not familiar with GLAM or what you are really asking for here. Could you elaborate a little? Our group is actively engaged in writing bots for populating wikidata with trusted biomedical information and for using that information to drive applications such as Wikipedia. Processes for making this easier would be most welcome. A lot of what we are doing and hoping to do is described on this bot page: https://www.wikidata.org/wiki/User:ProteinBoxBot
? -Ben
On Wed, Jul 1, 2015 at 9:23 AM, David Cuenca Tudela <dacuetu@gmail.com
wrote:
Hello Quim,
There was always the issue of where to publish datasets from partner organisations like a http://datahub.io/
Is that being considered in this new iteration?
Cheers, Micru
On Wed, Jul 1, 2015 at 6:19 PM, Romaine Wiki romaine.wiki@gmail.com wrote:
> Hello Quim, > > We have in Belgium (as Wikimedia Belgium) a partner organisation who > is together with us working with cultural institutions to get open datasets > to be used in Wikidata. > > So yes, we are interested. > > Greetings, > Romaine > > 2015-07-01 17:31 GMT+02:00 Quim Gil qgil@wikimedia.org: > >> Hi, it's first of July and I would like to introduce you a >> quarterly goal that the Engineering Community team has committed to: >> >> Establish a framework to engage with data engineers and open data >> organizations >> https://phabricator.wikimedia.org/T101950 >> >> We are missing a community framework allowing Wikidata content and >> tech contributors, data engineers, and open data organizations to >> collaborate effectively. Imagine GLAM applied to data. >> >> If all goes well, by the end of September we would like to have >> basic documentation and community processes for open data engineers and >> organizations willing to contribute to Wikidata, and ongoing >> projects with one open data org. >> >> If you are interested, get involved! We are looking for >> >> * Wikidata contributors with good institutional memory >> * people that has been in touch with organizations willing to >> contribute their open data >> * developers willing to help improving our software and programming >> missing pieces >> * also contributors familiar with the GLAM model(s), what works and >> what didn't work >> >> This goal has been created after some conversations with Lydia >> Pintscher (Wikidata team) and Sylvia Ventura (Strategic Partnerships). Both >> are on board, Lydia assuring that this work fits into what is technically >> effective, and Sylvia checking our work against real open data >> organizations willing to get involved. >> >> This email effectively starts the bootstrapping of this project. I >> will start creating subtasks under that goal based on your feedback and >> common sense. >> >> -- >> Quim Gil >> Engineering Community Manager @ Wikimedia Foundation >> http://www.mediawiki.org/wiki/User:Qgil >> >> _______________________________________________ >> Wikidata mailing list >> Wikidata@lists.wikimedia.org >> https://lists.wikimedia.org/mailman/listinfo/wikidata >> >> > > _______________________________________________ > Wikidata mailing list > Wikidata@lists.wikimedia.org > https://lists.wikimedia.org/mailman/listinfo/wikidata > >
-- Etiamsi omnes, ego non
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
--
Scott MacLeod - Founder & President
415 480 4577
PO Box 442, (86 Ridgecrest Road), Canyon, CA 94516
World University and School - like Wikipedia with best STEM-centric
OpenCourseWare - incorporated as a nonprofit university and school in California, and is a U.S. 501 (c) (3) tax-exempt educational organization, both effective April 2010.
World University and School is sending you this because of your interest in free, online, higher education. If you don't want to receive these, please reply with 'unsubscribe' in the body of the email, leaving the subject line intact. Thank you.
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
-- Quim Gil Engineering Community Manager @ Wikimedia Foundation http://www.mediawiki.org/wiki/User:Qgil
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
-- Quim Gil Engineering Community Manager @ Wikimedia Foundation http://www.mediawiki.org/wiki/User:Qgil
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
On Thu, Jul 2, 2015 at 5:38 PM, Info WorldUniversity < info@worlduniversityandschool.org> wrote:
Quim, since Wikia (e.g. WUaS is in the open Wikia wiki http://worlduniversity.wikia.com/wiki/Subjects) is also a Wikimedia project
Wikia is not a Wikimedia project, but we are digressing. :)
whether these ~10 main World University foci/areas could become part of
framework for the contributory data planning for what you have in mind.
As a framework, Wiki Loves Open Data [1] should guide organizations through the process of contributing content to Wikidata. Open data organizations or data sets will not be "part of the framework". Each organization will decide their own involvement. If the framework is good, they would not need any type of acceptance or endorsement.
The example of World University makes me think that we need a technical definition of "open data" based on what Wikidata can or is willing to digest. I might be wrong, but there I'm seeing wiki pages, not structure data.
[1] I'll start using this term and see what happens.
Hi Quim, Jane, Andrew and Wikidatans,
Thanks, Quim. You're right that Wikia is not a Wikimedia project but Jimmy Wales is the head of both interestingly.
Thanks for the clarification that "Wiki Loves Open Data [:)] should guide organizations through the process of contributing content to Wikidata" ... and World University looks forward to learning how this will work. (I also just posted my email of 30 minutes ago to you and Jane here - http://scott-macleod.blogspot.com/2015/07/impatiens-species-structuring-worl... - in terms of WUaS's ~ 10 main foci/areas).
In terms of a technical definition of open data, I see data here as ...
"facts and statistics collected together for reference or analysis. See also datum.
• Computing the quantities, characters, or symbols on which operations are performed by a computer, being stored and transmitted in the form of electrical signals and recorded on magnetic, optical, or mechanical recording media" (Apple Dictionary 2.x) ...
and both Wikidata's as well as World University's openness seem in part to have to do with Creative Commons' licensing and in part with the framework you're creating, Quim, for which WUaS is especially grateful.
Thank you and cheers, Scott
On Fri, Jul 3, 2015 at 1:49 AM, Quim Gil qgil@wikimedia.org wrote:
On Thu, Jul 2, 2015 at 5:38 PM, Info WorldUniversity < info@worlduniversityandschool.org> wrote:
Quim, since Wikia (e.g. WUaS is in the open Wikia wiki http://worlduniversity.wikia.com/wiki/Subjects) is also a Wikimedia project
Wikia is not a Wikimedia project, but we are digressing. :)
whether these ~10 main World University foci/areas could become part of
framework for the contributory data planning for what you have in mind.
As a framework, Wiki Loves Open Data [1] should guide organizations through the process of contributing content to Wikidata. Open data organizations or data sets will not be "part of the framework". Each organization will decide their own involvement. If the framework is good, they would not need any type of acceptance or endorsement.
The example of World University makes me think that we need a technical definition of "open data" based on what Wikidata can or is willing to digest. I might be wrong, but there I'm seeing wiki pages, not structure data.
[1] I'll start using this term and see what happens.
-- Quim Gil Engineering Community Manager @ Wikimedia Foundation http://www.mediawiki.org/wiki/User:Qgil
On 1 July 2015 at 22:51, Quim Gil qgil@wikimedia.org wrote:
- Where to publish entire datasets... Something tells me that this is not
the most urgent and important problem that we have, but the community definitely knows better, so correct me if I'm wrong. I think our main use
I would agree with this. There has historically been a lot of vagueness around the word "data", and a lot of vague suggestions in the early days when Wikidata was still being created... and as a result people sometimes get the impression that Wikidata intends to be a kind of generalised data repository. This is a bit like assuming Wikipedia will take anything that's got words :-)
I wonder if it would be good to identify a couple of good, reliable, repositories we can encourage people to use for this sort of material? This means that even if we have to say to a potential partner "sorry, this isn't what we want", we can still give them advice on how to get it released and available in the most appropriate way. Better than a frustrating back-and-forth...
A
Well that really depends on the data actually. There are lots of printed datasets and if someone has those online and can no longer host them, then we might want to harvest some, if not all of it. I am thinking of datasets of large collections of <whatever>. I recall not long ago a museum of music records became defunct and they were looking for a home for their database. We couldn't do anything for them then but we could put it in Mix-n-Match today (assuming the data is all published material that is considered a reliable source yada yada...)
On Fri, Jul 3, 2015 at 2:10 PM, Andrew Gray andrew.gray@dunelm.org.uk wrote:
On 1 July 2015 at 22:51, Quim Gil qgil@wikimedia.org wrote:
- Where to publish entire datasets... Something tells me that this is not
the most urgent and important problem that we have, but the community definitely knows better, so correct me if I'm wrong. I think our main use
I would agree with this. There has historically been a lot of vagueness around the word "data", and a lot of vague suggestions in the early days when Wikidata was still being created... and as a result people sometimes get the impression that Wikidata intends to be a kind of generalised data repository. This is a bit like assuming Wikipedia will take anything that's got words :-)
I wonder if it would be good to identify a couple of good, reliable, repositories we can encourage people to use for this sort of material? This means that even if we have to say to a potential partner "sorry, this isn't what we want", we can still give them advice on how to get it released and available in the most appropriate way. Better than a frustrating back-and-forth...
A
--
- Andrew Gray andrew.gray@dunelm.org.uk
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Hi,
I've been quite interested in the idea of compiling a list of companies into Wikidata. I wanted to see how well this fits with Wikidata's goals. There are company databases like Hoovers/DNB, Mattermark, LinkedIn, Crunchbase, DataFox, etc. which are very popular, but not as open as Wikidata. These sites are hugely valuable to folks like investors, job candidates, sales organizations, etc. and I'd love to see an open collaborative approach.
My company could contribute engineering resources towards compiling this data in Wikidata. E.g. maybe building a tool which allows to pull and review facts from company websites, SEC filings, news articles referencing funding amounts or acquisitions, etc. into wikidata. This is something that we could do on our own behind closed doors, but it seems more interesting to do it in the open where others can receive value from it and contribute back such that the dataset grows and becomes more valuable.
Many of these companies may not be notable or interesting on their own. However, having an entire compiled dataset where you can lookup any company is extremely interesting. I'd love Wikidata to be the de facto go-to for this type of data instead of the more closed organizations mentioned above.
Is this the type of project that's interesting to Wikidata? If so, I'd be very interested in learning more about the types of ways that we can engage with Wikidata and helping to develop a framework for contributing.
Thanks, Ben
On Fri, Jul 3, 2015 at 5:47 AM, Jane Darnell jane023@gmail.com wrote:
Well that really depends on the data actually. There are lots of printed datasets and if someone has those online and can no longer host them, then we might want to harvest some, if not all of it. I am thinking of datasets of large collections of <whatever>. I recall not long ago a museum of music records became defunct and they were looking for a home for their database. We couldn't do anything for them then but we could put it in Mix-n-Match today (assuming the data is all published material that is considered a reliable source yada yada...)
On Fri, Jul 3, 2015 at 2:10 PM, Andrew Gray andrew.gray@dunelm.org.uk wrote:
On 1 July 2015 at 22:51, Quim Gil qgil@wikimedia.org wrote:
- Where to publish entire datasets... Something tells me that this is
not
the most urgent and important problem that we have, but the community definitely knows better, so correct me if I'm wrong. I think our main
use
I would agree with this. There has historically been a lot of vagueness around the word "data", and a lot of vague suggestions in the early days when Wikidata was still being created... and as a result people sometimes get the impression that Wikidata intends to be a kind of generalised data repository. This is a bit like assuming Wikipedia will take anything that's got words :-)
I wonder if it would be good to identify a couple of good, reliable, repositories we can encourage people to use for this sort of material? This means that even if we have to say to a potential partner "sorry, this isn't what we want", we can still give them advice on how to get it released and available in the most appropriate way. Better than a frustrating back-and-forth...
A
--
- Andrew Gray andrew.gray@dunelm.org.uk
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Ben
I think opencorporates.com may be a better fit for general information about businesses. We already have a problem in wikidata with companies that don't meet our notablity criteria creating items in wikidata.
Joe
On Fri, 3 Jul 2015 17:19 Ben McCann ben@benmccann.com wrote:
Hi,
I've been quite interested in the idea of compiling a list of companies into Wikidata. I wanted to see how well this fits with Wikidata's goals. There are company databases like Hoovers/DNB, Mattermark, LinkedIn, Crunchbase, DataFox, etc. which are very popular, but not as open as Wikidata. These sites are hugely valuable to folks like investors, job candidates, sales organizations, etc. and I'd love to see an open collaborative approach.
My company could contribute engineering resources towards compiling this data in Wikidata. E.g. maybe building a tool which allows to pull and review facts from company websites, SEC filings, news articles referencing funding amounts or acquisitions, etc. into wikidata. This is something that we could do on our own behind closed doors, but it seems more interesting to do it in the open where others can receive value from it and contribute back such that the dataset grows and becomes more valuable.
Many of these companies may not be notable or interesting on their own. However, having an entire compiled dataset where you can lookup any company is extremely interesting. I'd love Wikidata to be the de facto go-to for this type of data instead of the more closed organizations mentioned above.
Is this the type of project that's interesting to Wikidata? If so, I'd be very interested in learning more about the types of ways that we can engage with Wikidata and helping to develop a framework for contributing.
Thanks, Ben
On Fri, Jul 3, 2015 at 5:47 AM, Jane Darnell jane023@gmail.com wrote:
Well that really depends on the data actually. There are lots of printed datasets and if someone has those online and can no longer host them, then we might want to harvest some, if not all of it. I am thinking of datasets of large collections of <whatever>. I recall not long ago a museum of music records became defunct and they were looking for a home for their database. We couldn't do anything for them then but we could put it in Mix-n-Match today (assuming the data is all published material that is considered a reliable source yada yada...)
On Fri, Jul 3, 2015 at 2:10 PM, Andrew Gray andrew.gray@dunelm.org.uk wrote:
On 1 July 2015 at 22:51, Quim Gil qgil@wikimedia.org wrote:
- Where to publish entire datasets... Something tells me that this is
not
the most urgent and important problem that we have, but the community definitely knows better, so correct me if I'm wrong. I think our main
use
I would agree with this. There has historically been a lot of vagueness around the word "data", and a lot of vague suggestions in the early days when Wikidata was still being created... and as a result people sometimes get the impression that Wikidata intends to be a kind of generalised data repository. This is a bit like assuming Wikipedia will take anything that's got words :-)
I wonder if it would be good to identify a couple of good, reliable, repositories we can encourage people to use for this sort of material? This means that even if we have to say to a potential partner "sorry, this isn't what we want", we can still give them advice on how to get it released and available in the most appropriate way. Better than a frustrating back-and-forth...
A
--
- Andrew Gray andrew.gray@dunelm.org.uk
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
-- about.me/benmccann _______________________________________________ Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Unfortunately, opencorporates very often does not fulfill the need for a company database. The primary reason is that they're only tracking data used to register the company like name, address, and id number. The things that company databases are most used for is very often things like # of employees, industry, funding, revenue, etc.
Wikidata's notability requirements are a bit vague to me. What does it mean that something can be described using serious and publicly available references? E.g. would Protingent be notable? What about GlamBot or Little Bizzy?
-Ben
On Fri, Jul 3, 2015 at 1:26 PM, Joe Filceolaire filceolaire@gmail.com wrote:
Ben
I think opencorporates.com may be a better fit for general information about businesses. We already have a problem in wikidata with companies that don't meet our notablity criteria creating items in wikidata.
Joe
On Fri, 3 Jul 2015 17:19 Ben McCann ben@benmccann.com wrote:
Hi,
I've been quite interested in the idea of compiling a list of companies into Wikidata. I wanted to see how well this fits with Wikidata's goals. There are company databases like Hoovers/DNB, Mattermark, LinkedIn, Crunchbase, DataFox, etc. which are very popular, but not as open as Wikidata. These sites are hugely valuable to folks like investors, job candidates, sales organizations, etc. and I'd love to see an open collaborative approach.
My company could contribute engineering resources towards compiling this data in Wikidata. E.g. maybe building a tool which allows to pull and review facts from company websites, SEC filings, news articles referencing funding amounts or acquisitions, etc. into wikidata. This is something that we could do on our own behind closed doors, but it seems more interesting to do it in the open where others can receive value from it and contribute back such that the dataset grows and becomes more valuable.
Many of these companies may not be notable or interesting on their own. However, having an entire compiled dataset where you can lookup any company is extremely interesting. I'd love Wikidata to be the de facto go-to for this type of data instead of the more closed organizations mentioned above.
Is this the type of project that's interesting to Wikidata? If so, I'd be very interested in learning more about the types of ways that we can engage with Wikidata and helping to develop a framework for contributing.
Thanks, Ben
On Fri, Jul 3, 2015 at 5:47 AM, Jane Darnell jane023@gmail.com wrote:
Well that really depends on the data actually. There are lots of printed datasets and if someone has those online and can no longer host them, then we might want to harvest some, if not all of it. I am thinking of datasets of large collections of <whatever>. I recall not long ago a museum of music records became defunct and they were looking for a home for their database. We couldn't do anything for them then but we could put it in Mix-n-Match today (assuming the data is all published material that is considered a reliable source yada yada...)
On Fri, Jul 3, 2015 at 2:10 PM, Andrew Gray andrew.gray@dunelm.org.uk wrote:
On 1 July 2015 at 22:51, Quim Gil qgil@wikimedia.org wrote:
- Where to publish entire datasets... Something tells me that this is
not
the most urgent and important problem that we have, but the community definitely knows better, so correct me if I'm wrong. I think our main
use
I would agree with this. There has historically been a lot of vagueness around the word "data", and a lot of vague suggestions in the early days when Wikidata was still being created... and as a result people sometimes get the impression that Wikidata intends to be a kind of generalised data repository. This is a bit like assuming Wikipedia will take anything that's got words :-)
I wonder if it would be good to identify a couple of good, reliable, repositories we can encourage people to use for this sort of material? This means that even if we have to say to a potential partner "sorry, this isn't what we want", we can still give them advice on how to get it released and available in the most appropriate way. Better than a frustrating back-and-forth...
A
--
- Andrew Gray andrew.gray@dunelm.org.uk
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
-- about.me/benmccann _______________________________________________ Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Ben,
You should check out my website at econfactbook.org. I have some corporate data on the website but I plan to get more.
I am also working on https://meta.wikimedia.org/wiki/The_Global_Economic_Map:_Format using Wikidata.
- Alex
On 3 July 2015 at 15:30, Ben McCann ben@benmccann.com wrote:
Unfortunately, opencorporates very often does not fulfill the need for a company database. The primary reason is that they're only tracking data used to register the company like name, address, and id number. The things that company databases are most used for is very often things like # of employees, industry, funding, revenue, etc.
Wikidata's notability requirements are a bit vague to me. What does it mean that something can be described using serious and publicly available references? E.g. would Protingent be notable? What about GlamBot or Little Bizzy?
-Ben
On Fri, Jul 3, 2015 at 1:26 PM, Joe Filceolaire filceolaire@gmail.com wrote:
Ben
I think opencorporates.com may be a better fit for general information about businesses. We already have a problem in wikidata with companies that don't meet our notablity criteria creating items in wikidata.
Joe
On Fri, 3 Jul 2015 17:19 Ben McCann ben@benmccann.com wrote:
Hi,
I've been quite interested in the idea of compiling a list of companies into Wikidata. I wanted to see how well this fits with Wikidata's goals. There are company databases like Hoovers/DNB, Mattermark, LinkedIn, Crunchbase, DataFox, etc. which are very popular, but not as open as Wikidata. These sites are hugely valuable to folks like investors, job candidates, sales organizations, etc. and I'd love to see an open collaborative approach.
My company could contribute engineering resources towards compiling this data in Wikidata. E.g. maybe building a tool which allows to pull and review facts from company websites, SEC filings, news articles referencing funding amounts or acquisitions, etc. into wikidata. This is something that we could do on our own behind closed doors, but it seems more interesting to do it in the open where others can receive value from it and contribute back such that the dataset grows and becomes more valuable.
Many of these companies may not be notable or interesting on their own. However, having an entire compiled dataset where you can lookup any company is extremely interesting. I'd love Wikidata to be the de facto go-to for this type of data instead of the more closed organizations mentioned above.
Is this the type of project that's interesting to Wikidata? If so, I'd be very interested in learning more about the types of ways that we can engage with Wikidata and helping to develop a framework for contributing.
Thanks, Ben
On Fri, Jul 3, 2015 at 5:47 AM, Jane Darnell jane023@gmail.com wrote:
Well that really depends on the data actually. There are lots of printed datasets and if someone has those online and can no longer host them, then we might want to harvest some, if not all of it. I am thinking of datasets of large collections of <whatever>. I recall not long ago a museum of music records became defunct and they were looking for a home for their database. We couldn't do anything for them then but we could put it in Mix-n-Match today (assuming the data is all published material that is considered a reliable source yada yada...)
On Fri, Jul 3, 2015 at 2:10 PM, Andrew Gray andrew.gray@dunelm.org.uk wrote:
On 1 July 2015 at 22:51, Quim Gil qgil@wikimedia.org wrote:
- Where to publish entire datasets... Something tells me that this
is not
the most urgent and important problem that we have, but the community definitely knows better, so correct me if I'm wrong. I think our
main use
I would agree with this. There has historically been a lot of vagueness around the word "data", and a lot of vague suggestions in the early days when Wikidata was still being created... and as a result people sometimes get the impression that Wikidata intends to be a kind of generalised data repository. This is a bit like assuming Wikipedia will take anything that's got words :-)
I wonder if it would be good to identify a couple of good, reliable, repositories we can encourage people to use for this sort of material? This means that even if we have to say to a potential partner "sorry, this isn't what we want", we can still give them advice on how to get it released and available in the most appropriate way. Better than a frustrating back-and-forth...
A
--
- Andrew Gray andrew.gray@dunelm.org.uk
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
-- about.me/benmccann _______________________________________________ Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
-- about.me/benmccann
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Dear Ben,
I've been quite interested in the idea of compiling a list of companies into Wikidata. I wanted to see how well this fits with Wikidata's goals. There are company databases like Hoovers/DNB, Mattermark, LinkedIn, Crunchbase, DataFox, etc. which are very popular, but not as open as Wikidata. These sites are hugely valuable to folks like investors, job candidates, sales organizations, etc. and I'd love to see an open collaborative approach.
Are you aware of the http://wikirate.org/ community? This is a EU research funded project.
Raphaël
Hi, just a note to say that I'm back from vacation and I just created
https://www.wikidata.org/wiki/Wikidata:Wiki_Loves_Open_Data
Work in progress! Your edits / feedback are welcome in the discussion page, https://phabricator.wikimedia.org/T104701, or here.
Raphaël Troncy, 04/07/2015 15:55:
Are you aware of the http://wikirate.org/ community? This is a EU research funded project.
Doesn't seem to contain any objective data, am I mistaken?
Nemo