Hi Dario and All,
Thanks, and per your "My thoughts on why we need a class to describe
“(bibliographic) item collections” in @wikidata and @wikicite. Feedback
welcome" - https://twitter.com/ReaderMeter/status/934204071870742528
reply here - https://twitter.com/WorldUnivAndSch/status/934570933372588032
) - I wonder how to add a further "virtual place-based" component to this -
beyond Wikidata geo-coordinates - since such collections (e.g. in all 8,444
entries in languages per Glottolog eventually, and over all history and
pre-history) may become so numerous with time - given Wikimedia's mission
of all knowledge - such that organizing such (bibliographic) items in
collections within a realistic virtual earth (think Google
Streetview/Maps/Earth with TIME SLIDER and group build-able like
Minecraft) - e.g. for where an artist made such art works in the 1600s, or
for museums that have numerous items of such a collection now, but didn't
100 years' ago ... and even eventually with avatar bots painting these
again - may have much merit. (Wiki CC MIT OCW-centric World Univ & Sch is
planning ALL Libraries and ALL Museums in each of all 8,444 languages in
such a realistic virtual earth with TIME SLIDER).
Harbin Hot Springs’ Actual/Virtual Ethnography
On Fri, Nov 24, 2017 at 3:30 PM, Dario Taraborelli <
I'd like to hear from you on a proposal to add some order and structure to
the various bibliographic corpora we currently have in Wikidata.
As you may know, coverage of creative works in Wikidata has seen
significant growth over the last year.  Different groups and projects
have started importing source metadata for various reasons:
- to provide sources machine-extracted statements (WikiFactMine ,
- to represent sources cited in Wikipedia (e.g. DOIs and PMIDs
imported via the mwcite identifier dumps) or other Wikimedia projects
(Wikisource, Wikispecies, Wikinews)
- to create collections of the open access literature citable and
reusable in Wikimedia projects (e.g. open access PMC review articles)
- to maintain small, curated corpora about specific topics (e.g. the
Zika corpus )
While all these efforts have grown organically and with little
coordination, it's hard to keep track of who initiated the, to clearly
communicate their purpose, to understand their completion criteria and
their data quality needs, and last but not least to offer any contribution
opportunities (in terms of code, or manual labor) to other community
members. It's unclear if the future of these efforts should continue to be
within Wikidata, or leverage the power of federated Wikibase-powered wikis
(see our discussion at the end of the WikiCite session at WikidataCon ).
Irrespective of the best long term solution, we need to provide some better
structure to these efforts today if we want to address the above problems.
I'd like to propose a fairly simple solution and hear your feedback on
whether it makes sense to implement it as is or with some modifications.
1. create a Wikidata class called "Wikidata item collection" [Q-X]
2. create and document individual collections (e.g. the Wikidata Zika
corpus [Q-Y]) as instances of this class: [Q-Y] --P31--> [Q-X]
3. add appropriate metadata to describe such collections (its main
topic(s), creators, any external identifiers, if applicable)
4. mark individual bibliographic items as part of [P361] the
Note that this approach can apply to bibliographic item collections but
also to any other set of items not directly identifiable via Wikidata
properties. Of course, the same items could obviously be part of multiple
collections. Some criteria would be needed to determine an appropriate
threshold for legitimate collections (we wouldn't want arbitrary
collections to be created for sets of items generated as part of a test
Beyond solving the issues listed above, this approach would also allow us
to generate dedicated statistics on the growth or data quality of each
collection via the SPARQL endpoint. It would also allow us to design
constraints for arbitrary item collections, something that right now is
not possible (unless these sets can already be identified via a query).
If something similar already exists in the context of structured data
donations/imports for GLAM, I'd be most grateful for any pointers.
You received this message because you are subscribed to the Google Groups
To unsubscribe from this group and stop receiving emails from it, send an
email to wikicite-discuss+unsubscribe(a)wikimedia.org.
- Scott MacLeod - Founder & President
- World University and School
- 415 480 4577
- CC World University and School - like CC Wikipedia with best STEM-centric
CC OpenCourseWare - incorporated as a nonprofit university and school in
California, and is a U.S. 501 (c) (3) tax-exempt educational organization.
IMPORTANT NOTICE: This transmission and any attachments are intended only
for the use of the individual or entity to which they are addressed and may
contain information that is privileged, confidential, or exempt from
disclosure under applicable federal or state laws. If the reader of this
transmission is not the intended recipient, you are hereby notified that
any use, dissemination, distribution, or copying of this communication is
strictly prohibited. If you have received this transmission in error,
please notify me immediately by email or telephone.
World University and School is sending you this because of your interest in
free, online, higher education. If you don't want to receive these, please
reply with 'unsubscribe' in the body of the email, leaving the subject line
intact. Thank you.