Hello all,
I am new to the wikipedia researcher community, so I hope that I'm not
mis-using this list.
*PatCit*
Together with Pr. G. de Rassenfosse <http://www.gder.info/>, we are working
on an open-source project to extract, parse and consolidate worldwide
Patent citations. This is PatCit, and you can learn more about it on the
project GitHub <https://github.com/cverluise/PatCit> and in this short
presentation
<https://docs.google.com/presentation/d/11COlz64EZn8PipXvnDBBZI_bnDD0fpm6tyx1_EqD6lU/edit?usp=sharing>
.
*Patent to wiki citations*
Specifically, we found ~100k wiki citations (with a large majority of
wikipedia articles) stemming from worldwide patents. We have parsed these
citations (item, date, url, hostname) and made the dev table publicly
available on BigQuery (see fp_wiki_dev
<https://console.cloud.google.com/bigquery?project=npl-parsing&p=npl-parsing&d=patcit&t=fp_wiki_dev&page=table>).
A short presentation of this specific work is available here
<https://docs.google.com/presentation/d/1fX7rS4oRXQAZFDuqxSES7XpJp-2rZyKR9nHkz4kegsE/edit?usp=sharing>
.
*We need you!*
Since we are new to the wikipedia research questions, we are looking for
feedback, discussion, etc on how to make these data a true asset for the
community.
The project is thought to be by and for the community and we are deeply
committed to open-source/access and FAIR principles. The overarching
objective of the project is to stimulate interdisciplinary research to
help better understand the various forms and dynamics of innovation.
Happy for feedback, discussion,etc. Feel free to send us an email.
Cheers,
Cyril
--
Cyril Verluise
PhD Candidate - Paris School of Economics
Fellow - Collège de France
cyril.verluise(a)gmail.com
cverluise.github.io
[image: twitter] <https://twitter.com/cyrilverluise>
[image: linkedin] <https://www.linkedin.com/in/cyril-verluise-80184347/>