Forwarding to the Discovery list, since this project seems like it might be of interest even outside the wikidata context. Blame me if you've already seen this elsewhere. :)
Kevin Smith Agile Coach, Wikimedia Foundation
---------- Forwarded message ---------- From: Marco Fossati fossati@spaziodati.eu Date: Wed, Jun 15, 2016 at 9:06 AM Subject: [Wikimedia-l] [ANNOUNCEMENT] StrepHit 1.0 Beta Release To: "Discussion list for the Wikidata project." < wikidata@lists.wikimedia.org> Cc: wikimedia-l@lists.wikimedia.org, wiki-research-l@lists.wikimedia.org
[Feel free to blame me if you read this more than once]
To whom it may interest,
Full of delight, I would like to announce the first beta release of *StrepHit*:
https://github.com/Wikidata/StrepHit
TL;DR: StrepHit is an intelligent reading agent that understands text and translates it into *referenced* Wikidata statements. It is a IEG project funded by the Wikimedia Foundation.
Key features: -Web spiders to harvest a collection of documents (corpus) from reliable sources -automatic corpus analysis to understand the most meaningful verbs -sentences and semi-structured data extraction -train a machine learning classifier via crowdsourcing -*supervised and rule-based fact extraction from text* -Natural Language Processing utilities -parallel processing
You can find all the details here: https://meta.wikimedia.org/wiki/Grants:IEG/StrepHit:_Wikidata_Statements_Val... https://meta.wikimedia.org/wiki/Grants:IEG/StrepHit:_Wikidata_Statements_Val...
If you like it, star it on GitHub!
Best,
Marco
Hi, guys!
Nice project! Is it possible for someone not working at Wikipedia to contribute to the project?
Regards, Shubham
On Wed, Jun 15, 2016 at 9:59 PM, Kevin Smith ksmith@wikimedia.org wrote:
Forwarding to the Discovery list, since this project seems like it might be of interest even outside the wikidata context. Blame me if you've already seen this elsewhere. :)
Kevin Smith Agile Coach, Wikimedia Foundation
---------- Forwarded message ---------- From: Marco Fossati fossati@spaziodati.eu Date: Wed, Jun 15, 2016 at 9:06 AM Subject: [Wikimedia-l] [ANNOUNCEMENT] StrepHit 1.0 Beta Release To: "Discussion list for the Wikidata project." < wikidata@lists.wikimedia.org> Cc: wikimedia-l@lists.wikimedia.org, wiki-research-l@lists.wikimedia.org
[Feel free to blame me if you read this more than once]
To whom it may interest,
Full of delight, I would like to announce the first beta release of *StrepHit*:
https://github.com/Wikidata/StrepHit
TL;DR: StrepHit is an intelligent reading agent that understands text and translates it into *referenced* Wikidata statements. It is a IEG project funded by the Wikimedia Foundation.
Key features: -Web spiders to harvest a collection of documents (corpus) from reliable sources -automatic corpus analysis to understand the most meaningful verbs -sentences and semi-structured data extraction -train a machine learning classifier via crowdsourcing -*supervised and rule-based fact extraction from text* -Natural Language Processing utilities -parallel processing
You can find all the details here:
https://meta.wikimedia.org/wiki/Grants:IEG/StrepHit:_Wikidata_Statements_Val...
https://meta.wikimedia.org/wiki/Grants:IEG/StrepHit:_Wikidata_Statements_Val...
If you like it, star it on GitHub!
Best,
Marco
discovery mailing list discovery@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/discovery