I recommend you have a look at the SWEBLE project <http://sweble.org/>, at least
for the parsing. They basically represent all of Wikipedia (and potentially all
Wikipedias together) as one huge parse tree, using an XML database.
The website doesn't have many details, but they are building some interesting
projects on top of this. It is best to contact Dirk Riehle directly,
<https://osr.cs.fau.de/people/members/riehle-dirk/>.
On 01.08.2016 at 20:38, Ian Seyer wrote:
Full disclosure: I am the creator of the Project Grant application for
Arc.heolo.gy <http://arc.heolo.gy/>, located here:
https://meta.wikimedia.org/wiki/Grants:Project/Arc.heolo.gy
I hope for this to be a general discussion on potential applications,
criticisms, questions, technological recommendations, and community discussion
about a graph representation of Wikipedia.
Currently, the project has a live Neo4j graph database built by parsing an April
download of the English-language Wikipedia. I have temporarily hosted
the database instance both on my local machine and a SoftLayer server provided
under a temporary entrepreneur credit.
My goal is twofold.
On the backend: refine the parsing algorithm (I am getting some incorrect
relationships in the database), automate the parsing so that it updates the
database frequently, expand language support, and perform semantic parsing to
weight individual relationships to strengthen the ability to filter out
extraneous relationships.
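To make the backend goals concrete, here is a minimal sketch of one way link extraction and a crude relationship weight could look. It is not the parser Arc.heolo.gy actually uses: the regular expression, the list of skipped namespaces, and the names normalize and extract_edges are all assumptions made for illustration, and the weight is simply how often a target is linked from the page, not a semantic score. Templates, redirects, and nested markup are ignored.

```python
import re
from collections import Counter

# Matches [[Target]], [[Target|label]] and [[Target#Section|label]].
# Group 1 captures only the target title, without section or label.
WIKILINK = re.compile(r"\[\[([^\[\]|#]+)(?:#[^\[\]|]*)?(?:\|[^\[\]]*)?\]\]")

# Assumed list of namespaces that do not describe article-to-article links.
SKIPPED_PREFIXES = ("File:", "Image:", "Category:", "Template:", "Help:", "Wikipedia:")


def normalize(title):
    """Collapse underscores and whitespace, then capitalize the first letter."""
    title = " ".join(title.replace("_", " ").split())
    return title[:1].upper() + title[1:]


def extract_edges(source_title, wikitext):
    """Return (source, target, weight) tuples for one page.

    The weight is the number of times the target is linked from the page,
    a simple stand-in for a real relevance score.
    """
    counts = Counter()
    for match in WIKILINK.finditer(wikitext):
        target = normalize(match.group(1))
        if not target or target.startswith(SKIPPED_PREFIXES):
            continue
        counts[target] += 1
    source = normalize(source_title)
    # Drop self-links, which add nothing to the graph.
    return [(source, target, n) for target, n in counts.items() if target != source]
```

Each tuple could then be written to the graph database as a weighted relationship, and low-weight edges filtered out at query time.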
On the frontend: I have done little to no work here beyond pure
conceptualization. I hope to use an asynchronous front-end JavaScript
framework to build both a 2D (D3) and a 3D (WebGL) interface for exploring
the database with a high degree of control and ease.
If any of you would like to access the database for exploration, please contact
me privately and I will give you credentials.
Any recommendations on parsing, hosting, visualization, or otherwise are
appreciated. Endorsements and volunteers are also highly appreciated!
P.S. I am new to directly engaging with the Wiki community, and if I committed
some faux pas in starting this thread please let me know and I will do my best
to correct it.
--
Daniel Kinzler
Senior Software Developer
Wikimedia Deutschland
Gesellschaft zur Förderung Freien Wissens e.V.