Full disclosure: I am the creator of the Project Grant application for Arc.heolo.gy (http://arc.heolo.gy/), located here: https://meta.wikimedia.org/wiki/Grants:Project/Arc.heolo.gy
I hope this thread can be a general community discussion of potential applications, criticisms, questions, and technology recommendations for a graph representation of Wikipedia.
Currently, the project has a live Neo4j graph database built by parsing a download of the English-language Wikipedia from April. I am temporarily hosting the database instance both on my local machine and on a SoftLayer server provided under a temporary entrepreneur credit.
My goal is twofold. On the backend: refine the parsing algorithm (I am getting some incorrect relationships in the database), automate the parsing so that the database is updated frequently, expand language support, and perform semantic parsing to weight individual relationships, which should make it easier to filter out extraneous ones. On the frontend: I have done little to no work here beyond pure conceptualization. I hope to use an asynchronous front-end JavaScript framework to build both a 2D (D3) and a 3D (WebGL) interface for exploring the database with a high degree of control and ease.
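For illustration only, the backend link-parsing and weighting step could be sketched roughly as follows in Python. This is not the project's actual parser: the regex, the skipped namespace prefixes, and the names `extract_links` and `weighted_edges` are all assumptions made for this example, and the weight is a naive link-frequency share rather than the semantic weighting described above.

```python
import re
from collections import Counter
from typing import List, Tuple

# Matches [[Target]], [[Target|label]] and [[Target#Section|label]].
# Assumption: a regex is a simplification; real wikitext (templates,
# nested links in captions, nowiki blocks) needs a proper parser.
WIKILINK = re.compile(r"\[\[([^\[\]|#]+)(?:#[^\[\]|]*)?(?:\|[^\[\]]*)?\]\]")

# Namespaces skipped in this sketch (hypothetical, non-exhaustive list).
SKIPPED_PREFIXES = ("File:", "Image:", "Category:", "Template:")


def normalize_title(raw: str) -> str:
    """Replace underscores, trim whitespace, and upper-case the first letter."""
    title = raw.replace("_", " ").strip()
    return title[:1].upper() + title[1:]


def extract_links(wikitext: str) -> Counter:
    """Return a Counter mapping each link target to its number of occurrences."""
    counts: Counter = Counter()
    for match in WIKILINK.finditer(wikitext):
        title = normalize_title(match.group(1))
        if title and not title.startswith(SKIPPED_PREFIXES):
            counts[title] += 1
    return counts


def weighted_edges(source: str, wikitext: str) -> List[Tuple[str, str, float]]:
    """Build (source, target, weight) edges.

    The weight is the target's share of all links counted on the page,
    a naive stand-in for a semantic relevance score.
    """
    counts = extract_links(wikitext)
    total = sum(counts.values())
    return [(source, target, n / total) for target, n in sorted(counts.items())]
```

Each resulting tuple could then be written to the graph as a weighted relationship between two page nodes, and low-weight edges could be filtered out at query time. Links that appear only inside templates or infoboxes are not resolved by this sketch, which is one plausible source of missing or incorrect relationships.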
If any of you would like to access the database for exploration, please contact me privately and I will give you credentials.
Any recommendations on parsing, hosting, visualization, or anything else are appreciated. Endorsements and volunteers are also highly appreciated!
P.S. I am new to engaging directly with the Wiki community; if I committed a faux pas in starting this thread, please let me know and I will do my best to correct it.
Ian, how do I/we get in touch with you directly?
Thanks, --David
David@iconicloud.com
On Monday, August 1, 2016, Ian Seyer ian.seyer@gmail.com wrote:
[...]
I am reachable at this email, ian.seyer@gmail.com.
On Mon, Aug 1, 2016 at 1:53 PM David McDonell david@iconicloud.com wrote:
[...]
--
David McDonell
Co-founder & CEO
ICONICLOUD, Inc.
"Illuminating the cloud"
M: 703-864-1203
EM: david@iconicloud.com
URL: http://iconicloud.com
_______________________________________________
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata
I recommend you have a look at the Sweble project (http://sweble.org/), at least for the parsing. They basically represent all of Wikipedia (and potentially all Wikipedias together) as one huge parse tree, using an XML database.
The website doesn't have many details, but they are building some interesting projects on top of this. It is best to contact Dirk Riehle directly: https://osr.cs.fau.de/people/members/riehle-dirk/.
On 01.08.2016 at 20:38, Ian Seyer wrote:
[...]