Hi, I'm looking into ways to use tabular data like https://commons.wikimedia.org/wiki/Data:Zika-institutions-test.tab in SPARQL queries but could not find anything on that.
My motivation comes in part from the timeout limits. The basic idea would be to split queries that typically time out into sets of queries that do not time out and whose results, once aggregated, would match what the original query would have returned had it not timed out.
The second line of motivation is keeping track of how things develop over time, which would be interesting for both content and maintenance queries, as well as for the usage of things like classes, references, lexemes or properties.
I would appreciate any pointers or thoughts on the matter.
Thanks,
Daniel
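The splitting idea in the message above can be sketched roughly as follows. This is only an illustration under stated assumptions, not an existing tool: the query template, the helper names (chunked, build_query, run_in_batches) and the injected run_query callable are all hypothetical, and the sketch assumes the batches are disjoint and that the aggregate can be summed across batches (true for COUNT or SUM, not for COUNT(DISTINCT ...) over values shared between batches).

```python
from collections import Counter
from typing import Callable, Iterator, List, Mapping, Sequence

# Hypothetical query template: counts statements per property for one batch
# of items. Doubled braces are literal braces for str.format().
QUERY_TEMPLATE = """
SELECT ?property (COUNT(*) AS ?n) WHERE {{
  VALUES ?item {{ {values} }}
  ?item ?property ?value .
}}
GROUP BY ?property
"""


def chunked(items: Sequence[str], size: int) -> Iterator[List[str]]:
    """Yield consecutive, non-overlapping batches of at most `size` items."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(items), size):
        yield list(items[start:start + size])


def build_query(batch: Sequence[str]) -> str:
    """Fill the template with one batch of item IDs such as 'Q42'."""
    values = " ".join(f"wd:{qid}" for qid in batch)
    return QUERY_TEMPLATE.format(values=values)


def run_in_batches(
    items: Sequence[str],
    size: int,
    run_query: Callable[[str], Mapping[str, int]],
) -> Counter:
    """Run one small query per batch and sum the partial counts.

    `run_query` is supplied by the caller (an assumption of this sketch):
    it sends the query text to an endpoint and returns {key: count}.
    """
    total: Counter = Counter()
    for batch in chunked(items, size):
        total.update(run_query(build_query(batch)))  # adds counts per key
    return total
```

Passing run_query in as a parameter keeps the batching logic independent of any particular endpoint or HTTP client, so the same loop can be pointed at whichever service is being queried.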
I have actually already implemented SPARQL support for that, but it needs a bit more work before it can be properly merged with the Blazegraph code. I had it working for a while as part of Sophox (OSM SPARQL).
* docs: https://wiki.openstreetmap.org/wiki/Sophox#External_Data_Sources
* code: https://github.com/Sophox/wikidata-query-rdf/compare/master...Sophox:tabular (see Tabular* files)
* Phabricator discussion about the above code: https://phabricator.wikimedia.org/T181319
Tabular support allows any CSV-style table to be treated as a federated source. With minor changes it should be possible to use MediaWiki's .tab pages too.
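Until something like the federated tabular source described above is available on a given endpoint, one client-side workaround is to inline a small table into the query as a VALUES block. The sketch below is not the approach in the Sophox branch. It assumes the JSON layout used by Commons .tab pages (a "schema" with named "fields" and a "data" array of rows); the sample table, its column names and the helper names term and tab_to_values are hypothetical. It also assumes the field names are usable as SPARQL variable names and that the wd: prefix is predefined by the endpoint.

```python
import json
import re

# Hypothetical sample in the shape of a Commons Data:*.tab page.
SAMPLE_TAB = """
{
  "license": "CC0-1.0",
  "schema": {"fields": [
    {"name": "item", "type": "string"},
    {"name": "count", "type": "number"}
  ]},
  "data": [["Q42", 3], ["Q64", 7]]
}
"""


def term(value, as_item=False):
    """Render one table cell as a SPARQL term (hypothetical helper)."""
    if value is None:
        return "UNDEF"  # empty cells stay unbound in VALUES
    if as_item:
        # Accept only plain item IDs so nothing else leaks into the query.
        if not re.fullmatch(r"Q[1-9]\d*", str(value)):
            raise ValueError(f"not an item ID: {value!r}")
        return f"wd:{value}"
    if isinstance(value, bool):  # check bool before int: bool is an int
        return "true" if value else "false"
    if isinstance(value, (int, float)):
        return str(value)
    if isinstance(value, str):
        # JSON string escaping is used here as an approximation of
        # SPARQL string-literal escaping.
        return json.dumps(value, ensure_ascii=False)
    raise TypeError(f"unsupported cell type: {type(value).__name__}")


def tab_to_values(tab, item_columns=()):
    """Build a multi-column VALUES block from a parsed .tab document."""
    names = [field["name"] for field in tab["schema"]["fields"]]
    header = " ".join(f"?{name}" for name in names)
    rows = [
        "  (" + " ".join(
            term(cell, name in item_columns)
            for name, cell in zip(names, row)
        ) + ")"
        for row in tab["data"]
    ]
    return "VALUES (" + header + ") {\n" + "\n".join(rows) + "\n}"


print(tab_to_values(json.loads(SAMPLE_TAB), item_columns={"item"}))
```

This only suits small tables, since the whole table travels inside the query text; a federated source avoids that limit.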
On Fri, May 31, 2019 at 6:01 PM Daniel Mietchen <daniel.mietchen@googlemail.com> wrote:
Hi, I'm looking into ways to use tabular data like https://commons.wikimedia.org/wiki/Data:Zika-institutions-test.tab in SPARQL queries but could not find anything on that.
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Thanks, Yuri - this looks promising!
On Fri, May 31, 2019 at 5:28 PM Yuri Astrakhan yuriastrakhan@gmail.com wrote:
I actually already implemented support in SPARQL for that, but it needs a bit more work to get it properly merged with the Blazegraph code. I had it working for a while as part of Sophox (OSM Sparql).
Hi Yuri,
What is the SPARQL Query Service endpoint? Basically, the equivalent of http://query.wikidata.org/sparql?
Kingsley
Kingsley, I'm not sure I understood the question. Please take a look at the Phabricator ticket that describes the implementation approach. The code is currently sitting in a branch; it can easily be merged into Sophox's master branch and enabled on the Sophox endpoint.
On Mon, Jun 3, 2019 at 8:38 PM Kingsley Idehen kidehen@openlinksw.com wrote:
-- Regards,
Kingsley Idehen
Founder & CEO
OpenLink Software
Home Page: http://www.openlinksw.com
Community Support: https://community.openlinksw.com
Weblogs (Blogs):
Company Blog: https://medium.com/openlink-software-blog
Virtuoso Blog: https://medium.com/virtuoso-blog
Data Access Drivers Blog: https://medium.com/openlink-odbc-jdbc-ado-net-data-access-drivers
Personal Weblogs (Blogs):
Medium Blog: https://medium.com/@kidehen
Legacy Blogs: http://www.openlinksw.com/blog/~kidehen/ http://kidehen.blogspot.com
Profile Pages:
Pinterest: https://www.pinterest.com/kidehen/
Quora: https://www.quora.com/profile/Kingsley-Uyi-Idehen
Twitter: https://twitter.com/kidehen
Google+: https://plus.google.com/+KingsleyIdehen/about
LinkedIn: http://www.linkedin.com/in/kidehen
Web Identities (WebID):
Personal: http://kingsley.idehen.net/public_home/kidehen/profile.ttl#i
: http://id.myopenlink.net/DAV/home/KingsleyUyiIdehen/Public/kingsley.ttl#this
On 6/3/19 2:00 PM, Yuri Astrakhan wrote:
Kingsley, I'm not sure I understood the question. Please take a look at the phabricator ticket that describes the implementation approach. The code is currently sitting in a branch, and can be easily merged with the sophox's master branch, and enabled on sophox endpoint.
Yuri,
I was just seeking the endpoint URL for the SPARQL Query Service. I think it is https://sophox.org/ .
Kingsley