I am trying to get paired articles from Simple English Wikipedia and English Wikipedia. For that I am looking for language links for Simple English Wikipedia. Is it available?
with regards
Ditty
You would probably need to pull from wikidata
On Fri, Oct 3, 2014 at 10:31 AM, Ditty Mathew dittyvkm@gmail.com wrote:
I am trying to get paired articles from Simple English Wikipedia and English Wikipedia. For that I am looking for language links for Simple English Wikipedia. Is it available?
with regards
Ditty
Xmldatadumps-l mailing list Xmldatadumps-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l
How to do that?
with regards
Ditty
On Fri, Oct 3, 2014 at 10:43 AM, John phoenixoverride@gmail.com wrote:
You would probably need to pull from wikidata
On Fri, Oct 3, 2014 at 10:31 AM, Ditty Mathew dittyvkm@gmail.com wrote:
I am trying to get paired articles from Simple English Wikipedia and English Wikipedia. For that I am looking for language links for Simple English Wikipedia. Is it available?
with regards
Ditty
Xmldatadumps-l mailing list Xmldatadumps-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l
Hi. If you need it "live" and don't mind doing it one at a time, you can use the Wikimedia API. You can read here for more info: https://www.mediawiki.org/wiki/API:Properties#langlinks_.2F_ll
For example, here's the API call to get all langlinks for https://simple.wikipedia.org/wiki/Earth: http://simple.wikipedia.org/w/api.php?action=query&prop=langlinks&ti...
If you need to do a lot of them at once, then you are better off trying to download a wikidata dump and working on it from your machine. I don't think remember if the data is presented simply, so you may have to do some exploration. You'd start off with the extension here: https://www.mediawiki.org/wiki/Extension:Wikibase . The dump is here https://dumps.wikimedia.org/wikidatawiki/
If you're comfortable with SQL, you can try doing it with XOWA at http://xowa.sourceforge.net . It'll probably be easier than working with wikibase and the dump. You'd have to do the following:
- Import the www.wikidata.org dump at home/wiki/Help:Import/List (about 1 hour) - Open up the followng file in a sqlite shell: /xowa/wiki/ www.wikidata.org/www.wikidata.org.003.sqlite3 - Run the following SQL
--create an index for faster lookup. this may take 20 min CREATE INDEX wdata_qids_trg_ttl ON wdata_qids (wq_trg_ttl, wq_src_wiki, wq_src_ns, wq_src_ttl); --run the SQL to get the links for Earth SELECT q.* FROM wdata_qids t JOIN wdata_qids q ON t.wq_trg_ttl = q.wq_trg_ttl WHERE t.wq_src_wiki = 'simplewiki' AND t.wq_src_ns = '0' AND t.wq_src_ttl = 'Earth';
The last statement produces something like the following:
abwiki,0,"Адгьыл",q2 acewiki,0,"Bumoë",q2 afwiki,0,Aarde,q2 alswiki,0,Erde,q2 amwiki,0,"መሬት",q2 angwiki,0,"Eorðe",q2 anwiki,0,Tierra,q2 arcwiki,0,"ܐܪܥܐ",q2 arwiki,0,"الأرض",q2 arzwiki,0,"الارض",q2 (etc...)
Hope this helps.
On Fri, Oct 3, 2014 at 11:44 AM, Ditty Mathew dittyvkm@gmail.com wrote:
How to do that?
with regards
Ditty
On Fri, Oct 3, 2014 at 10:43 AM, John phoenixoverride@gmail.com wrote:
You would probably need to pull from wikidata
On Fri, Oct 3, 2014 at 10:31 AM, Ditty Mathew dittyvkm@gmail.com wrote:
I am trying to get paired articles from Simple English Wikipedia and English Wikipedia. For that I am looking for language links for Simple English Wikipedia. Is it available?
with regards
Ditty
Xmldatadumps-l mailing list Xmldatadumps-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l
Xmldatadumps-l mailing list Xmldatadumps-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l
xmldatadumps-l@lists.wikimedia.org