(I sent this to xmldatadumps-l yesterday but just realised that this might be a more suitable place.)
Hallo,
I'm looking at the data dumps for all Wikipedia languages and noticed that for some larger wikis, the geo_tags.sql.gz dump file does not include any geotags found in articles. Is it possible to determine why this is, and for which languages this is the case?
For example, the geotags dump file for Indonesian (a wiki with
400,000 articles) is only 7kb large, and all geotags in it are from
user pages, file uploads, or file templates, but not from articles: https://dumps.wikimedia.org/idwiki/20181020/
Yet it doesn't take much effort to find pages that are geotagged, such as this one (see the infobox): https://id.wikipedia.org/wiki/London
I realise that there are a number of alternative geotagging conventions. Does idwiki possibly use a geotagging scheme that is not supported by some part of this data ingestion/export process? Which other wikis/languages may fall in this category?
I tried to find the script(s) that populate the geo_tags table from page content but so far had no luck, as I'm not sufficiently familiar with WP's software architecture; if someone can point me in the right direction I'd be happy to investigate myself.
Many thanks!
m.
The extension responsible for adding the data is https://www.mediawiki.org/wiki/Extension:GeoData
The template ({{coord}}) and module (Module:Coord) on id.wp seem taken as is from en.wp, and claims to be adding the data, although I can't find the specific code on my phone (which means absolutely nothing).
I would check if the template or module call the parser function first. You can also check if this is a dump problem by asking for the data through the api.
HTH Strainu
Pe miercuri, 7 noiembrie 2018, Martin Dittus martin@dekstop.de a scris:
(I sent this to xmldatadumps-l yesterday but just realised that this might be a more suitable place.)
Hallo,
I'm looking at the data dumps for all Wikipedia languages and noticed that for some larger wikis, the geo_tags.sql.gz dump file does not include any geotags found in articles. Is it possible to determine why this is, and for which languages this is the case?
For example, the geotags dump file for Indonesian (a wiki with
400,000 articles) is only 7kb large, and all geotags in it are from
user pages, file uploads, or file templates, but not from articles: https://dumps.wikimedia.org/idwiki/20181020/
Yet it doesn't take much effort to find pages that are geotagged, such as this one (see the infobox): https://id.wikipedia.org/wiki/London
I realise that there are a number of alternative geotagging conventions. Does idwiki possibly use a geotagging scheme that is not supported by some part of this data ingestion/export process? Which other wikis/languages may fall in this category?
I tried to find the script(s) that populate the geo_tags table from page content but so far had no luck, as I'm not sufficiently familiar with WP's software architecture; if someone can point me in the right direction I'd be happy to investigate myself.
Many thanks!
m.
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
The coordinates template/module needs to use the {{#coordinates:…}} parser function for the page to be geotagged.
Thank you Strainu and Bartosz, this was very useful.
As far as I can tell, idwiki editors simply don't use the GeoData geotagging conventions. Instead, people specify article location as infobox latd/longd properties, which on enwiki (and likely others) is being deprecated in favour of GeoData tags. While this older method allows a map to be displayed on the page, the coordinates are not imported by the GeoData extension, and as a result none of these geotagged pages show up in API geo lookups, or in the data dumps.
See also: https://en.wikipedia.org/wiki/Wikipedia:Coordinates_in_infoboxes
I'm now pondering if there's a quick way to asses for which wikis this is the case... I may report back if I find a simple approach, beyond simply manually checking every wiki.
Thanks again!
m. On Wed, Nov 7, 2018 at 10:31 PM Bartosz Dziewoński matma.rex@gmail.com wrote:
The coordinates template/module needs to use the {{#coordinates:…}} parser function for the page to be geotagged.
-- Bartosz Dziewoński
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
List of wikis by geo_tags count may help to identify wikis that aren't using geo tags:
abwiki 0 adywiki 0 akwiki 0 arcwiki 0 aywiki 0 bgwiki 0 biwiki 0 bjnwiki 0 bmwiki 0 bowiki 0 bpywiki 0 brwiki 0 bugwiki 0 bxrwiki 0 chrwiki 0 chywiki 0 crhwiki 0 crwiki 0 csbwiki 0 cuwiki 0 diqwiki 0 dzwiki 0 eewiki 0 emlwiki 0 extwiki 0 ffwiki 0 fjwiki 0 fywiki 0 gagwiki 0 ganwiki 0 gdwiki 0 glkwiki 0 gotwiki 0 gvwiki 0 hawiki 0 hawwiki 0 hsbwiki 0 iawiki 0 iewiki 0 igwiki 0 ikwiki 0 inhwiki 0 iuwiki 0 jamwiki 0 jbowiki 0 kaawiki 0 kbdwiki 0 kbpwiki 0 kgwiki 0 kiwiki 0 klwiki 0 koiwiki 0 krcwiki 0 kshwiki 0 kswiki 0 kvwiki 0 ladwiki 0 lbewiki 0 lfnwiki 0 lijwiki 0 liwiki 0 lrcwiki 0 ltgwiki 0 mdfwiki 0 mhrwiki 0 miwiki 0 mrjwiki 0 mtwiki 0 mznwiki 0 nahwiki 0 napwiki 0 nawiki 0 novwiki 0 nrmwiki 0 nvwiki 0 omwiki 0 oswiki 0 pamwiki 0 papwiki 0 pcdwiki 0 pdcwiki 0 piwiki 0 pmswiki 0 pntwiki 0 quwiki 0 rmwiki 0 rmywiki 0 rnwiki 0 rupwiki 0 rwwiki 0 scnwiki 0 sewiki 0 sgwiki 0 skwiki 0 srnwiki 0 stqwiki 0 swwiki 0 szlwiki 0 tcywiki 0 tiwiki 0 towiki 0 tpiwiki 0 tumwiki 0 twwiki 0 udmwiki 0 ugwiki 0 vecwiki 0 vepwiki 0 vewiki 0 vlswiki 0 vowiki 0 wawiki 0 wowiki 0 wuuwiki 0 xalwiki 0 xmfwiki 0 yiwiki 0 yowiki 0 zawiki 0 zeawiki 0 classicalwiki 0 nanwiki 0 zuwiki 0 amwiki 1 oldwiki 1 frrwiki 1 gorwiki 1 kwwiki 1 nywiki 1 sowiki 1 tywiki 1 anwiki 2 lgwiki 2 cowiki 3 smwiki 3 smgwiki 4 stwiki 5 lnwiki 6 cdowiki 7 iowiki 8 kuwiki 8 tetwiki 8 lbwiki 9 kawiki 10 sswiki 13 eowiki 14 snwiki 14 atjwiki 15 chwiki 15 pagwiki 18 gnwiki 21 zamwiki 23 hrwiki 37 mkwiki 37 hakwiki 38 satwiki 39 xhwiki 40 angwiki 45 fowiki 50 furwiki 55 tyvwiki 63 pihwiki 66 bmswiki 67 dinwiki 70 mwlwiki 74 tkwiki 86 gomwiki 99 lowiki 138 frpwiki 139 hifwiki 140 htwiki 186 tnwiki 229 tarawiki 249 kabwiki 282 vrowiki 287 mnwiki 295 sawiki 336 dtywiki 338 ruewiki 374 sdwiki 401 avwiki 462 bclwiki 481 dvwiki 499 minwiki 511 idwiki 571 olowiki 575 mrwiki 580 suwiki 589 nlwiki 628 dsbwiki 639 kmwiki 655 sahwiki 691 aswiki 759 lezwiki 761 iswiki 961 pnbwiki 1172 scwiki 1286 gawiki 1290 lawiki 1354 arzwiki 1387 bhwiki 1533 siwiki 1560 pswiki 1621 yuewiki 1726 lmowiki 1788 warwiki 1820 pflwiki 1882 acewiki 1934 newwiki 1984 nsowiki 2394 myvwiki 2660 orwiki 3181 knwiki 3262 ckbwiki 3666 tgwiki 3846 maiwiki 5152 newiki 5179 tlwiki 5305 ndswiki 5808 pawiki 6390 tawiki 7747 mlwiki 8229 tewiki 8424 alswiki 9300 cvwiki 9821 afwiki 11025 jvwiki 11062 tswiki 12077 barwiki 12224 bnwiki 13776 slwiki 14608 astwiki 14778 mswiki 15047 simplewiki 15836 hiwiki 15933 kywiki 16201 ttwiki 16624 azwiki 17440 sqwiki 18621 scowiki 19495 mywiki 19593 ilowiki 19769 guwiki 20287 cywiki 21966 lvwiki 24097 glwiki 24254 bawiki 25354 hewiki 27228 azbwiki 29506 thwiki 29889 etwiki 31017 elwiki 33093 urwiki 33521 bswiki 34825 mgwiki 34926 ocwiki 37370 trwiki 43747 nnwiki 47846 ltwiki 54916 bewiki 57123 fiwiki 59629 kowiki 63841 uzwiki 67233 kkwiki 71017 cewiki 77008 nowiki 101437 hywiki 105532 viwiki 110624 dawiki 112547 jawiki 115929 huwiki 136378 euwiki 142636 ptwiki 144496 cswiki 154191 rowiki 181206 fawiki 185066 nlwiki 219759 shwiki 263092 ukwiki 305328 cawiki 307912 zhwiki 310981 itwiki 361703 srwiki 374054 arwiki 381995 plwiki 387716 eswiki 474918 ruwiki 480509 frwiki 665610 dewiki 1348793 enwiki 1969331 svwiki 3027477 cebwiki 5763981
On Thu, Nov 8, 2018 at 5:23 PM Martin Dittus martin@dekstop.de wrote:
Thank you Strainu and Bartosz, this was very useful.
As far as I can tell, idwiki editors simply don't use the GeoData geotagging conventions. Instead, people specify article location as infobox latd/longd properties, which on enwiki (and likely others) is being deprecated in favour of GeoData tags. While this older method allows a map to be displayed on the page, the coordinates are not imported by the GeoData extension, and as a result none of these geotagged pages show up in API geo lookups, or in the data dumps.
See also: https://en.wikipedia.org/wiki/Wikipedia:Coordinates_in_infoboxes
I'm now pondering if there's a quick way to asses for which wikis this is the case... I may report back if I find a simple approach, beyond simply manually checking every wiki.
Thanks again!
m. On Wed, Nov 7, 2018 at 10:31 PM Bartosz Dziewoński matma.rex@gmail.com wrote:
The coordinates template/module needs to use the {{#coordinates:…}} parser function for the page to be geotagged.
-- Bartosz Dziewoński
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
Hehe I think you're right -- in our case it really doesn't need to be more complicated than that.
(We're currently studying Wikipedia's language geography, I'll likely post some preliminary findings on the research list in the coming months.)
Thanks all!
m. On Thu, Nov 8, 2018 at 4:12 PM Eran Rosenthal eranroz89@gmail.com wrote:
List of wikis by geo_tags count may help to identify wikis that aren't using geo tags:
abwiki 0 adywiki 0 akwiki 0 arcwiki 0 aywiki 0 bgwiki 0 biwiki 0 bjnwiki 0 bmwiki 0 bowiki 0 bpywiki 0 brwiki 0 bugwiki 0 bxrwiki 0 chrwiki 0 chywiki 0 crhwiki 0 crwiki 0 csbwiki 0 cuwiki 0 diqwiki 0 dzwiki 0 eewiki 0 emlwiki 0 extwiki 0 ffwiki 0 fjwiki 0 fywiki 0 gagwiki 0 ganwiki 0 gdwiki 0 glkwiki 0 gotwiki 0 gvwiki 0 hawiki 0 hawwiki 0 hsbwiki 0 iawiki 0 iewiki 0 igwiki 0 ikwiki 0 inhwiki 0 iuwiki 0 jamwiki 0 jbowiki 0 kaawiki 0 kbdwiki 0 kbpwiki 0 kgwiki 0 kiwiki 0 klwiki 0 koiwiki 0 krcwiki 0 kshwiki 0 kswiki 0 kvwiki 0 ladwiki 0 lbewiki 0 lfnwiki 0 lijwiki 0 liwiki 0 lrcwiki 0 ltgwiki 0 mdfwiki 0 mhrwiki 0 miwiki 0 mrjwiki 0 mtwiki 0 mznwiki 0 nahwiki 0 napwiki 0 nawiki 0 novwiki 0 nrmwiki 0 nvwiki 0 omwiki 0 oswiki 0 pamwiki 0 papwiki 0 pcdwiki 0 pdcwiki 0 piwiki 0 pmswiki 0 pntwiki 0 quwiki 0 rmwiki 0 rmywiki 0 rnwiki 0 rupwiki 0 rwwiki 0 scnwiki 0 sewiki 0 sgwiki 0 skwiki 0 srnwiki 0 stqwiki 0 swwiki 0 szlwiki 0 tcywiki 0 tiwiki 0 towiki 0 tpiwiki 0 tumwiki 0 twwiki 0 udmwiki 0 ugwiki 0 vecwiki 0 vepwiki 0 vewiki 0 vlswiki 0 vowiki 0 wawiki 0 wowiki 0 wuuwiki 0 xalwiki 0 xmfwiki 0 yiwiki 0 yowiki 0 zawiki 0 zeawiki 0 classicalwiki 0 nanwiki 0 zuwiki 0 amwiki 1 oldwiki 1 frrwiki 1 gorwiki 1 kwwiki 1 nywiki 1 sowiki 1 tywiki 1 anwiki 2 lgwiki 2 cowiki 3 smwiki 3 smgwiki 4 stwiki 5 lnwiki 6 cdowiki 7 iowiki 8 kuwiki 8 tetwiki 8 lbwiki 9 kawiki 10 sswiki 13 eowiki 14 snwiki 14 atjwiki 15 chwiki 15 pagwiki 18 gnwiki 21 zamwiki 23 hrwiki 37 mkwiki 37 hakwiki 38 satwiki 39 xhwiki 40 angwiki 45 fowiki 50 furwiki 55 tyvwiki 63 pihwiki 66 bmswiki 67 dinwiki 70 mwlwiki 74 tkwiki 86 gomwiki 99 lowiki 138 frpwiki 139 hifwiki 140 htwiki 186 tnwiki 229 tarawiki 249 kabwiki 282 vrowiki 287 mnwiki 295 sawiki 336 dtywiki 338 ruewiki 374 sdwiki 401 avwiki 462 bclwiki 481 dvwiki 499 minwiki 511 idwiki 571 olowiki 575 mrwiki 580 suwiki 589 nlwiki 628 dsbwiki 639 kmwiki 655 sahwiki 691 aswiki 759 lezwiki 761 iswiki 961 pnbwiki 1172 scwiki 1286 gawiki 1290 lawiki 1354 arzwiki 1387 bhwiki 1533 siwiki 1560 pswiki 1621 yuewiki 1726 lmowiki 1788 warwiki 1820 pflwiki 1882 acewiki 1934 newwiki 1984 nsowiki 2394 myvwiki 2660 orwiki 3181 knwiki 3262 ckbwiki 3666 tgwiki 3846 maiwiki 5152 newiki 5179 tlwiki 5305 ndswiki 5808 pawiki 6390 tawiki 7747 mlwiki 8229 tewiki 8424 alswiki 9300 cvwiki 9821 afwiki 11025 jvwiki 11062 tswiki 12077 barwiki 12224 bnwiki 13776 slwiki 14608 astwiki 14778 mswiki 15047 simplewiki 15836 hiwiki 15933 kywiki 16201 ttwiki 16624 azwiki 17440 sqwiki 18621 scowiki 19495 mywiki 19593 ilowiki 19769 guwiki 20287 cywiki 21966 lvwiki 24097 glwiki 24254 bawiki 25354 hewiki 27228 azbwiki 29506 thwiki 29889 etwiki 31017 elwiki 33093 urwiki 33521 bswiki 34825 mgwiki 34926 ocwiki 37370 trwiki 43747 nnwiki 47846 ltwiki 54916 bewiki 57123 fiwiki 59629 kowiki 63841 uzwiki 67233 kkwiki 71017 cewiki 77008 nowiki 101437 hywiki 105532 viwiki 110624 dawiki 112547 jawiki 115929 huwiki 136378 euwiki 142636 ptwiki 144496 cswiki 154191 rowiki 181206 fawiki 185066 nlwiki 219759 shwiki 263092 ukwiki 305328 cawiki 307912 zhwiki 310981 itwiki 361703 srwiki 374054 arwiki 381995 plwiki 387716 eswiki 474918 ruwiki 480509 frwiki 665610 dewiki 1348793 enwiki 1969331 svwiki 3027477 cebwiki 5763981
On Thu, Nov 8, 2018 at 5:23 PM Martin Dittus martin@dekstop.de wrote:
Thank you Strainu and Bartosz, this was very useful.
As far as I can tell, idwiki editors simply don't use the GeoData geotagging conventions. Instead, people specify article location as infobox latd/longd properties, which on enwiki (and likely others) is being deprecated in favour of GeoData tags. While this older method allows a map to be displayed on the page, the coordinates are not imported by the GeoData extension, and as a result none of these geotagged pages show up in API geo lookups, or in the data dumps.
See also: https://en.wikipedia.org/wiki/Wikipedia:Coordinates_in_infoboxes
I'm now pondering if there's a quick way to asses for which wikis this is the case... I may report back if I find a simple approach, beyond simply manually checking every wiki.
Thanks again!
m. On Wed, Nov 7, 2018 at 10:31 PM Bartosz Dziewoński matma.rex@gmail.com wrote:
The coordinates template/module needs to use the {{#coordinates:…}} parser function for the page to be geotagged.
-- Bartosz Dziewoński
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
(We're currently studying Wikipedia's language geography, I'll likely post some preliminary findings on the research list in the coming months.)
@Martin Dittus, please share it on maps-l@lists.wikimedia.org too.
On Thu, Nov 8, 2018 at 2:22 PM Martin Dittus martin@dekstop.de wrote:
Hehe I think you're right -- in our case it really doesn't need to be more complicated than that.
(We're currently studying Wikipedia's language geography, I'll likely post some preliminary findings on the research list in the coming months.)
Thanks all!
m. On Thu, Nov 8, 2018 at 4:12 PM Eran Rosenthal eranroz89@gmail.com wrote:
List of wikis by geo_tags count may help to identify wikis that aren't using geo tags:
abwiki 0 adywiki 0 akwiki 0 arcwiki 0 aywiki 0 bgwiki 0 biwiki 0 bjnwiki 0 bmwiki 0 bowiki 0 bpywiki 0 brwiki 0 bugwiki 0 bxrwiki 0 chrwiki 0 chywiki 0 crhwiki 0 crwiki 0 csbwiki 0 cuwiki 0 diqwiki 0 dzwiki 0 eewiki 0 emlwiki 0 extwiki 0 ffwiki 0 fjwiki 0 fywiki 0 gagwiki 0 ganwiki 0 gdwiki 0 glkwiki 0 gotwiki 0 gvwiki 0 hawiki 0 hawwiki 0 hsbwiki 0 iawiki 0 iewiki 0 igwiki 0 ikwiki 0 inhwiki 0 iuwiki 0 jamwiki 0 jbowiki 0 kaawiki 0 kbdwiki 0 kbpwiki 0 kgwiki 0 kiwiki 0 klwiki 0 koiwiki 0 krcwiki 0 kshwiki 0 kswiki 0 kvwiki 0 ladwiki 0 lbewiki 0 lfnwiki 0 lijwiki 0 liwiki 0 lrcwiki 0 ltgwiki 0 mdfwiki 0 mhrwiki 0 miwiki 0 mrjwiki 0 mtwiki 0 mznwiki 0 nahwiki 0 napwiki 0 nawiki 0 novwiki 0 nrmwiki 0 nvwiki 0 omwiki 0 oswiki 0 pamwiki 0 papwiki 0 pcdwiki 0 pdcwiki 0 piwiki 0 pmswiki 0 pntwiki 0 quwiki 0 rmwiki 0 rmywiki 0 rnwiki 0 rupwiki 0 rwwiki 0 scnwiki 0 sewiki 0 sgwiki 0 skwiki 0 srnwiki 0 stqwiki 0 swwiki 0 szlwiki 0 tcywiki 0 tiwiki 0 towiki 0 tpiwiki 0 tumwiki 0 twwiki 0 udmwiki 0 ugwiki 0 vecwiki 0 vepwiki 0 vewiki 0 vlswiki 0 vowiki 0 wawiki 0 wowiki 0 wuuwiki 0 xalwiki 0 xmfwiki 0 yiwiki 0 yowiki 0 zawiki 0 zeawiki 0 classicalwiki 0 nanwiki 0 zuwiki 0 amwiki 1 oldwiki 1 frrwiki 1 gorwiki 1 kwwiki 1 nywiki 1 sowiki 1 tywiki 1 anwiki 2 lgwiki 2 cowiki 3 smwiki 3 smgwiki 4 stwiki 5 lnwiki 6 cdowiki 7 iowiki 8 kuwiki 8 tetwiki 8 lbwiki 9 kawiki 10 sswiki 13 eowiki 14 snwiki 14 atjwiki 15 chwiki 15 pagwiki 18 gnwiki 21 zamwiki 23 hrwiki 37 mkwiki 37 hakwiki 38 satwiki 39 xhwiki 40 angwiki 45 fowiki 50 furwiki 55 tyvwiki 63 pihwiki 66 bmswiki 67 dinwiki 70 mwlwiki 74 tkwiki 86 gomwiki 99 lowiki 138 frpwiki 139 hifwiki 140 htwiki 186 tnwiki 229 tarawiki 249 kabwiki 282 vrowiki 287 mnwiki 295 sawiki 336 dtywiki 338 ruewiki 374 sdwiki 401 avwiki 462 bclwiki 481 dvwiki 499 minwiki 511 idwiki 571 olowiki 575 mrwiki 580 suwiki 589 nlwiki 628 dsbwiki 639 kmwiki 655 sahwiki 691 aswiki 759 lezwiki 761 iswiki 961 pnbwiki 1172 scwiki 1286 gawiki 1290 lawiki 1354 arzwiki 1387 bhwiki 1533 siwiki 1560 pswiki 1621 yuewiki 1726 lmowiki 1788 warwiki 1820 pflwiki 1882 acewiki 1934 newwiki 1984 nsowiki 2394 myvwiki 2660 orwiki 3181 knwiki 3262 ckbwiki 3666 tgwiki 3846 maiwiki 5152 newiki 5179 tlwiki 5305 ndswiki 5808 pawiki 6390 tawiki 7747 mlwiki 8229 tewiki 8424 alswiki 9300 cvwiki 9821 afwiki 11025 jvwiki 11062 tswiki 12077 barwiki 12224 bnwiki 13776 slwiki 14608 astwiki 14778 mswiki 15047 simplewiki 15836 hiwiki 15933 kywiki 16201 ttwiki 16624 azwiki 17440 sqwiki 18621 scowiki 19495 mywiki 19593 ilowiki 19769 guwiki 20287 cywiki 21966 lvwiki 24097 glwiki 24254 bawiki 25354 hewiki 27228 azbwiki 29506 thwiki 29889 etwiki 31017 elwiki 33093 urwiki 33521 bswiki 34825 mgwiki 34926 ocwiki 37370 trwiki 43747 nnwiki 47846 ltwiki 54916 bewiki 57123 fiwiki 59629 kowiki 63841 uzwiki 67233 kkwiki 71017 cewiki 77008 nowiki 101437 hywiki 105532 viwiki 110624 dawiki 112547 jawiki 115929 huwiki 136378 euwiki 142636 ptwiki 144496 cswiki 154191 rowiki 181206 fawiki 185066 nlwiki 219759 shwiki 263092 ukwiki 305328 cawiki 307912 zhwiki 310981 itwiki 361703 srwiki 374054 arwiki 381995 plwiki 387716 eswiki 474918 ruwiki 480509 frwiki 665610 dewiki 1348793 enwiki 1969331 svwiki 3027477 cebwiki 5763981
On Thu, Nov 8, 2018 at 5:23 PM Martin Dittus martin@dekstop.de wrote:
Thank you Strainu and Bartosz, this was very useful.
As far as I can tell, idwiki editors simply don't use the GeoData geotagging conventions. Instead, people specify article location as infobox latd/longd properties, which on enwiki (and likely others) is being deprecated in favour of GeoData tags. While this older method allows a map to be displayed on the page, the coordinates are not imported by the GeoData extension, and as a result none of these geotagged pages show up in API geo lookups, or in the data dumps.
See also: https://en.wikipedia.org/wiki/Wikipedia:Coordinates_in_infoboxes
I'm now pondering if there's a quick way to asses for which wikis this is the case... I may report back if I find a simple approach, beyond simply manually checking every wiki.
Thanks again!
m. On Wed, Nov 7, 2018 at 10:31 PM Bartosz Dziewoński <
matma.rex@gmail.com>
wrote:
The coordinates template/module needs to use the {{#coordinates:…}} parser function for the page to be geotagged.
-- Bartosz Dziewoński
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
Pe joi, 8 noiembrie 2018, Martin Dittus martin@dekstop.de a scris:
Thank you Strainu and Bartosz, this was very useful.
As far as I can tell, idwiki editors simply don't use the GeoData geotagging conventions. Instead, people specify article location as infobox latd/longd properties, which on enwiki (and likely others) is being deprecated in favour of GeoData tags. While this older method allows a map to be displayed on the page, the coordinates are not imported by the GeoData extension, and as a result none of these geotagged pages show up in API geo lookups, or in the data dumps.
I think you're confusing the coord template with the #coordinates: parser function - unintuitively, they're both in brackets. The first is a generic way of displaying some coordinates. It can be built by code from individual parameters, like on id.wp, or it can be passed already built, like on en.wp. What it does inside varies, and some wikis, in addition to displaying the coordinates, also call the parser function.
The parser function, on the other hand, simply saves the data in the database, without displaying anything. It is almost always called from templates or modules and very rarely directly from pages. Adding it to Template:coord (or Module:Coordinates, for most wikis nowadays) makes the coordinates magically appear in the database in some time. For instance, these are the changes I made to use the parser function on ro.wp: https://ro.wikipedia.org/w/index.php?title=Modul%3ACoordonate&type=revis...
Strainu
See also: https://en.wikipedia.org/wiki/Wikipedia:Coordinates_in_infoboxes
I'm now pondering if there's a quick way to asses for which wikis this is the case... I may report back if I find a simple approach, beyond simply manually checking every wiki.
Thanks again!
m. On Wed, Nov 7, 2018 at 10:31 PM Bartosz Dziewoński matma.rex@gmail.com wrote:
The coordinates template/module needs to use the {{#coordinates:…}} parser function for the page to be geotagged.
-- Bartosz Dziewoński
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
wikitech-l@lists.wikimedia.org