Hello all,
I am trying to download a number of Russian Wikipedia articles in either Firefox or Chrome. These articles have embedded URLs in Cyrillic. The texts of the actual articles in Cyrillic save correctly in text files, but the Cyrillic URLs are saved as gibberish. Does anyone have a solution? It doesn't have to be those two browsers I guess.
Thank you, Tom
Hi, thanks for email! This isn't mailing list for these question, here we ask questions related to Wikimedia Cloud services..
Do you mean on some numbers letters and percents (%)?
Best wishes, Zoran Volunteer of Serbian Wikipedia.
uto, 03. sep 2019. 20:18 Thomas Stieve tomthirteen@email.arizona.edu je napisao/la:
Hello all,
I am trying to download a number of Russian Wikipedia articles in either Firefox or Chrome. These articles have embedded URLs in Cyrillic. The texts of the actual articles in Cyrillic save correctly in text files, but the Cyrillic URLs are saved as gibberish. Does anyone have a solution? It doesn't have to be those two browsers I guess.
Thank you, Tom
-- Thomas Stieve Ph.D. Candidate School of Geography and Development University of Arizona _______________________________________________ Wikimedia Cloud Services mailing list Cloud@lists.wikimedia.org (formerly labs-l@lists.wikimedia.org) https://lists.wikimedia.org/mailman/listinfo/cloud
Hi Zoran,
Sorry, I wasn't exactly sure to go with this question.
Yes, exactly, the URLs are saved as numbers and percentages. How do I have them saved as the actual Cyrillic letters?
Thanks, Tom
On Tue, Sep 3, 2019 at 11:23 AM Zoran Dori zorandori4444@gmail.com wrote:
Hi, thanks for email! This isn't mailing list for these question, here we ask questions related to Wikimedia Cloud services..
Do you mean on some numbers letters and percents (%)?
Best wishes, Zoran Volunteer of Serbian Wikipedia.
uto, 03. sep 2019. 20:18 Thomas Stieve tomthirteen@email.arizona.edu je napisao/la:
Hello all,
I am trying to download a number of Russian Wikipedia articles in either Firefox or Chrome. These articles have embedded URLs in Cyrillic. The texts of the actual articles in Cyrillic save correctly in text files, but the Cyrillic URLs are saved as gibberish. Does anyone have a solution? It doesn't have to be those two browsers I guess.
Thank you, Tom
-- Thomas Stieve Ph.D. Candidate School of Geography and Development University of Arizona _______________________________________________ Wikimedia Cloud Services mailing list Cloud@lists.wikimedia.org (formerly labs-l@lists.wikimedia.org) https://lists.wikimedia.org/mailman/listinfo/cloud
Wikimedia Cloud Services mailing list Cloud@lists.wikimedia.org (formerly labs-l@lists.wikimedia.org) https://lists.wikimedia.org/mailman/listinfo/cloud
On 9/3/19 8:24 PM, Thomas Stieve wrote:
Yes, exactly, the URLs are saved as numbers and percentages. How do I have them saved as the actual Cyrillic letters?
Not sure if this is useful in your situation, but if you have a normal browser and you look at a Wikipedia article, and you want to save the URL, you can press ctrl-L to select all the text in the URL field and ctrl-C to copy, switch to some other place and ctrl-V to paste. This works fine but some URLs with non-ASCII (non-English) letters might be encoded as %numbers. If instead you press: ctrl-L right-arrow (to deselect and place cursor at the end) X (or any character) backspace (to remove that X again) ctrl-A (to select the entire URL field again) ctrl-C then, when you paste this, it will not have the % encoding.
An example is the Russian Wikipedia article https://ru.wikipedia.org/wiki/%D0%9C%D1%91%D0%B4 which then comes out as https://ru.wikipedia.org/wiki/%D0%9C%D1%91%D0%B4
This might not always work, but it works sometimes.