Hello all,
I am trying to download a number of Russian Wikipedia articles in either
Firefox or Chrome. These articles have embedded URLs in Cyrillic. The texts
of the actual articles in Cyrillic save correctly in text files, but the
Cyrillic URLs are saved as gibberish. Does anyone have a solution?
Thank you,
Tom
--
Thomas Stieve
Ph.D. Candidate
School of Geography and Development
University of Arizona
Show replies by thread
How are you downloading them?
The codification of the urls would be the same as the actual article
(utf-8), so I don't see how you could end up in a situation where the
Cyrillic text is fine but the Cyrillic urls are not.
If you could provide us the steps you are following for downloading the
articles, we may be able to test it.
Best regards