HI, Sorry for my english I use to speak in french.
I have a problem with the encodages in french dump. i use the Api of mediawiki to retrieve text of article.But there still have some %C3%A8 \u00e8 \ufffd to replace accented caracter(é,è,...).
How can i resolve this problem?
I hope that i posted this in the rigth place.
Tank's
Are you using the dump or the API? Those are two separate things (and if you're using the API, then you're also off-topic on this list).
And I think you will have to show us how exactly are you accessing the text of the article, it's likely that the problem is there.
Petr Onderka [[en:User:Svick]]
On Sat, Apr 20, 2013 at 11:38 AM, Yannick Guigui yanstv@gmail.com wrote:
HI, Sorry for my english I use to speak in french.
I have a problem with the encodages in french dump. i use the Api of mediawiki to retrieve text of article.But there still have some %C3%A8 \u00e8 \ufffd to replace accented caracter(é,è,...).
How can i resolve this problem?
I hope that i posted this in the rigth place.
Tank's
-- guigui777
Xmldatadumps-l mailing list Xmldatadumps-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l
xmldatadumps-l@lists.wikimedia.org