Re: [Wikitech-l] How to decode URL to article title?

26 Apr 2007

...
  OK, to summarize what this thread seems to be saying,
the procedure is:

 1. Split the title from the rest of the URL.

 2. Percent-decode the title, yielding a UTF-8 byte string.

 3. Convert the byte string into a Unicode string.

 4. Replace underscores with spaces.

 Step 4 yields the article title, which is what appears in the XML dumps. 
Wrong. AFAIK XML dumps are already encoded in UTF-8 so you don't need 
step 3. You'd only need to klnow it if you want to present somehow the 
title to the user.

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

2003

2002

Re: [Wikitech-l] How to decode URL to article title?