On Feb 19, 2008, at 10:56 AM, Kilian wrote:
Am Dienstag, den 19.02.2008, 10:41 -0600 schrieb Jim
Hu:
I have a problem where an import script has a
page name containing a
right arrow represented as -> this converts to a ->, which breaks
the import. I can replace this, but what should I use? Suggestions?
Hi Jim,
1) are you trying to import a MediaWiki XML dump?
No, I'm importing an XML file
that I generate from a flatfile from
another source (gene ontology terms).
2) Are you using a standard import script?
Yes. Importdump.php from maintenance.
3) Which step goes wrong, and how?
"WikiRevision given a null title in import.
You may need to adjust
$wgLegalTitleChars."
Now that you made me think harder about it, the problem is probably
related to my running htmlentities() on the title while constructing
the XML file. But the '>' is not a default legal title char, but I'm
not sure what happens if I try to enable it. < > are not discussed
in the docs at
http://www.mediawiki.org/wiki/Manual:%24wgLegalTitleChars
or
http://meta.wikimedia.org/wiki/Help:Page_name#Special_characters
I am assuming that since <> are used for XML tags and markup, enabling
these is not a good idea. Right now, I'm just converting '->' to
'-',
but this changes the meaning.
Jim
~ Kilian
_______________________________________________
MediaWiki-l mailing list
MediaWiki-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
=====================================
Jim Hu
Associate Professor
Dept. of Biochemistry and Biophysics
2128 TAMU
Texas A&M Univ.
College Station, TX 77843-2128
979-862-4054