[Wikipedia-l] [Fwd: [ wikipedia-Bugs-584804 ] URL followed by

lcrocker at nupedia.com lcrocker at nupedia.com
Thu Jul 25 08:13:21 UTC 2002


> I vote for punctuation cropping, too.

That seems to be a growing consensus.  The remaining questions,
then, are the exact details.  Exactly which characters do we want
to consider "punctuation", and under what circumstances?  My
suggestion is this: After parsing URLs the way it does now, if the
URL ends with one, and exactly one, of the characters period, comma,
question mark, or exclamation; then remove that character and
assume it punctuates the sentence. Otherwise, leave the URL alone.








More information about the Wikipedia-l mailing list