I vote for punctuation cropping, too.
That seems to be a growing consensus. The remaining questions, then, are the exact details. Exactly which characters do we want to consider "punctuation", and under what circumstances? My suggestion is this: After parsing URLs the way it does now, if the URL ends with one, and exactly one, of the characters period, comma, question mark, or exclamation; then remove that character and assume it punctuates the sentence. Otherwise, leave the URL alone.
lcrocker@nupedia.com wrote:
I vote for punctuation cropping, too.
That seems to be a growing consensus. The remaining questions, then, are the exact details. Exactly which characters do we want to consider "punctuation", and under what circumstances? My suggestion is this: After parsing URLs the way it does now, if the URL ends with one, and exactly one, of the characters period, comma, question mark, or exclamation; then remove that character and assume it punctuates the sentence. Otherwise, leave the URL alone.
I agree, but I'd also add semicolon and colon characters to that list.
Neil
.
----- Original Message ----- From: "Neil Harris" usenet@tonal.clara.co.uk To: wikipedia-l@nupedia.com Sent: Thursday, July 25, 2002 10:21 AM Subject: Re: [Wikipedia-l] [Fwd: [ wikipedia-Bugs-584804 ] URL followed by
| lcrocker@nupedia.com wrote: | | >>I vote for punctuation cropping, too. | >> | >> | > | >That seems to be a growing consensus. The remaining questions, | >then, are the exact details. Exactly which characters do we want | >to consider "punctuation", and under what circumstances? My | >suggestion is this: After parsing URLs the way it does now, if the | >URL ends with one, and exactly one, of the characters period, comma, | >question mark, or exclamation; then remove that character and | >assume it punctuates the sentence. Otherwise, leave the URL alone. | > | > | > | I agree, but I'd also add semicolon and colon characters to that list. | | Neil
...and single, double quotation marks, parentheses, and dashes.
WojPob
wikipedia-l@lists.wikimedia.org