[Foundation-l] Foundation-l word cloud
John Vandenberg
jayvdb at gmail.com
Mon Oct 4 23:24:06 UTC 2010
On Tue, Oct 5, 2010 at 7:48 AM, Peter Gehres <in2thats12 at gmail.com> wrote:
> In looking at the contents of the gzip'ed archives, stripping out the
> headers does not look trivial, but it appears that it could be done in most
> cases. A whole other problem is quoted text. Any preference on whether or
> not that should be included as well? If it is included, the word counts
> are not entirely accurate.
If quoted passages are being included, a simple way to address this is
to remove every line starting with '>', along with all attachments.
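Something along these lines might do it. This is only a rough sketch in
Python, not a tested tool: the function names are made up for illustration,
and the attachment marker is an assumption about how the archive software
marks scrubbed attachments, so check it against the actual archive files.

```python
import re
from collections import Counter

# Assumption: the archive introduces scrubbed attachments with a marker
# line like this one. Change it if the real archives use something else.
ATTACHMENT_MARKER = "-------------- next part --------------"


def strip_quotes_and_attachments(body: str) -> str:
    """Drop quoted lines (starting with '>') and anything after the marker."""
    kept = []
    for line in body.splitlines():
        if line.startswith(ATTACHMENT_MARKER):
            break  # treat the rest of the message as attachment text
        if line.startswith(">"):
            continue  # quoted text from an earlier message
        kept.append(line)
    return "\n".join(kept)


def word_counts(body: str) -> Counter:
    """Count lowercase words in the unquoted part of a message body."""
    cleaned = strip_quotes_and_attachments(body).lower()
    return Counter(re.findall(r"[a-z']+", cleaned))
```

Note that this only catches lines where '>' is the very first character,
so indented quotes or other quoting styles would still slip through.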
btw, very interesting Nemo!
--
John Vandenberg