On Fri, Feb 20, 2009 at 12:57 PM, Platonides <Platonides@gmail.com> wrote:
[snip]
> It could also pass a virus scan, but I don't think it's really needed. Virus scanners mainly look for known bad code inside executables. We don't want any kind of executable.
I've run ClamAV against the entire set of files in the past. It found a couple of interesting things (something like 3 files out of millions).
Converting with pdftops and back will probably kill the text layer entirely. Might as well render to images and convert to DjVu.