On 12/12/11 14:11, Rob Meijeren wrote:
> We load the printable page of the article and then walk through it. We put what we don't need in one variable and what we need in another. This works for most articles, but for the two I mentioned it goes wrong. After some digging we found out that in those 2 articles the div tag was missing.
That seems to be a problem with your extraction. Maybe you were stopping before "<!-- Saved in parser cache with key ..."?
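
To check whether that is what happens, something like the sketch below could help. It is not your extraction code, just a minimal illustration: it counts unbalanced div tags in a fragment using Python's standard html.parser. The sample page, the placeholder cache key and the names DivCounter and unclosed_divs are all made up for the example, and it assumes the comment sits before the closing </div>, which may not match the real printable page.

```python
from html.parser import HTMLParser

# Start of the comment mentioned above; the rest of the key is not known here.
MARKER = "<!-- Saved in parser cache with key"


class DivCounter(HTMLParser):
    """Tracks how many <div> tags are still open (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.depth = 0

    def handle_starttag(self, tag, attrs):
        if tag == "div":
            self.depth += 1

    def handle_endtag(self, tag):
        if tag == "div":
            self.depth -= 1


def unclosed_divs(fragment):
    """Return the number of <div> tags opened but not closed in fragment."""
    counter = DivCounter()
    counter.feed(fragment)
    counter.close()
    return counter.depth


# Made-up page: the cache key is a placeholder, not a real one.
page = (
    '<div id="content"><p>Article text</p>\n'
    "<!-- Saved in parser cache with key PLACEHOLDER -->\n"
    "</div>"
)

# Simulate an extraction that stops just before the comment.
truncated = page.split(MARKER)[0]

print(unclosed_divs(page))
print(unclosed_divs(truncated))
```

If the count is non-zero for the two failing articles but zero for the others, the extraction is probably cutting the page off before the closing tag rather than the tag being absent from the page itself.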