https://bugzilla.wikimedia.org/show_bug.cgi?id=72047
--- Comment #4 from Mpaa mpaa.wiki@gmail.com --- Regarding point 1)
In textlib.py, in timestripper(self, line), line should be stripped of all items where a valid date cannot be (comments, links, external links, etc.).
Esp. external links might be deceptive, saw some like this in cswp: [https://lists.wikimedia.org/pipermail/mobile-l/2014-August/007927.html]
Best way to reuse already available textlib functions with minimum code duplication TBD. Some regexes might become global constants, maybe?