Apologies if I missed some documentation or prior discussion about this,
but is there a reason why the seconds field in the /pagecounts-raw/ dump
filenames varies? It seems unnecessary to scrape and parse the HTML to get the
true filenames (e.g., pagecounts-20131021-160013.gz) instead of being able
to pass clean filenames (e.g., pagecounts-20131021-160000.gz), especially
when second-level precision isn't needed here. Is it
unreasonable to request that these be renamed to a more consistent and
clean format?
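
For illustration, the workaround I mean looks roughly like the sketch below.
It assumes the directory listing is plain HTML that contains the real
filenames somewhere in its text; the function name and the sample listing
fragment are hypothetical, and fetching the page is left out.

```python
import re

# Matches names like pagecounts-20131021-160013.gz. The last two digits
# are the seconds field, which differs from file to file.
FILENAME_RE = re.compile(r"pagecounts-(\d{8})-(\d{2})(\d{2})(\d{2})\.gz")

def index_by_hour(listing_html):
    """Map (date, hour) to the actual filename found in a listing page."""
    found = {}
    for match in FILENAME_RE.finditer(listing_html):
        date, hour, _minute, _second = match.groups()
        found[(date, hour)] = match.group(0)
    return found

# Hypothetical fragment of a listing page, for illustration only.
sample = '<a href="pagecounts-20131021-160013.gz">pagecounts-20131021-160013.gz</a>'
lookup = index_by_hour(sample)
print(lookup[("20131021", "16")])
```

With consistent names, the lookup step would not be needed at all, since the
filename could be built directly from the date and hour.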
Thanks!
Brian