Fred Benenson wrote:
Hi there, my name is Fred Benenson and I'm a graduate student (as well as a free culture activist) doing research on Wikipedia. I'm working on a problem that requires an up-to-date version of the Wikipedia logging data.
I was told to look here (after posting, unsuccessfully, to JIRA) for help with a problem I've found in a dump:
Despite its name, enwiki-latest-logging (available at http://download.wikimedia.org/enwiki/latest/) is not actually the latest logging information.
I've found the most recent log_timestamp of a row is somewhere around February 2007. This means that the dump is about a year old, and is not the 'latest' version. It'd be great to get a fairly recent (within a month) version up live.
Let me know if there's any way I can help or make this easier.
Thanks.
Fred
The Last-Modified and ETag headers of http://download.wikimedia.org/enwiki/latest/enwiki-latest-logging.sql.gz match those of http://download.wikimedia.org/enwiki/20080103/enwiki-20080103-logging.sql.gz, as they should.
ETag: "-1820672343"
Last-Modified: Thu, 03 Jan 2008 22:06:17 GMT
So it may be that logging is not being updated and instead an old version is being copied around?
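For anyone who wants to repeat the header comparison, here is a rough sketch in Python. The function names (validators, same_resource, head) are my own and purely illustrative, and it assumes the server answers HEAD requests with both headers present.

```python
from urllib.request import Request, urlopen


def validators(headers):
    """Return the (ETag, Last-Modified) pair from a mapping of response headers."""
    # Header names are case-insensitive, so normalise them before the lookup.
    lowered = {name.lower(): value.strip() for name, value in headers.items()}
    return lowered.get("etag"), lowered.get("last-modified")


def same_resource(headers_a, headers_b):
    """True only if both responses carry identical ETag and Last-Modified values."""
    a, b = validators(headers_a), validators(headers_b)
    # A missing header proves nothing, so require both validators on both sides.
    return None not in a and a == b


def head(url):
    """Issue a HEAD request and return the response headers as a dict."""
    with urlopen(Request(url, method="HEAD")) as response:
        return dict(response.headers.items())
```

Calling same_resource(head(latest_url), head(dated_url)) with the two URLs above should then say whether "latest" points at the 20080103 file. Matching headers only show that the two URLs serve the same file, not that its contents are current.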
Some more thoughts after downloading:
- Both files are identical.
- I got a corrupted copy (gzip: enwiki-20080103-logging.sql.gz: unexpected end of file) where the latest entry was 20050108105348, and a correct one where it was 20080103215656.
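A truncated download like that can be detected without loading the SQL anywhere. This is a minimal sketch, assuming Python 3.8 or later; gzip_is_complete is a name I made up for illustration. It does roughly what gzip -t does: decompress the whole stream and see whether it ends cleanly.

```python
import gzip
import zlib


def gzip_is_complete(path, chunk_size=1 << 20):
    """Return True if the gzip file at `path` decompresses through to its end marker."""
    try:
        with gzip.open(path, "rb") as stream:
            # Read and discard the data in chunks; only errors matter here.
            while stream.read(chunk_size):
                pass
    except (EOFError, gzip.BadGzipFile, zlib.error):
        # EOFError is what a file cut off mid-stream raises; the other two
        # cover a damaged header or corrupted compressed data.
        return False
    return True
```

A False result for the downloaded file would point to a bad transfer, not to a stale dump.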
So maybe your copy wasn't so good? Check the md5sums.
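If md5sum isn't available on your machine, the same check is a few lines of Python. This is only a sketch; the function name and the chunk size are my own choices. The resulting hex digest is what you would compare against the checksum published for the dump.

```python
import hashlib


def md5sum(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of the file at `path`, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as stream:
        # Chunked reads keep memory use flat even for multi-gigabyte dumps.
        for chunk in iter(lambda: stream.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

If the digest differs from the published one, download the file again before concluding that the dump itself is out of date.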