Hi, folks,

Recently, I'm working on a research project which needs extracting article information from wikipedia.

I managed to get pywikibot work on my computer and was able to pull out a few simple results.

One question is regarding a method called pywikibot.pagegenerators.AllpagesPageGenerator.

By setting the argument "content" to "True", it will return a page generator with current version. But, which version will be returned if setting the argument to False?

Also, is there a way in pywikibot to get a page generator that contains articles/pages up to a certain date?

Maybe, pywikibot is not a right tool to do this.

I was thinking of using wiki dump data instead of using a wiki API.

But, it seems the files are huge. I appreciate it you happen to have any idea to deal with this.  


Thanks a lot!

hz.cmu