It might be faster to generate that list using a sql query directly on one of wikimedia database server, additionaly tt will save some bandwith.
That's a great idea, IMO, for exactly the reasons you cite. The page need not be updated frequently.
The query does take over 12 hours to run, albeit on a machine of limited resources. While I am not an expert on query optimization, I doubt that much can be done to speed it up.
FWIW there is presently a three month backlog of queries to be run, on the list on the meta.
UninvitedCompany
wikitech-l@lists.wikimedia.org