Emilio J. RodrÃguez-Posada, 14/10/2013 14:18:
Internet Archive has this problem in several other topics, like its Wayback Machine, there is not search engine to search the billions grabbed websites by keyword of whatever.
Internet Archive is a pile of hard disks and a time capsule with backups, and they try to do the best at showing the materials (media players, pdf viewers), but it is not always easy or possible.
...and that's why Hay said we need someone with a good idea. :) Now it's easy to download the dataset (though it's not perfect), of course this doesn't automatically make something cool happen with it. Except replication of the data in multiple places, which is a good thing in itself.
Nemo