Andreas Meier, 28/07/2013 22:48:
Hello,
there is a problem with the extracted page abstracts for Yahoo on the
big wikis moved to the new infrastructure. During generation everything
seems to be fine, but it ended with a 159kb file.
An other question: Why is this step not parallelized?
Sorry, I can't answer your questions but I have one for you: as you are
interested in these abstracts, could you please add a line about them
and the use case(s) for them to
<https://meta.wikimedia.org/wiki/Data_dumps/What%27s_available_for_download>?
Thank you!
Apart from what their name tells, these files has always been quite
mysterious.
Nemo