Andreas Meier, 28/07/2013 22:48:
there is a problem with the extracted page abstracts for Yahoo on the
big wikis moved to the new infrastructure. During generation everything
seems to be fine, but it ended with a 159kb file.
An other question: Why is this step not parallelized?
Sorry, I can't answer your questions but I have one for you: as you are
interested in these abstracts, could you please add a line about them
and the use case(s) for them to
Apart from what their name tells, these files has always been quite