Hello,
there is a problem with the extracted page abstracts for Yahoo on the
big wikis moved to the new infrastructure. During generation everything
seems to be fine, but it ended with a 159kb file.
An other question: Why is this step not parallelized?
Best regards
Andreas Meier
So in spite of my keeping an eye on space on one of our dump hosts, when
I started the en wiki job a little bit ago it filled up shortly
afterwards. I've reclaimed some space by removing two older de wiki
dumps for now, and will deal with restarts and cleanup of other jobs
either over the weekend or on Monday.
Ariel