Update on this:
* Piwik is not finding a lot of love. The readership team is working on puppetizing it and we theoretically have hardware to run it, but we haven't decided it's a good idea for Analytics to support this yet.
* We're a (bit?) more optimistic about parallel Event Logging processors. Last we spoke Madhu was going to try and modify the eventlogging_processor code to allow this.
In short, the best bet for getting data into HDFS right now might be to make an EL schema and wait for us to move it to Kafka transport.