Hi everybody!

I created the following doc: https://wikitech.wikimedia.org/wiki/Analytics/Tutorials/Analytics_Client_Nodes

It contains two FAQ:
- How do I ensure that there is enough space on disk before storing big datasets/files ?
- How do I check the space used by my files/data on stat/notebook hosts ?

Please read them and let me know if anything is not clear or missing. We have plenty of space on stat100X hosts, but we tend to cluster on single machines like stat1007 for some reason, ending up in fighting for resources.

On a related note, we are going to work on unifying stat/notebook puppet configs in https://phabricator.wikimedia.org/T243934, so eventually all Analytics clients will be exactly the same.

Thanks!

Luca (on behalf of the Analytics team)