SWAP (Jupyter Notebooks) now supports Spark - Analytics

13 Aug 2018


Hi all!
I’d like to announce that we’ve done a bit of work to make Jupyter
Notebooks in SWAP https://wikitech.wikimedia.org/wiki/SWAP support Spark
kernels. This means that you can now run Spark shells in either local mode
(on the notebook server) or YARN mode (distributed on the Hadoop cluster)
inside of a Jupyter notebook. You can then take advantage of fancy Jupyter
plotting libraries to make graphs directly from data in Spark.
See https://wikitech.wikimedia.org/wiki/SWAP#Spark for documentation.
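
As a rough illustration of the workflow described above, here is a minimal
PySpark sketch. It is not taken from the SWAP documentation: it assumes that
pyspark and matplotlib are importable in the notebook kernel, the helper names
(master_url, get_session, rows_to_series, plot_counts) are hypothetical, and
the data is made up. The SWAP kernels may be configured differently, so treat
the linked documentation as authoritative.

```python
# Sketch only: assumes pyspark and matplotlib are installed in the kernel.
# All function names and the sample data below are hypothetical.


def master_url(mode):
    """Map the two modes named above to standard Spark master URLs."""
    masters = {"local": "local[*]", "yarn": "yarn"}
    if mode not in masters:
        raise ValueError("mode must be 'local' or 'yarn', got %r" % (mode,))
    return masters[mode]


def get_session(mode="local", app_name="notebook-sketch"):
    """Build (or reuse) a SparkSession for the chosen mode."""
    # Imported lazily so the plain-Python helpers work without Spark.
    from pyspark.sql import SparkSession

    return (
        SparkSession.builder
        .master(master_url(mode))
        .appName(app_name)
        .getOrCreate()
    )


def rows_to_series(rows):
    """Split (label, value) pairs into two parallel lists for plotting."""
    labels = [row[0] for row in rows]
    values = [row[1] for row in rows]
    return labels, values


def plot_counts(spark):
    """Aggregate a tiny made-up dataset in Spark, then plot the summary."""
    import matplotlib.pyplot as plt

    df = spark.createDataFrame(
        [("en", 3), ("de", 1), ("en", 2), ("fr", 4)],  # hypothetical data
        ["wiki", "edits"],
    )
    # Aggregate in Spark; only the small result is collected to the notebook.
    rows = df.groupBy("wiki").sum("edits").orderBy("wiki").collect()
    labels, values = rows_to_series(rows)
    plt.bar(labels, values)
    plt.ylabel("edits")
    plt.show()
```

In a notebook cell this could be run as plot_counts(get_session("local")), or
with "yarn" to run on the cluster. If the kernel already provides a Spark
session, pass that to plot_counts instead of creating a new one.
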
This is a new feature, and I’m sure there will be kinks to work out. If
you encounter issues or have questions, please respond on this Phabricator
ticket https://phabricator.wikimedia.org/T190443, or create a new one and
add the Analytics tag.
Enjoy!
-Andrew Otto & Analytics Engineering