
Great insightful comment. I came to the same conclusion a number of years ago. We did something about it: we built a new Hadoop platform around a not very well known distributed, in-memory, open-source database, MySQL Cluster (NDB). It is not the MySQL Server you think you know. It is an in-memory OLTP engine used by most network operators as a call subscriber DB. It can handle millions of reads or writes/sec on commodity hardware (it has been benchmarked at 200m reads/sec and about 80m writes/sec). It has transactions (read committed isolation level) and row-level locks, and it supports efficient cross-partition transactions using one transaction coordinator per database node (up to 48 of them).

You can build scalable apps with strong consistency if you can write them using primary-key ops and partition-pruned index scans. We managed to scale out HDFS by 16x with this technique. Since then, we have been doing as you suggested: we built a microservices architecture for Hadoop, called Hopsworks, around the transactional distributed database. All the evils of eventual consistency go away; systems like Apache Ranger/Sentry become just tables in the DB. More reading is available here: http://www.hops.io/?q=content/news-events
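To make the "primary-key ops and partition-pruned index scans" point concrete, here is a minimal sketch (not NDB or Hopsworks code; all names are made up for illustration) of why pruning matters: rows are hash-partitioned on the primary key, so a lookup that supplies the key can be routed to exactly one partition, while a scan on a non-key column must be broadcast to every partition.

```python
# Toy model of hash partitioning by primary key. NDB distributes rows
# across data nodes in a conceptually similar way; this is only a sketch.
NUM_PARTITIONS = 4

partitions = [dict() for _ in range(NUM_PARTITIONS)]

def partition_for(key):
    # Route a primary key to exactly one partition.
    return hash(key) % NUM_PARTITIONS

def put(key, value):
    partitions[partition_for(key)][key] = value

def get_by_pk(key):
    # Partition-pruned: exactly one partition is touched.
    return partitions[partition_for(key)].get(key), 1

def scan_all(predicate):
    # Non-pruned scan: every partition must be consulted.
    hits = [v for p in partitions for v in p.values() if predicate(v)]
    return hits, NUM_PARTITIONS

put("inode:/user/alice", {"owner": "alice"})
row, touched = get_by_pk("inode:/user/alice")   # touched == 1
rows, touched = scan_all(lambda v: v["owner"] == "alice")  # touched == 4
```

The scalability argument is simply that pruned operations cost O(1) nodes regardless of cluster size, while broadcast scans cost O(number of partitions), so an app written almost entirely in the first style keeps strong consistency without paying a fan-out penalty as the cluster grows.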



Hopsworks looks like it might be exactly what I need, I do typical data science work for small to small-medium data and wanted to start properly playing with spark on a HDFS store.

Currently most work is just done in R/Python in VM's on a small proxmox cluster (where only 1 node is always on) but I'd like start gently moving to spark, run the stack on a single node and scale on demand.

Is Hopsworks for me, does this approach even make sense for such small data or am I crazy? Thanks for your response!


Yes, Hopsworks can run on anything from 1 server to 1000s. We are finalizing the first proper release now: Jupyter support, TensorFlow, PySpark, SparkR, and a Python kernel for Jupyter too.


Awesome, that sounds perfect, I'll give it a shot. Do you have a mailing list or any way to follow along? Cheers



