Workshop: Mind blown: Crafting a Distributed Data Science Pipeline using Spark, Cassandra, Akka and the Spark Notebook
Featuring Andy Petrella and Xavier Tordoir
Get your hands dirty with distributed tools, during these two hours we’ll have a quick overview on how a dataset can be processed in a distributed way towards the exposition exposition as a web service.
The tool we’ll use for this are Spark, Cassandra, Akka HTTP and the Spark Notebook.
A primary...scala spark cassandra akka spark-notebook
Distributed Data Science with Scala in a Browser
Featuring Xavier Tordoir
While machine learning has been used for decades, accessibility to these methods is undergoing a radical shift, with the rise of simple interfaces and implementations on distributed systems. In practice it means that more players can afford to take advantage of Machine Learning, and at larger...machine-learning distributed-data-science scala spark-notebook