Latest commit

e75b066 · May 12, 2015
Cloudwick Spark CodeBase

This repository is a collection of Spark examples and use-case implementations for various components of the Spark ecosystem, including Spark Core, Spark Streaming, Spark SQL, and Spark MLlib.

What does this repository contain?

How to download?

The simplest way is to clone the repository:

git clone https://github.com/cloudwicklabs/spark_codebase.git

How to run these?

To run any of these examples or use cases, you have to package them as an uber-jar (most of the examples depend on external libraries, so they have to be packaged as an assembly jar).

Building an assembly jar

From the project's home directory, run:

sbt assembly

Running using spark-submit

spark-submit is the simplest way to submit a Spark application to a cluster, and it supports all of the cluster managers: standalone, YARN, and Mesos.
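As a rough sketch of what a submission generally looks like, the snippet below assembles a spark-submit command from three values. The main class name, the master URL, and the jar path are all placeholders assumed for illustration; none of them come from this repository, so substitute a real main class and the assembly jar that `sbt assembly` produced under `target/`.

```shell
# Sketch only: every value below is a hypothetical placeholder.
MAIN_CLASS="com.example.WordCount"   # assumed name; use a real main class from this repository
MASTER="local[2]"                    # or the URL of a standalone, YARN, or Mesos master
JAR="path/to/your-assembly.jar"      # assumed path; use the jar built by `sbt assembly`

# --class and --master are standard spark-submit options; the jar comes last,
# followed by any arguments the application itself expects.
SUBMIT_CMD="spark-submit --class $MAIN_CLASS --master $MASTER $JAR"

# Print the command so it can be inspected before anything is submitted.
echo "$SUBMIT_CMD"

# Uncomment to actually submit once the placeholders are replaced:
# eval "$SUBMIT_CMD"
```

Printing the command first makes it easy to check the class name and jar path before anything reaches the cluster.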

Each main class has documentation on how to run it.