Supercomputing for Big Data - Lab Manual
Useful links
Below are some useful links:
Git cheatsheet
Often-used API docs:
All Spark APIs
Spark Dataset API