RDD
Lineage, Dependencies
Cache, Checkpoint
Replication
Partitions
Cluster
Master
Worker
Application
Client
Driver
Executor
Task
Mechanism
- BlacklistTracker, TaskSetBlacklist
Links
- Apache Spark源码走读之15 – Standalone部署模式下的容错性分析
- Fault Tolerance in Spark: Lessons Learned from Production: Spark Summit East talk by Jose Soltren
- Spark Documentation
- Author:HyperJ
- Source:HyperJ’s Blog
- Link:Spark Fault Tolerance