Spark Fault Tolerance

RDD

  • Lineage, Dependencies

  • Cache, Checkpoint

  • Replication

  • Partitions

Cluster

  • Master

  • Worker

Application

  • Client

  • Driver

  • Executor

  • Task

Mechanism

  • BlacklistTracker, TaskSetBlacklist