Spark

  • Spark 2.0 architecture
  • Spark SQL
  • MLlib
  • Sqoop
  • Kafka
  • Flume
  • Spark Streaming
  • Spark DataFrames
  • Schemas for RDDs, lazy execution, and transformations
  • Aggregate, transform, filter, and sort data with DataFrames