Yu Zhang
AboutThoughtsNotes
notes/operating-system/hadoop

Hadoop

  • K2B-7-1HadoopJun 1, 2020

    The Apache Hadoop ecosystem — HDFS (distributed storage), YARN (cluster resource scheduler), MapReduce (batch compute), and the surrounding stack (Hive, Spark, HBase, Kafka). What each piece does, how the pieces fit, where Hadoop still wins in the cloud-native era, and the certification ecosystem (Cloudera, formerly Hortonworks).

© 2026 Yu Zhang