Introduction to Hadoop Several tools are available for working with big data. Many of the tools are open-source and Linux-based. Explore the fundamentals of Apache Hadoop including distributed computing design principles HDFS Yarn MapReduce and Spark.