Spark for High-speed Big Data Analytics
Spark is an open-source massively parallel in-memory solution that allows you to run big data analytics pipelines at high speed. Use this course to learn how Apache Spark works and gain an understanding of its architecture. As you progress investigate the industry-leading examples of Uber and Alibaba to recognize how Spark can add business value to data in many industry types. Moving along compare the functionality of Spark and Hadoop in relation to use cases identifying when using Spark is most advantageous. Finally explore fundamental Spark characteristics optimization techniques and best practices. When youve completed this course youll have a solid theoretical understanding of how and when to use Apache Spark for specific big data analytics tasks.