Data Analysis using Spark SQL

placeholder

Analyze an Apache Spark DataFrame as though it were a relational database table. During this Aspire course you will discover the different stages involved in optimizing any query or method call on the contents of a Spark DataFrame. Discover how to create views out of a Spark DataFrames contents and run queries against them; and how to trim and clean a DataFrame. Next learn how to perform an analysis of data by running different SQL queries; how to configure a DataFrame with an explicitly defined schema; and define what a window is in the context of Spark. Finally observe how to create and analyze categories of data in a data set by using Windows.