Analyzing Data Using Python: Cleaning & Analyzing Data in Pandas

placeholder

For data analysis to be useful and accurate the analyzed data needs to be cleaned and curated. There are copious methods to achieve this in pandas. In this course youll learn how to identify and eliminate duplicates in pandas. Youll start by using the pandas cut method to discretize data into bins using bins to plot histograms and identify outliers using box-and-whisker plots. Youll parse and work with datetime objects read in from strings and convert string columns to datetime using the dateutils python library. Moving on youll master different pandas methods for aggregating data – including the groupby pivot and pivot_table methods. Lastly youll perform various joins – inner left outer right outer and full outer – using both the merge and join methods.