Data Manipulation
We will go through the main activities one has to face in order to overcome the most common data cleaning and wrangling problems.
R has an ecosystem of packages, the tidyverse, each specialized in a single task:
- readr for importing data from various sources;
- tidyr and dplyr for data wrangling;
- ggplot2 for plotting;
- and others.
In Python, it is possible to accomplish almost every data manipulation task with the Pandas package.
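
To give a feel for how one package can cover several of these tasks, here is a minimal sketch using Pandas alone. The small CSV, its column names (city, year, temp) and its values are invented for illustration; the comparisons with R packages in the comments are loose analogies, not exact equivalents.

```python
import io

import pandas as pd

# Hypothetical CSV content, invented purely for illustration.
raw = io.StringIO(
    "city,year,temp\n"
    "Rome,2020,15.5\n"
    "Rome,2021,\n"
    "Oslo,2020,6.0\n"
    "Oslo,2021,6.4\n"
)

# Import the data (roughly the role readr plays in R).
df = pd.read_csv(raw)

# Clean: drop rows where the temperature is missing.
clean = df.dropna(subset=["temp"])

# Summarize per group (loosely comparable to dplyr's group_by + summarise).
mean_temp = clean.groupby("city")["temp"].mean()

# Reshape from long to wide (loosely comparable to tidyr's pivot_wider).
wide = df.pivot(index="city", columns="year", values="temp")

print(mean_temp)
print(wide)
```

Plotting is left out of this sketch; the point is only that importing, cleaning, summarizing and reshaping can all be done on the same DataFrame object.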