This section covers the data management process in R, including basic data cleaning. Routine malaria surveillance data extracted from DHIS2 is used for demonstration.
Topics covered:
- reading data - importing .xlsx and .dta data files
- data cleaning - removing unwanted text, renaming columns, matching names, grouping labels
- reshaping data - converting between long and wide formats using the pivot_longer() and pivot_wider() functions from the tidyr package
- exploration - summarising and tabulating the data
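
As a preview of the reshaping and exploration topics listed above, here is a minimal sketch using the tidyr package. The data frame and its column names (`district`, `Jan`, `Feb`, `cases`) are invented for illustration and are not taken from the DHIS2 extract used in this section.

```r
library(tidyr)

# Hypothetical wide-format data: one row per district, one column per month
wide <- data.frame(
  district = c("A", "B"),
  Jan = c(10, 5),
  Feb = c(12, 7)
)

# Wide to long: one row per district-month combination
long <- pivot_longer(
  wide,
  cols = c(Jan, Feb),
  names_to = "month",
  values_to = "cases"
)

# Long to wide: back to one column per month
wide_again <- pivot_wider(long, names_from = month, values_from = cases)

# Quick exploration: numeric summary and a frequency table
summary(long$cases)
table(long$month)
```

The long format is usually more convenient for grouping and plotting, while the wide format is often closer to how surveillance data is exported.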