The coding assignment above, involves analyzing student debt data from the US and examining the relationship between debt levels, university costs, and student income.
So, The data cleaning and analysis process involves merging and manipulating multiple datasets, calculating inflation-adjusted costs, and generating various summary tables for comprehensive insights.
So, for Part 1, to do it, one need to read the data: Use the read.csv() function to load the education_data.csv, cost_data.csv, and data_description.csv files.
Also examine the data and use descriptive statistics and data visualization techniques to understand the structure and characteristics of each dataset.
Then Combine data and Merge the education_data.csv and cost_data.csv datasets by matching the year and school_id columns.