7.6k views
4 votes
Before running any regressions make sure to check for multicollinearity. How did you check for multicollinearity? If there is multicollinearity, how do you plan to resolve it? Are there any other issues with the dataset we have to consider before running the regressions?

User Edepperson
by
6.9k points

1 Answer

4 votes

Answer:

Multicollinearity

Explanation:

  • In a linear regression model, we predict the dependent variable(y) with the help of independent variable
    (x_i).
  • Our aim is to minimize the residuals and make the best prediction.
  • Multicollinearity refers to the situation when there is correlation between the independent variables.
  • This could lead to wrong predictions and increase residuals
  • Multicollinearity can be checked with the help of VIF, variance inflation factor.
  • The industry accepted value of VIF is 5. A VIF greater than 5 means collinearity.
  • In order to treat multicollinearityy, we could plot scatter plot between different independent variables and remove one of the variable that is correlated.
  • Before running the linear regression model, we should make sure that there is no correlation between independent and dependent variable, residuals to be normally distributed, no auto correlation.
User Tryliom
by
7.6k points