85.8k views
3 votes
What are the four primary outputs generated during the data preparation phase of the CRISP-DM process?

User Bwalshy
by
7.1k points

1 Answer

3 votes

Final answer:

The four primary outputs generated during the data preparation phase of the CRISP-DM process are: Data Source Description, Data Set Description, Data Cleaning Report, and Data Integration Report.

Step-by-step explanation:

The CRISP-DM process is a widely-used data mining methodology. During the data preparation phase of the CRISP-DM process, there are four primary outputs that are generated:

  1. Data Source Description: This output provides information about the source of the data, such as its location, format, and structure.
  2. Data Set Description: This output describes the data set that will be used for analysis, including details about the variables and their characteristics.
  3. Data Cleaning Report: This output summarizes the cleaning and transformation steps applied to the data, including the identification and treatment of missing values and outliers.
  4. Data Integration Report: This output documents the process of combining data from multiple sources, including any decisions made to resolve discrepancies or conflicts.
User Navaneeth Sen
by
7.7k points