Final answer:
The split-by parameter in Sqoop determines the number of rows to send to each mapper when importing data from a database into Hadoop.
Step-by-step explanation:
The split-by parameter in Sqoop is used to determine the number of rows to send to each mapper. It is used when importing data from a database into Hadoop.
For example, if you have a table with 1000 rows and set the split-by parameter to a column such as 'id', Sqoop will split the import into multiple parallel tasks and send a specific range of rows to each mapper based on the 'id' column values.
By defining the number of rows to send to each mapper, Sqoop can improve the efficiency and speed of importing large datasets into Hadoop.