76.5k views
5 votes
What function would you use to perform a selection operation in Spark

A) select()
B) filter()
C) groupby()
D) join()

User Koehr
by
7.8k points

1 Answer

5 votes

Final answer:

The correct function to perform a selection operation in Spark is filter(). It is used to select rows from a DataFrame based on a given condition.

Step-by-step explanation:

The correct function to perform a selection operation in Spark is filter().

The filter() function is used to select rows from a DataFrame based on a given condition. It takes a Boolean expression as an argument and returns a new DataFrame with the rows that satisfy the condition.

For example, to select all rows from a DataFrame where the 'age' column is greater than 18, you can use the filter() function like this: df.filter(df['age'] > 18).

User Oat Anirut
by
7.7k points