88.9k views
3 votes
How to drop duplicates based on a column in pandas

User Nouran H
by
8.1k points

1 Answer

1 vote

Final answer:

To drop duplicates based on a column in pandas, call the DataFrame's drop_duplicates() method and pass the column name through its subset parameter.

Step-by-step explanation:

By default, drop_duplicates() compares entire rows. Passing a column name (or a list of column names) as subset makes it compare rows on those columns only. Here's an example:

df = df.drop_duplicates(subset='column_name')

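This removes every row whose value in the specified column has already appeared earlier in the DataFrame, keeping only the first occurrence (the default, keep='first'). Note that drop_duplicates() returns a new DataFrame rather than modifying df, so the result has to be assigned to a variable to be kept.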
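
To make the behaviour concrete, here is a small sketch on made-up data. The DataFrame contents and the column names "user" and "score" are assumptions for illustration only; subset and keep are regular parameters of drop_duplicates().

```python
import pandas as pd

# Hypothetical sample data: "user" contains repeated values.
df = pd.DataFrame({
    "user": ["ann", "bob", "ann", "cat", "bob"],
    "score": [1, 2, 3, 4, 5],
})

# Default keep='first': keep the first row seen for each user.
first = df.drop_duplicates(subset="user")

# keep='last': keep the last row seen for each user instead.
last = df.drop_duplicates(subset="user", keep="last")

# keep=False: drop every row whose user appears more than once.
unique_only = df.drop_duplicates(subset="user", keep=False)

print(first)
print(last)
print(unique_only)
```

The surviving rows keep their original index labels. If you want a fresh 0..n-1 index, chaining .reset_index(drop=True) onto the result is one way to get it.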

User JianMing Wang
by
8.0k points