
Python Pandas - Return Index with duplicate values removed except the first occurrence

2026-06-03 1 花语

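To return the Index with duplicate values removed, keeping only the first occurrence of each value, use the index.drop_duplicates() method with the keep parameter set to 'first'.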

First, import the required library -

import pandas as pd

Create an Index with some duplicates -

index = pd.Index(['Car', 'Bike', 'Airplane', 'Ship', 'Airplane'])

Display the Index -

print("Pandas Index with duplicates...\n",index)

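Return the Index with duplicate values removed. Setting the keep parameter to 'first' keeps the first occurrence in each set of duplicated entries and drops the later ones -

index.drop_duplicates(keep='first')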
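For comparison, the following minimal sketch runs drop_duplicates() on the same Index with each of the values that keep accepts ('first', 'last' and False). The variable names first_kept, last_kept and none_kept are introduced here for illustration only -

```python
import pandas as pd

# Same Index as above: 'Airplane' appears at positions 2 and 4
index = pd.Index(['Car', 'Bike', 'Airplane', 'Ship', 'Airplane'])

# keep='first' keeps the first 'Airplane' and drops the later one
first_kept = index.drop_duplicates(keep='first')

# keep='last' keeps the last 'Airplane' and drops the earlier one
last_kept = index.drop_duplicates(keep='last')

# keep=False drops every value that occurs more than once
none_kept = index.drop_duplicates(keep=False)

print(list(first_kept))  # ['Car', 'Bike', 'Airplane', 'Ship']
print(list(last_kept))   # ['Car', 'Bike', 'Ship', 'Airplane']
print(list(none_kept))   # ['Car', 'Bike', 'Ship']
```

Since 'first' is the default value of keep, calling index.drop_duplicates() with no arguments gives the same result as keep='first'.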

Example

Following is the complete code -

import pandas as pd

# Create an Index with some duplicates
index = pd.Index(['Car', 'Bike', 'Airplane', 'Ship', 'Airplane'])

# Display the Index
print("Pandas Index with duplicates...\n", index)

# Return the dtype of the data
print("\nThe dtype object...\n", index.dtype)

# Get the number of bytes in the data
print("\nGet the bytes...\n", index.nbytes)

# Get the number of dimensions of the data
print("\nGet the dimensions...\n", index.ndim)

# Return the Index with duplicate values removed
# keep='first' keeps the first occurrence in each set of duplicated entries
print("\nIndex with duplicate values removed (keeping the first occurrence)...\n", index.drop_duplicates(keep='first'))

Output

This will produce the following output -

Pandas Index with duplicates...
 Index(['Car', 'Bike', 'Airplane', 'Ship', 'Airplane'], dtype='object')

The dtype object...
 object

Get the bytes...
 40

Get the dimensions...
 1

Index with duplicate values removed (keeping the first occurrence)...
 Index(['Car', 'Bike', 'Airplane', 'Ship'], dtype='object')
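To see which entries were dropped, one option is the related Index.duplicated() method, which returns a boolean mask instead of a new Index. The sketch below is an illustration and not part of the original example; the names mask and deduplicated are hypothetical -

```python
import pandas as pd

index = pd.Index(['Car', 'Bike', 'Airplane', 'Ship', 'Airplane'])

# True marks a repeat of a value already seen earlier (first occurrences stay False)
mask = index.duplicated(keep='first')
print(mask)

# Selecting the unmarked entries gives the same values as drop_duplicates(keep='first')
deduplicated = index[~mask]
print(deduplicated)
```

Here only the second 'Airplane' (position 4) is marked True, which is the entry that drop_duplicates(keep='first') removes.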