小编Ji *_*Wei的帖子

一种在多个数据集中映射数据的更好方法,具有多个数据映射规则

我有三个数据集(final_NNppt_codeherd_id),我想MapValuefinal_NN数据框中添加一个名为的新列,并且可以从其他两个数据框中检索要添加的值,规则在代码后的底部。

import pandas as pd

final_NN = pd.DataFrame({
    "number": [123, 456, "Unknown", "Unknown", "Unknown", "Unknown", "Unknown", "Unknown", "Unknown", "Unknown"],
    "ID": ["", "", "", "", "", "", "", "", 799, 813],
    "code": ["", "", "AA", "AA", "BB", "BB", "BB", "CC", "", ""]
})

ppt_code = pd.DataFrame({
    "code": ["AA", "AA", "BB", "BB", "CC"],
    "number": [11, 11, 22, 22, 33]
})

herd_id = pd.DataFrame({
    "ID": [799, 813],
    "number": [678, 789]
})

new_column = …
Run Code Online (Sandbox Code Playgroud)

python data-mapping data-analysis dataframe pandas

5
推荐指数
1
解决办法
188
查看次数

标签 统计

data-analysis ×1

data-mapping ×1

dataframe ×1

pandas ×1

python ×1