我有一个类似于以下内容的数据框
+----------------+-------+
| class | year |
+----------------+-------+
| ['A', 'B'] | 2001 |
| ['A'] | 2002 |
| ['B'] | 2001 |
| ['A', 'B', 'C']| 2003 |
| ['B', 'C'] | 2001 |
| ['C'] | 2003 |
+----------------+-------+
Run Code Online (Sandbox Code Playgroud)
我想使用它创建一个数据框,以使结果表显示每个类中每个类别的计数。
+-----+----+----+----+
|year | A | B | C |
+-----+----+----+----+
|2001 | 1 | 3 | 1 |
|2002 | 1 | 0 | 0 |
|2003 | 1 | 1 | 2 |
+-----+----+----+----+
Run Code Online (Sandbox Code Playgroud)
最简单的方法是什么?
假设我有一个具有结构的对象
data = {
a : [{values: {key1: 5, key2: "abc"}}, {values: {key1: 3, key2: "abc"}}, {values: {key1: 4, key2: "cde"}}],
b : [{values: {key1: 3, key2: "ffe"}}, {values: {key1: 11, key2: "gga"}}, {values: {key1: 7, key2: "abc"}}]
}
Run Code Online (Sandbox Code Playgroud)
我想提取元素 where key2 == "abc"。
预期输出:
data = {
a : [{values: {key1: 5, key2: "abc"}}, {values: {key1: 3, key2: "abc"}}],
b : [{values: {key1: 7, key2: "abc"}}]
}
Run Code Online (Sandbox Code Playgroud)
我试图遵循类似的例子,但未能实现我想要的。