开发者社区问答正文

熊猫数据框/ numpy数组“轴”定义中的歧义

对于如何定义python轴，以及它们是否引用DataFrame的行或列，我一直感到困惑。考虑下面的代码：

df = pd.DataFrame([[1, 1, 1, 1], [2, 2, 2, 2], [3, 3, 3, 3]], columns=["col1", "col2", "col3", "col4"]) df col1 col2 col3 col4 0 1 1 1 1 1 2 2 2 2 2 3 3 3 3 因此，如果调用df.mean(axis=1)，我们将在各行中获得均值：

df.mean(axis=1) 0 1 1 2 2 3 但是，如果调用df.drop(name, axis=1)，则实际上是删除一列，而不是一行：

df.drop("col4", axis=1) col1 col2 col3 0 1 1 1 1 2 2 2 2 3 3 3 有人可以帮助我了解pandas / numpy / scipy中“轴”的含义吗？

旁注，DataFrame.mean可能只是定义错误。它在文档中DataFrame.mean说这axis=1应该是指各列的平均值，而不是各行的平均值。问题来源于stack overflow

展开

收起

保持可爱mmm 2020-02-08 14:30:43 470 版权

1 条回答

写回答

取消提交回答

保持可爱mmm

另一种解释方式：

// Not realistic but ideal for understanding the axis parameter df = pd.DataFrame([[1, 1, 1, 1], [2, 2, 2, 2], [3, 3, 3, 3]], columns=["idx1", "idx2", "idx3", "idx4"], index=["idx1", "idx2", "idx3"] )

---------------------------------------1 | idx1 idx2 idx3 idx4 | idx1 1 1 1 1 | idx2 2 2 2 2 | idx3 3 3 3 3 0 关于df.drop（轴表示位置）

A: I wanna remove idx3. B: Which one? // typing while waiting response: df.drop("idx3", A: The one which is on axis 1 B: OK then it is >> df.drop("idx3", axis=1)

// Result ---------------------------------------1 | idx1 idx2 idx4 | idx1 1 1 1 | idx2 2 2 2 | idx3 3 3 3 0 关于df.apply（轴表示方向）

A: I wanna apply sum. B: Which direction? // typing while waiting response: df.apply(lambda x: x.sum(), A: The one which is on parallel to axis 0 B: OK then it is >> df.apply(lambda x: x.sum(), axis=0)

// Result idx1 6 idx2 6 idx3 6 idx4 6

2020-02-08 14:30:59

赞同展开评论

问答分类：

Python

问答标签：

NumPy数组数据框NumPy数组

问答地址：

开发者社区 > 开发与运维 > 问答

探索云世界

热门

云计算

大数据

云原生

人工智能

数据库

开发与运维

活动广场

任务中心

训练营

直播

乘风者计划

下载

镜像站

技术资料

熊猫数据框/ numpy数组“轴”定义中的歧义

相关文章