pandas 統計量

3451 ワード

pandas Python GroupBy Python テキストリンク

import pandas as pd
import numpy as np

df = pd.DataFrame({
    "category1" : ["A","A","A","B","B","B"],
    "column1" : np.array([5,4,6,9,11,13])
})

統計量

df.describe()

column1はcategoryでAとBに別れているが区別なく統計量が出るのは不便。

groupby関数

category A,B で区別して統計量を得る。

# category1でgroupを作る
group = df.groupby("category1")
# groupごとに関数処理する
print(group.mean()) # 平均

print(group.std(ddof = 1)) # 標準偏差

group.describe() # groupごとに統計量の一括表示

地道に取り出す

# Aを取り出す
df_A = df[df["category1"] == "A"]
# mean関数に通す
df_A["column1"].mean()

groupby便利。

Author And Source

この問題について(pandas 統計量), 我々は、より多くの情報をここで見つけました https://qiita.com/roadto93ds/items/0161c697317c06fd8ba3

著者帰属：元の著者の情報は、元のURLに含まれています。著作権は原作者に属する。

Content is automatically searched and collected through network algorithms . If there is a violation . Please contact us . We will adjust (correct author information ,or delete content ) as soon as possible .