学習記録その２７（３１日目）

10868 ワード

機械学習ニューラルネットワーク Keras Python 初心者 Python テキストリンク

学習記録（31日目）

勉強開始：１２／７（土）〜

教材等：
・大重美幸『詳細！ Python3 入門ノート』(ソーテック社、２０１７年)：１２／７（土）〜１２／１９（木）読了
・Progate Python講座（全５コース）：１２／１９（木）〜１２／２１（土）終了
・Andreas C. Müller、Sarah Guido『（邦題）Pythonではじめる機械学習』（オライリージャパン、２０１７年）：１２／２１（土）〜１２月２３日（土）読了
・Kaggle : Real or Not? NLP with Disaster Tweets ：１２月２８日（土）投稿〜１月３日（金）まで調整
・Wes Mckinney『（邦題）Pythonによるデータ分析入門』（オライリージャパン、２０１８年）：１／４（水）〜１／１３（月）読了
・斎藤康毅『ゼロから作るDeep Learning』（オライリージャパン、2016年）：１／１５（水）〜１／２０（月）
・François Chollet『PythonとKerasによるディープラーニング』（クイープ、2018年）：１／２１（火）〜

『PythonとKerasによるディープラーニング』

p.186 第5章コンピュータービジョンのためのディープラーニングまで読み終わり。

ニューラルネットワークによるアヤメの分類

ニューラルネットワークについて、書籍ではkerasデータセットの手書き文字などを使って解説してあったものの、あえてsklearnのアヤメデータセットを使って分類を実施してみました。

各種モジュールのインポート

from keras import models, layers
from keras.utils.np_utils import to_categorical
from sklearn import datasets
from sklearn.model_selection import train_test_split

下準備

#トイデータの読み込み
iris = datasets.load_iris()

#ディレクトリ中身の確認、'data'と'target'についてそれぞれ定義
dir(iris)
x = iris.data
y = iris.target

#データ形状を確認
print([x.shape, y.shape])
print(iris.fearture_names)
print(x[0])

x.shape = (150, 4), y.shape = (150, )

iris.feature_namesでxの要素について確認。

xの4列がそれぞれ ['sepal length (cm)', 'sepal width (cm)', 'petal length (cm)', 'petal width (cm)'] に対応していることを確認。

yは0~2で、３種類のアヤメに対応。

正規化

def normalization(x):
    mean = x.mean(axis = 0)
    std = x.std(axis = 0)
    x -= mean
    x /= std
    return x

x = normalization(x)
y = to_categorical(y)

xから平均を引き標準偏差で割ることで、「中心が０、標準偏差が１」になるよう各要素ごと正規化する。
yもto_categoricalで０か１かのベクトル化を実施する。

訓練用とテスト用に分割

x_train, x_test, y_train, y_test = 
train_test_split(x, y, train_size = 0.7, random_state = 3)

print([x_train.shape, x_test.shape, y_train.shape, y_test.shape])

訓練用とテスト用を７：３で分割。
ShuffleはDefaultでTrueなのでrandom_stateのみ設定する。

要素はそれぞれ、[(105, 4), (45, 4), (105, 3), (45, 3)]となっており、ちゃんと分割できたことを確認。

モデル構築：２層

model = models.Sequential()

model.add(layers.Dense(64, activation = 'relu', input_shape = (x.shape[1], )))
model.add(layers.Dense(64, activation = 'relu'))
model.add(layers.Dense(3, activation = 'softmax'))

model.summary()

model.compile(optimizer = 'rmsprop', loss = 'categorical_crossentropy', metrics = ['accuracy'])

各層の活性化関数はReLU、最後の層の活性化関数はsoftmaxを使用。（p.119 表4-1を参照）

model.summary()で各層について可視化

オプティマイザは、ほぼrmspropで足りるとの記載よりこちらを選択した。

学習及び評価

model.fit(x_train, y_train, epochs = 100, batch_size = 1, verbose=0)

result = model.evaluate(x_test, y_test)

print(result)
#[0.15816927485995821, 0.9555555582046509]

epochsはとりあえず100、batch_sizeは当然ながら１。

以上、単純な構造であったものの、結果として96%の精度が得られました。

処置事項１：モデル最終層の出力

アヤメは３種類あるので、layers.Dense(3)とすべきところをlayers.Dense(1) にしていたため発生したエラー。
必要とする出力の数に合わせて変える必要がある。(手書き文字なら０〜９の１０個)

処置事項２：学習の記録

特にエラーではないが、記録を必要としない場合は引数verbose=0とすることで対処できた。

Author And Source

この問題について(学習記録その２７（３１日目）), 我々は、より多くの情報をここで見つけました https://qiita.com/SHUNMT/items/fc1bc5ea896557c5238b

著者帰属：元の著者の情報は、元のURLに含まれています。著作権は原作者に属する。

Content is automatically searched and collected through network algorithms . If there is a violation . Please contact us . We will adjust (correct author information ,or delete content ) as soon as possible .