Solution of Overfitting
Definition of Overfitting
data:image/s3,"s3://crabby-images/61917/619179ba33cdc81e48e0bfbe2dfd28b8e8ea6d70" alt=""
:train setは正しいがvalidation/test setは正しくない
Regularization
data:image/s3,"s3://crabby-images/05581/0558140e7b3ecd12ce939525c9e4cf84fa8a776e" alt=""
定義
Lp norm
data:image/s3,"s3://crabby-images/843c1/843c1e2ffc9317747bcbb2e03a51c37492d85137" alt=""
vector norm
i)pが自然数の場合
data:image/s3,"s3://crabby-images/0600f/0600f7a2c27ccbc35e9826ac9c782918668c3354" alt=""
x=np.array([1,10,1,1,1])
p=5
norm_x=np.linalg.norm(x, ord=p)
making_norm = (sum(x**p))**(1/p)
print("result of numpy package norm function : %0.5f "%norm_x)
print("result of making norm : %0.5f "%making_norm)
ii)pが無限大の場合data:image/s3,"s3://crabby-images/67539/67539c6b156043835bcaaa6a2d052597e5a80362" alt=""
norm_x=np.linalg.norm(x, ord=np.inf)
print("result of infinite norm : %0.5f "%norm_x)
matrix norm
:p=1,pが無限大であれば,把握すればよい.
i)p=1、最大の列と値を出力
ii)p無限大,Louの和出力最大値
:A=m*n行列
data:image/s3,"s3://crabby-images/d7992/d799296c9f6fbb95bb803236bc312ca873d15522" alt=""
A=np.array([[1,2,3],[1,2,3],[4,6,8]])
inf_norm_A=np.linalg.norm(A, ord=np.inf)
print("result inf norm of A :", inf_norm_A)
one_norm_A=np.linalg.norm(A, ord=1)
print("result one norm of A :", one_norm_A)
L1 Regularization
式:
data:image/s3,"s3://crabby-images/3992c/3992c9b9077447c3a3ae8a1c364289b4c8cab8a4" alt=""
機能:
- 어떤 컬럼이 결과에 영향을 미치는지에 대해서 알 수 있다.
data:image/s3,"s3://crabby-images/6ac24/6ac241c65994694f8fe50c6a811b5789847117a5" alt=""
例:
plt.figure(figsize=(5,5))
plt.scatter(X,Y)
plt.plot(X,linear.predict(X.reshape(-1,1)),'-b')
plt.title('petal-sepal scatter with linear regression')
plt.xlabel('petal length (cm)')
plt.ylabel('sepal length (cm)')
plt.grid()
plt.show()
data:image/s3,"s3://crabby-images/363f6/363f60e7249e55ef078138c310f5aae1a7697830" alt=""
L2 Regularization
data:image/s3,"s3://crabby-images/28122/281223fe76bc2e755036b987fac5112857947824" alt=""
機能:
円の形態:
data:image/s3,"s3://crabby-images/7f5cf/7f5cff79c38f94b3aba3e0b9ed237b0c74b9aa2b" alt=""
しゅうそくそくどが速い
->2乗基準が結果に大きな影響を与える場合は、それをより大きいまたはより小さい位置に送信します.
#L2 regularization은 Ridge로 import 합니다.
from sklearn.linear_model import Ridge
L2 = Ridge()
L2.fit(X.reshape(-1,1), Y)
a, b = L2.coef_, L2.intercept_
print("기울기 : %0.2f, 절편 : %0.2f" %(a,b))
plt.figure(figsize=(5,5))
plt.scatter(X,Y)
plt.plot(X,L2.predict(X.reshape(-1,1)),'-b')
plt.title('petal-sepal scatter with L2 regularization(Ridge)')
plt.xlabel('petal length (cm)')
plt.ylabel('sepal length (cm)')
plt.grid()
plt.show()
data:image/s3,"s3://crabby-images/a3aaf/a3aaf2df6df85e7ccae26ab81d2f198c65e45ff2" alt=""
Normalization
data:image/s3,"s3://crabby-images/f3689/f3689c9a5825d937a17fc2ce9e759808f00695e8" alt=""
定義
data:image/s3,"s3://crabby-images/ded42/ded42b56310b9481eef879e3439b1281485b0f59" alt=""
例:z-scoreまたはminmax scalerを使用して、値分布を0と1の間の右に調整します.
効果:データ値の分布による距離の歪みによる学習の妨げとなる問題の解決
Batch Normalization
機能:
Dropout
定義:情報をランダムに選択して渡す
機能:
コード実装
tf.keras.layers.Dropout(rate, noise_shape=None, seed=None, **kwargs)
>>> tf.random.set_seed(0)
>>> layer = tf.keras.layers.Dropout(.2, input_shape=(2,))
>>> data = np.arange(10).reshape(5, 2).astype(np.float32)
>>> print(data)
[[0. 1.]
[2. 3.]
[4. 5.]
[6. 7.]
[8. 9.]]
>>> outputs = layer(data, training=True)
>>> print(outputs)
tf.Tensor(
[[ 0. 1.25]
[ 2.5 3.75]
[ 5. 6.25]
[ 7.5 8.75]
[10. 0. ]], shape=(5, 2), dtype=float32)
Reference
この問題について(Solution of Overfitting), 我々は、より多くの情報をここで見つけました https://velog.io/@qsdcfd/Solution-of-Overfittingテキストは自由に共有またはコピーできます。ただし、このドキュメントのURLは参考URLとして残しておいてください。
Collection and Share based on the CC Protocol