New 1 - Data Science
3198 ワード
1. QQ-plot(Quantile Quantile plot)
data:image/s3,"s3://crabby-images/3f799/3f7992d6063761659c1d00212f17118300464e67" alt=""
(완전한 정규분포를 이룰 때)
data:image/s3,"s3://crabby-images/e9900/e9900941c7b903399a1f8c1bd230fcbd693664d9" alt=""
(skewed가 있을 떄)
2. Pandas.DataFrame.apply()
data:image/s3,"s3://crabby-images/487c5/487c5acc78151cb9275e36e44f1e4d4ca5c9a514" alt=""
3. Pandas.DataFrame.transform()
4.apply()とtransform()の違い
(写真の出所:https://towardsdatascience.com/difference-between-apply-and-transform-in-pandas-242e5cf32705)
1. transform() work with function, a string function, a list of functions, and a dict. However, apply() is only allowed with function.
data:image/s3,"s3://crabby-images/d54dd/d54dd75d472b79cf2a207e5881a1a2c13e4ad598" alt=""
data:image/s3,"s3://crabby-images/42246/42246f982b960353f915556e3ceb244ced45e67b" alt=""
2. transform() cannot produce aggregated results.
data:image/s3,"s3://crabby-images/8e83e/8e83e106309700e40b3a006a0ea1f398209c40e7" alt=""
3. apply() works with multiple Series at a time. But, transform() is only allowed to work with a single Series at a time.
data:image/s3,"s3://crabby-images/2c9b7/2c9b70a287bd043c629db80a341761cf63f2012c" alt=""
5. Pandas.DataFrame.astype()
6. Sklearn.pipeline()
data:image/s3,"s3://crabby-images/3523c/3523c0b9924d360119049a4149bf5f5a4e33d530" alt=""
7. cross_val_score()
data:image/s3,"s3://crabby-images/2fbb1/2fbb1e5cde1bdcc0f0b5b8b821ae1eacc926bae7" alt=""
8. Key differences GBM vs XGBOOST
data:image/s3,"s3://crabby-images/09e0c/09e0c0ec1888e1a892eae6b5dbb63725c971f4aa" alt=""
Guide for XGBoost
9. BaseEstimator, RegressorMixin, TransformerMixin
1. BaseEstimator
data:image/s3,"s3://crabby-images/6f84c/6f84c00751d937db8f4fd62aedcffd6709833029" alt=""
2. TransformerMixin
data:image/s3,"s3://crabby-images/5bcaf/5bcafb41c2be7081bffeb50f546fd280049c2706" alt=""
3. RegressorMixin
data:image/s3,"s3://crabby-images/3f8ef/3f8efb9e8bb3db198137d93decb71b70304ce1d3" alt=""
あるKaggle Notebookで次のようなクラスが見られました.
複数のモデルを一度に学習し、予測平均値を返す機能があるようです.
data:image/s3,"s3://crabby-images/a90e4/a90e45471ccc61398a316ebc30b36afafadab233" alt=""
このレベルもあります.
data:image/s3,"s3://crabby-images/99616/99616323d256362db99f2811b927b566adfe4cf7" alt=""
このように,平均modelsはrmsle cvという関数に入りcross val score()の推定因子として伝達される.
Reference
この問題について(New 1 - Data Science), 我々は、より多くの情報をここで見つけました https://velog.io/@aspalt85/New-1-Data-Scienceテキストは自由に共有またはコピーできます。ただし、このドキュメントのURLは参考URLとして残しておいてください。
Collection and Share based on the CC Protocol