量化投資學習筆記37——《Python機器學習應用》課程筆記10

原創

2020-03-06 13:01

用KNN算法來進行數字識別，還是用sklearn自帶的digits數據集。
coding:utf-8
KNN算法實現手寫識別

from sklearn import neighbors
from sklearn.datasets import load_digits
from sklearn.model_selection import train_test_split
from sklearn.metrics import classification_report
import matplotlib.pyplot as plt

if name == "main":
加載數據
digits = load_digits()
x_data = digits.data
y_data = digits.target
print(x_data.shape)
print(y_data.shape)

劃分訓練測試集
x_train, x_test, y_train, y_test =  train_test_split(x_data, y_data)
訓練
knn = neighbors.KNeighborsClassifier(algorithm = "kd_tree", n_neighbors = 3)
knn.fit(x_train, y_train)
準確率評估
predictions = knn.predict(x_test)
print(classification_report(y_test, predictions))

除了訓練那部分，代碼幾乎都是抄前文的。可以看到用sklearn庫非常方便。結果也很好，準確率98%。
KNN的準確率遠高於MLP分類器，原因是MLP在小數據集上容易過擬合。而且MLP對於參數調整比較敏感。
接下來是強化學習。

我發文章的四個地方，歡迎大家在朋友圈等地方分享，歡迎點“在看”。
我的個人博客地址：https://zwdnet.github.io
我的知乎文章地址： https://www.zhihu.com/people/zhao-you-min/posts
我的博客園博客地址： https://www.cnblogs.com/zwdnet/
我的微信個人訂閱號：趙瑜敏的口腔醫學學習園地

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

量化投資學習筆記37——《Python機器學習應用》課程筆記10

【面試準備】又一次失敗的面試經歷，題目離譜～資深軟件測試工程師

dotnet 8 版本與銀河麒麟V10和UOS系統的 glibc 兼容性

量化投資學習筆記27——《Python機器學習應用》課程筆記01

量化投資學習筆記20——迴歸分析:實操，泰坦尼克號乘客生還機會預測，邏輯迴歸方法。

量化投資學習筆記36——《Python機器學習應用》課程筆記09

量化投資學習筆記37——《Python機器學習應用》課程筆記10

量化投資學習筆記34——《Python機器學習應用》課程筆記08

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結