線性迴歸

原創

内心的笃定

2018-09-03 20:14

import matplotlib.pyplot as plt
import numpy as np
#讀取的是sklearn自帶的數據集
from sklearn import datasets
class LinearRegression():
    def __init__(self):
        self.w = None

    def fit(self, X, y):
        #在第0列填充1
        X = np.insert(X, 0, 1, axis=1) 
        print(X.shape)
        #X.T.dot(X) 求逆運算 沒有考慮矩陣的逆不存在的情況
        X_ = np.linalg.inv(X.T.dot(X))
        self.w = X_.dot(X.T).dot(y)
        

    def predict(self, X):
        # Insert constant ones for bias weights
        X = np.insert(X, 0, 1, axis=1)
        y_pred = X.dot(self.w)
        return y_pred
def mean_squared_error(y_true, y_pred):
    #np.power數組元素求n次方
    mse = np.mean(np.power(y_true - y_pred, 2))
    return mse
def main():
    # Load the diabetes dataset
    diabetes = datasets.load_diabetes()
    #print(diabetes)
    #diabetes沒有shape的屬性
    #print(diabetes.shape) AttributeError: shape
    # Use only one feature
    #X = diabetes.data[:, 2]直接取到的是一個一維的數據，要把它變成n*1二維數組的形式，需在列上增加維度
    X = diabetes.data[:, np.newaxis, 2]
    
    print (X.shape)
    # Split the data into training/testing sets
    #X[:-20]從頭開始到倒數第20行
    x_train, x_test = X[:-20], X[-20:]

    # Split the targets into training/testing sets
    y_train, y_test = diabetes.target[:-20], diabetes.target[-20:]

    clf = LinearRegression()
    clf.fit(x_train, y_train)
    y_pred = clf.predict(x_test)

    # Print the mean squared error
    print ("Mean Squared Error:", mean_squared_error(y_test, y_pred))

    # Plot the results
    plt.scatter(x_test[:,0], y_test,  color='black')
    plt.plot(x_test[:,0], y_pred, color='y', linewidth=3)
    plt.show()

執行main函數：main()

運行結果

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

相關文章

python實現基金定投並可視化結果（及時止損）

1.什麼是指數基金 2.什麼是基金定投 3.本次數據來源 4.作出假設每週定投一次，每次定投500，計算2019年對滬深300指數基金進行定投的收益率每週定投一次，每次定投500，分別計算從2002年開始到2019年，每年定

2020-07-07 18:05:31

Python數據分析與挖掘實戰Chapter7 航空公司客戶價值分析

1.數據探索 #-*- coding: utf-8 -*- #對數據進行基本的探索，返回缺失值以及最大值，最小值 import pandas as pd datafile='G:/學習資料/統計/chapter7/demo/data

weixin_42764993

2020-07-06 13:07:27

【pandas】[9] pandas loc、iloc

創建一個dataframe import numpy as np import pandas as pd #創建一個Dataframe data=pd.DataFrame(np.arange(16).reshape(4,4),index

2020-07-06 09:40:22

【phantomjs】爬蟲安裝使用

phantomJS：的用處可謂非常廣泛諸如網絡監測、網頁截屏、無需瀏覽器的wen測試、頁面訪問自動化等。 phantomjs的下載安裝： http://phantomjs.org/download.html 下載完成後，直接解壓到桌面。

2020-07-04 23:33:26

matplotlib畫圖相關知識

Matplotlib 數據可視化 matplotlib庫的介紹數據可視化第三方庫 matplotlib.pyplot 是繪製各類可視化圖形的命令子庫，相當於快捷方式。 import matplotlib.pyplot as pl

2020-07-04 17:56:51

numpy庫相關知識

文章目錄numpy庫函數速查表numpy庫入門數據維度numpy介紹ndarray對象的屬性ndarray數組的創建和變換ndarray數組的變換ndarray數組的操作ndarray數組的運算numpy的隨機數函數numpy的統

2020-07-04 17:56:51

Python運算符和表達式

本文轉載自http://www.cnblogs.com/yueya/p/5811937.html 算術運算符：比較運算符：賦值運算符：位運算符：邏輯運算符：身份運算符：對比：isinst

liangyingyi1006

2020-07-04 10:41:01

python學習筆記——numpy

補充tile(val,(x,y))將val內容複製x行，y列。val可以使單個值，也可以是列表shape()查看矩陣或者數組的維數；如果是一個值，返回'()';如果存在x行，y列，返回'(x,y)';含有n個值得一維數組，返回'(n,)'

2020-07-04 02:08:38

使用Python玩轉word

需求：客戶提供Excel表格試題試卷，要求我們隨機生成10份word文檔試卷，試題內容隨機排序。讀取Excel中數據生成word試卷定義生成試卷的總數讀取Excel中數據 # -*- coding: utf-8 -*- """

奥斯维克鸡腿学徒

2020-07-03 15:55:48

【Python數據分析】1st-數據探索與數據預處理

《Python數據分析與挖掘實戰》讀書筆記之數據探索與數據預處理文章目錄@[toc] ##一、數據探索 Python中用於數據探索的庫主要是Pandas（數據分析）和Matplotlib（數據可視化） ###數據分析內容數據質

2020-07-02 21:24:03

[數據分析基礎] 2. Matplotlib庫

[數據分析基礎] 2. Matplotlib庫文章目錄[數據分析基礎] 2. Matplotlib庫一、Matplotlib庫入門1. pyplot的繪圖區域2. pyplot的plot()函數format_string**kw

2020-07-02 19:25:02

利用Python進行數據分析(三)：繪圖與可視化

本文爲《利用Python進行數據分析》的部分讀書筆記目錄matplotlib入門圖片與子圖顏色，標記和線類型刻度，標籤和圖例將圖片保存到文件顯示圖像註釋與子圖加工matplotlib設置 matplotlib入門本文爲入門內

2020-07-02 18:52:51

利用Python進行數據分析(一)：IPython及Jupyter notebook

本文爲《利用Python進行數據分析》的部分讀書筆記目錄IPython與Jupyter notebook簡介IPython基礎使用IPython命令行運行Jupyter notebook配置文件Jupyter Notebook

2020-07-02 18:52:51

利用Python進行數據分析(二)：Numpy

本文爲《利用Python進行數據分析》的部分讀書筆記目錄Numpy ndarray: 多維數組對象ndarray屬性NumPy 數據類型生成ndarrayNumpy數組算術基礎索引與切片布爾索引神奇索引數組轉置與轉軸通用函數：快

2020-07-02 18:52:51

數據分析之Pandas-01Series和DataFrame

01-什麼是Pandas Python Data Analysis Library 或 pandas 是基於NumPy 的一種工具，該工具是爲了解決數據分析任務而創建的。 pandas 納入了大量庫和一些標準的數據模型，提供了高

Python小学生

2020-07-02 10:12:42

24小時熱門文章

druid數據源 xml配置

最新文章

最新評論文章