w4_2 Multi-class Classification and Neural Networks: Programming Walkthrough
1. ex3.m
1.1 Structure of ex3.m
First, ex3.m sets up the parameters for the exercise (the visualization below is done by the provided displayData.m):
input_layer_size = 400; % 20x20 Input Images of Digits
num_labels = 10; % 10 labels, from 1 to 10
% (note that we have mapped "0" to label 10)
# Step 1: load the data and display a grid of random handwritten digits 0-9
load('ex3data1.mat'); % training data stored in arrays X, y
m = size(X, 1);
rand_indices = randperm(m);
sel = X(rand_indices(1:100), :);
displayData(sel);
# Step 2.1: vectorized logistic regression (a small test case for lrCostFunction)
theta_t = [-2; -1; 1; 2];
X_t = [ones(5,1) reshape(1:15,5,3)/10];
y_t = ([1;0;1;0;1] >= 0.5);
lambda_t = 3;
[J grad] = lrCostFunction(theta_t, X_t, y_t, lambda_t);
# Step 2.2: one-vs-all training, to tell the handwritten digits 0-9 apart
lambda = 0.1;
[all_theta] = oneVsAll(X, y, num_labels, lambda);
# Step 3: predict
pred = predictOneVsAll(all_theta, X);
1.2 Regularized logistic regression (30 points)
This is done in two steps: first compute the cost J(theta), then its gradient.
1.2.1 Logistic regression cost function
The cost function for logistic regression with a regularization term is

J(theta) = (1/m) * sum_{i=1..m} [ -y(i)*log(h_theta(x(i))) - (1-y(i))*log(1-h_theta(x(i))) ] + (lambda/(2m)) * sum_{j=2..n+1} theta(j)^2

where h_theta(x) = sigmoid(theta' * x) and the bias term theta(1) is excluded from the penalty. In lrCostFunction.m:
h_theta = sigmoid(X*theta);
left = -(y'*log(h_theta));
right = (1-y)'*log(1-h_theta);
theta_line_num = length(theta);
reg = sum(theta(2:theta_line_num,:) .^ 2);
J = 1/m*(left-right)+lambda/(2*m)*reg;
- Line 1: compute h_theta = sigmoid(X*theta); the sigmoid function is already provided in sigmoid.m
- Line 2: compute the left term, left = -y' * log(h_theta)
- Line 3: compute the right term, right = (1-y)' * log(1-h_theta)
- Line 4: get the length of the column vector theta; theta is an (n+1)x1 column vector whose first element theta(1) is the bias term, which is not penalized and must be excluded
- Line 5: square the entries of theta from row 2 to the last row and sum them, giving the regularization term reg
- Line 6: plug left, right, and reg into the formula above to obtain J
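The six lines above can be mirrored in NumPy and checked against the test case from ex3.m, whose expected cost is 2.534819. This Python version is an illustrative reimplementation, not part of the exercise:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lr_cost(theta, X, y, lam):
    # Regularized logistic regression cost, mirroring the Octave lines above.
    m = len(y)
    h = sigmoid(X @ theta)
    left = -(y @ np.log(h))            # -y' * log(h_theta)
    right = (1 - y) @ np.log(1 - h)    # (1-y)' * log(1-h_theta)
    reg = np.sum(theta[1:] ** 2)       # skip theta(1): the bias is not penalized
    return (left - right) / m + lam / (2 * m) * reg

# The test case from ex3.m; Octave's reshape is column-major, hence order='F'
theta_t = np.array([-2.0, -1.0, 1.0, 2.0])
X_t = np.c_[np.ones(5), np.arange(1, 16).reshape(5, 3, order='F') / 10.0]
y_t = np.array([1.0, 0.0, 1.0, 0.0, 1.0])
J = lr_cost(theta_t, X_t, y_t, 3.0)    # ex3.m states the expected cost 2.534819
```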
1.2.2 Gradient for regularized LR
Next, compute grad:
h_theta = sigmoid(X*theta); % already computed above
left_grad = X' * (h_theta-y);
grad = 1/m*left_grad + lambda/m*theta;
grad(1) = 1/m*(X(:,1))'*(h_theta-y);
The gradient of the regularized cost is

grad(j) = (1/m) * sum_{i=1..m} (h_theta(x(i)) - y(i)) * x_j(i) + (lambda/m) * theta(j)   for j >= 2

while grad(1) carries no regularization term:
- Line 1: compute h_theta; this is already available from above
- Lines 2-3: compute the unregularized part X' * (h_theta - y), scale it by 1/m, and add the regularization term (lambda/m) * theta
- Line 4: recompute grad(1) separately, without the regularization term
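The same gradient can be sketched in NumPy and checked against the expected values listed in ex3.m (0.146561, -0.548558, 0.724722, 1.398003); again this is an illustrative reimplementation:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lr_gradient(theta, X, y, lam):
    # Regularized gradient: add lambda/m * theta everywhere,
    # then recompute the first component without the penalty.
    m = len(y)
    h = sigmoid(X @ theta)
    grad = X.T @ (h - y) / m + lam / m * theta
    grad[0] = X[:, 0] @ (h - y) / m    # theta(1) is not regularized
    return grad

# The same test case as in ex3.m (column-major reshape, hence order='F')
theta_t = np.array([-2.0, -1.0, 1.0, 2.0])
X_t = np.c_[np.ones(5), np.arange(1, 16).reshape(5, 3, order='F') / 10.0]
y_t = np.array([1.0, 0.0, 1.0, 0.0, 1.0])
grad = lr_gradient(theta_t, X_t, y_t, 3.0)
# ex3.m lists the expected gradients as 0.146561 -0.548558 0.724722 1.398003
```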
1.3 One-vs-all classifier training (20 points)
Implement the oneVsAll function in oneVsAll.m:
initial_theta = zeros(n + 1, 1);
options = optimset('GradObj', 'on', 'MaxIter', 50);
for c=1:num_labels,
theta = fmincg (@(t)(lrCostFunction(t, X, (y == c), lambda)), initial_theta, options);
all_theta(c,:) = theta';
end;
The fmincg function is implemented in the provided fmincg.m; the loop above follows the implementation suggested in the exercise hints.
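Outside Octave, fmincg can be replaced by any gradient-based optimizer. Below is a minimal NumPy sketch of the same one-vs-all loop using plain gradient descent; the toy dataset and hyperparameters (alpha, iters) are invented for illustration:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def one_vs_all(X, y, num_labels, lam, alpha=0.5, iters=2000):
    # Train one regularized logistic regression classifier per label.
    # X already contains the bias column; y holds labels 1..num_labels.
    # Plain gradient descent stands in for fmincg here.
    m, n = X.shape
    all_theta = np.zeros((num_labels, n))
    for c in range(1, num_labels + 1):
        theta = np.zeros(n)
        yc = (y == c).astype(float)        # current class vs. all the others
        for _ in range(iters):
            h = sigmoid(X @ theta)
            grad = X.T @ (h - yc) / m + lam / m * np.r_[0.0, theta[1:]]
            theta -= alpha * grad
        all_theta[c - 1] = theta
    return all_theta

# Hypothetical toy data: three well-separated 2-D classes
X_raw = np.array([[0.0, 0.0], [0.5, 0.2], [5.0, 0.0],
                  [5.2, 0.3], [0.0, 5.0], [0.3, 5.1]])
y = np.array([1, 1, 2, 2, 3, 3])
X = np.c_[np.ones(len(y)), X_raw]
all_theta = one_vs_all(X, y, num_labels=3, lam=0.0)
pred = np.argmax(all_theta @ X.T, axis=0) + 1   # same rule as predictOneVsAll
```

Each binary classifier treats its own label as the positive class and everything else as negative, which is exactly what the `(y == c)` argument in the Octave loop achieves.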
1.4 One-vs-all classifier prediction (20 points)
Implemented in predictOneVsAll.m:
all_preds = all_theta * X';
[max_vals, max_ndxs] = max(all_preds);
p = max_ndxs';
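Note that the code takes the column-wise max of the raw scores all_theta * X' without applying the sigmoid first; since the sigmoid is strictly increasing, the argmax is the same either way. A quick NumPy check (the shapes here are made up):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(42)
all_theta = rng.standard_normal((4, 6))          # 4 labels, 5 features + bias
X = np.c_[np.ones(8), rng.standard_normal((8, 5))]
scores = all_theta @ X.T                         # (num_labels, m) raw scores
p_raw = np.argmax(scores, axis=0) + 1            # 1-indexed, like Octave's max
p_sig = np.argmax(sigmoid(scores), axis=0) + 1   # identical: sigmoid is monotonic
```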
2. ex3_nn.m
2.1 Structure of ex3_nn.m
%% Setup the parameters you will use for this exercise
# Step 1: load the data and display it
load('ex3data1.mat');
m = size(X, 1);
sel = randperm(size(X, 1));
sel = sel(1:100);
displayData(X(sel, :));
input_layer_size = 400; % 20x20 Input Images of Digits
hidden_layer_size = 25; % 25 hidden units
num_labels = 10; % 10 labels, from 1 to 10
% (note that we have mapped "0" to label 10)
# Step 2: load the pre-trained neural network parameters
load('ex3weights.mat');
# Step 3: predict
pred = predict(Theta1, Theta2, X);
# Step 4: display the prediction for one random example at a time
rp = randperm(m);
for i = 1:m
displayData(X(rp(i), :));
pred = predict(Theta1, Theta2, X(rp(i),:));
# press q to quit
s = input('Paused - press enter to continue, q to exit:','s');
if s == 'q'
break
end
end
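The predict function called above lives in predict.m, which is not listed here. Under the ex3 architecture (Theta1: 25x401, Theta2: 10x26), the feedforward pass can be sketched in NumPy as follows; the tiny weight matrices in the demo are invented for illustration:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def predict(Theta1, Theta2, X):
    # Feedforward through a 3-layer network; returns 1-indexed labels as in ex3.
    m = X.shape[0]
    a1 = np.c_[np.ones(m), X]         # input layer plus bias unit
    a2 = sigmoid(a1 @ Theta1.T)       # hidden layer activations
    a2 = np.c_[np.ones(m), a2]        # bias unit for the hidden layer
    a3 = sigmoid(a2 @ Theta2.T)       # output layer: one score per label
    return np.argmax(a3, axis=1) + 1  # pick the most probable label (1-indexed)

# Tiny invented network: 2 inputs -> 2 hidden units -> 3 labels
Theta1 = np.array([[0.1, 0.2, -0.3],
                   [-0.4, 0.5, 0.6]])            # (2, 2+1)
Theta2 = np.array([[0.1, -0.2, 0.3],
                   [0.4, 0.5, -0.6],
                   [-0.7, 0.8, 0.9]])            # (3, 2+1)
X = np.array([[1.0, 2.0], [-1.0, 0.5]])
p = predict(Theta1, Theta2, X)
```

With the real ex3weights.mat matrices the same function reproduces the roughly 97.5% training accuracy reported by ex3_nn.m.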