Caffe 初學拾遺(一) 簡單命令

原創

CodeCold

2020-06-24 04:46

Original Source : Alex’s CIFAR-10 tutorial

本文以CIFAR-10數據集爲例，對Caffe的train及test操作進行簡單說明：

1. solver.prototxt 以及 cifar10_full_train_test.prototxt 區別：

CIFAR-10其訓練網絡配置文件與測試網絡配置文件是同一個 cifar10_full_train_test.prototxt 文件。

常規情況下，像model文件夾下的AlexNet中出現三個.prototxt 文件，其中 train_val.prototxt 與之類似。

而solver.prototxt是包含全局參數的配置文件，主要用於train以及fine-tuning，在test時是不需要的。

2. cifar10_full_train_test.prototxt 中的數據輸入：

layer {
  name: "cifar"
  type: "Data"
  top: "data"
  top: "label"
  include {
    phase: TRAIN
  }
  transform_param {
    mean_file: "examples/cifar10/mean.binaryproto"
  }
  data_param {
    source: "examples/cifar10/cifar10_train_lmdb"
    batch_size: 100
    backend: LMDB
  }
}
layer {
  name: "cifar"
  type: "Data"
  top: "data"
  top: "label"
  include {
    phase: TEST
  }
  transform_param {
    mean_file: "examples/cifar10/mean.binaryproto"
  }
  data_param {
    source: "examples/cifar10/cifar10_test_lmdb"
    batch_size: 100
    backend: LMDB
  }
}

正如1中所說，在這裏定義train以及test的輸入數據的路徑。

3. cifar10_quick_solver.prototxt 說明：

# Carry out testing every 500 training iterations.
test_interval: 500

每500次訓練迭代進行一次驗證測試(validation)，並不是一直訓練到結束後才進行測試(test)。

# snapshot intermediate results
snapshot: 4000
snapshot_format: HDF5

如果保留 snapshot_format: HDF5會生成.h5後綴的快照，用於繼續訓練或者 fine-tuning。

如果註釋掉該語句，會生成.caffemodel後綴權值文件，用於繼續訓練，fine-tuning，或者test。

4. 訓練：

caffe train \
  --solver=examples/cifar10/cifar10_quick_solver.prototxt

網絡結構在 cifar10_quick_solver.prototxt 文件中指向了:

net: "examples/cifar10/cifar10_quick_train_test.prototxt"

5. 利用snapshot繼續訓練：

caffe train \
  --solver=examples/cifar10/cifar10_quick_solver_lr1.prototxt \
  --snapshot=examples/cifar10/cifar10_quick_iter_4000.solverstate.h5

利用的是4000次迭代後的快照.h5

6. 測試：

sudo caffe test \
   --model=./examples/cifar10/cifar10_train_test.prototxt   --weights=./examples/cifar10/cifar10_quick_iter_5000.caffemodel --iterations 100 -gpu all

使用所有的GPU

加載之前5000次迭代後獲得的model

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

Caffe 初學拾遺(一) 簡單命令

[轉帖]使用NMT和pmap解決JVM資源泄漏問題原創

Python實現大麥網搶票的四大關鍵技術點解析

Python 安裝庫指令大全

salesforce零基礎學習（一百三十八）零碎知識點小總結（十）

一款開源的.NET程序集反編譯、編輯和調試神器

關於接口協議，你必須要知道這些！

基於 Milvus + LlamaIndex 實現高級 RAG

【2024-05-21】以茶會友

Caffe 初學拾遺(一) 簡單命令

Caffe 初學拾遺(六) CUDA 線程通信

Convolutional neural networks(CNN) (十二) Convolutional Neural Network Theory

Caffe 初學拾遺(七) Layer Catalogue (Vision Layer)

十進制二進制轉換

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結