On First-Order Meta-Learning Algorithms

Paper:https://arxiv.org/pdf/1803.02999.pdf
Code:https://github.com/openai/supervised-reptile
Tips：OpenAi的一篇相似MAML的Meta-learning相關的paper。
（閱讀筆記）

1.Main idea

目標旨在實現相同分佈的一類任務的少量樣本快速學習。This paper considers meta-learning problems, where there is a distribution of tasks, and we would like to obtain an agent that performs well (i.e., learns quickly) when presented with a previously unseen task sampled from this distribution.
類似於first-order MAML，忽略了二階偏微分。並且指出了其實現更爲簡單。
Reptile的作爲meta-learning的方法，訓練還是和傳統方法很相似。Reptile is so similar to joint training that it is especially surprising that it works as a meta-learning algorithm.
做出了first-order MAML和reptile的理論分析。

2.MAML回顧

回顧了MAML相關工作。
目標是求解下式，其中 $\tau$ 是不同的任務集， $\phi$ 是初始參數， $L$ 是損失函數， $U_{\tau}^{k}$ 表示從任務集 $\tau$ 抽樣出來訓練的第 $k$ 次的參數更新操作：
$\min_{\phi}\mathbb{E}_{\tau}[L_{\tau}(U_{\tau}^{k}(\phi)) ]$
$A$ 是原始訓練任務集， $B$ 是新任務集。MAML的訓練操作仍然對原始任務集進行訓練，但是其損失函數卻是針對的 $B$ ，如下所示：
$\min_{\phi}\mathbb{E}_{\tau}[L_{\tau,B}(U_{\tau,A}(\phi)) ]$
找梯度即需對參數 $\phi$ 求偏導（複合函數求導）：
$g=\frac{\partial L_{\tau,B}(U_{\tau,A}(\phi))}{\partial \phi}\\ \\=L_{\tau,B}'(U_{\tau,A}(\phi)) \times U_{\tau,A}'(\phi)=\frac{\partial L_{\tau,B}(U_{\tau,A}(\phi))}{\partial U_{\tau,A}(\phi)} \times \frac{\partial U_{\tau,A}(\phi)}{\partial \phi}$
使用恆等操作（對第二項偏微分變爲常量1），得到First-order MAML爲：
$g=\frac{\partial L_{\tau,B}(U_{\tau,A}(\phi))}{\partial U_{\tau,A}(\phi)}$
即損失下降梯度的方向爲在任務集 $A$ 得到參數 $\phi$ 的情況下，通過對測試集 $B$ 得到的損失最小化的方向即是外循環的方向。

3.Reptile

算法流程如下所示：

注意到可以一次迭代中將 $\widetilde{\phi}$ 進行 $k$ 步後，最後才確定梯度的方向。

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

Reptile: On First-Order Meta-Learning Algorithms

On First-Order Meta-Learning Algorithms

1.Main idea

2.MAML回顧

3.Reptile

Pre-training on Grayscale ImageNet Improves Medical Image Classification

決策樹和隨機森林原理（Decision Trees and Random Forests）

Reptile: On First-Order Meta-Learning Algorithms

patch-GAN(pixel2pixel)：Image-to-Image Translation with Conditional Adversarial Networks

範數的數學意義

Mac下配置sublime實現LaTeX

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結