論文筆記： Medical Exam Question Answering with Large-scale Reading Comprehension

原創

2020-06-28 02:24

S爲[question, 候選answer]拼接的集合，D={D_1, D_2, … , D_N}爲文檔集合。
L_Q:question與候選answer中的最大長度
L_D: 爲文檔D中的最大長度

Dual-path attention layer

Contex layer層的輸出爲S：[L_Q, d]、D_i：[L_D, d]
$D_n(j)$ 表示與候選S相關的第n篇文檔中的第j個字向量， $D_n$ 維度爲[L_D,d]
S維度爲[L_Q,d]

matching matrix由S與D做點積得到，維度爲[L_Q, L_D]，實際上是爲了做注意力機制。 $M=SD^T$

Q-centric

對matching matrix按行進行softmax， $R_n^Q={[R_n^Q(1),R_n^Q(2),...,R_n^Q(L_Q)]}$ ，維度爲[L_Q, d]，其中 $R_n^Q(i)$ 的維度爲[1,d]。
作用是利用D來表示S。 $R_n^Q=softmax(SD^T)*D$
做完softmax後，再與文檔向量相乘，這裏實際與bert中與V相乘的作用類似，即利用文檔向量 $D_n$ 對S中各個字的貢獻度。
中間可能有S的信息損失，所以再將S與 $R_n^Q$ 拼接，維度爲[L_Q, 2d]

D-centric

取matching matrix的每一列， $R_n^D$ 表示第n篇文檔按照“Q-centric”方法生成的矩陣，維度爲[L_D, d]，生成方法即第n篇與其餘剩下的文檔。利用S來表示D
$R_n^D=softmax(DS^T)*S$
$D_m(i)+R_m^D(i)$ 做拼接，維度爲[L_D, 2d]， $M_{mn}^{'}$ 的維度爲[L_D, 2d]，接着做注意力， $R_m^{'}{D}$ 的維度爲[L_D, 2d]

cross-document attention

對N篇文檔，按照D-centric中方法運算，最後得到[N, L_D, 2d]

matching feature

對matching matrix矩陣，經過兩層CNN，一層max-pooling後，維度爲[L_Q, d]

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

相關文章

Twitch表情中的情緒分析

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"typ

Martin Anderson

2021-12-07 16:00:03

達摩院AliceMind上新！首箇中文表格預訓練模型發佈，已向業界開源

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"typ

2021-12-02 18:18:58

在元宇宙裏怎麼交朋友？Meta發佈跨語種交流語音模型，支持128種語言無障礙對話

{"type":"doc","content":[{"type":"blockquote","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null

2021-11-23 14:03:53

人工智能時代，如何硬核玩音樂？| InfoQ《大咖說》

直播內容：在人工智能技術迅速發展的當下，越來越多的領域被這項技術注入新的活力。作爲多媒體領域中不可缺少的組成部分，音樂對於人類的重要性不言而喻。值得一提的是，人工智能在音樂領域的研究早在多年前就已經開始了，並且也落地了很多成熟應用。當前

InfoQ 中文站

2021-11-12 14:23:49

不是隻有數字化水平高，纔可以落地知識圖譜

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"blockq

2021-11-11 15:23:53

騰訊發佈超大預訓練系統派大星，聚焦解決BERT等超大模型訓練時的“GPU內存牆”問題

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"typ

2021-11-02 13:38:53

微軟和英偉達推出訓練語言模型MT-NLG：5300億參數量，是GPT-3的3倍

{"type":"doc","content":[{"type":"blockquote","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null

2021-10-12 14:13:53

谷歌推出Translatotron 2，一種沒有深度僞造潛力的語音到語音直接翻譯神經模型

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"typ

2021-09-10 14:09:01

放心，GPT-3不會“殺死”編程

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragr

2021-09-03 17:58:55

爲什麼神經網絡不適合理解自然語言？

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"typ

2021-08-04 16:13:54

易聊科技宣佈在線客服系統IM永久免費，透視智能客服的商業化潛力

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"blockq

2021-07-27 17:33:49

5個流行的自然語言處理庫及入門用法

{"type":"doc","content":[{"type":"blockquote","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null

2021-07-26 10:43:50

AI虛擬人多模態交互落地難題如何破解？我們在樂享A.I.技術沙龍成都站找到了答案

{"type":"doc","content":[{"type":"blockquote","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null

2021-06-24 16:18:54

官宣！達摩院開源祕藏深度語言模型體系AliceMind，NLP正在走向大工業時代

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"typ

2021-06-22 14:48:49

讓普通人秒會編程？微軟在Power平臺上集成GPT-3，將自然語言直接變成現成代碼

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"typ

2021-05-28 17:48:57

24小時熱門文章

最新文章

最新評論文章