基於論壇話題段落劃分的答案識別

從實驗室離開兩年了,想不到畢業設計論文被髮表出來了。哈哈

http://www.aas.net.cn/qikan/Cpaper/zhaiyao.asp?bsid=14676


*******************************************************我是華麗的分割線*******************************************************


基於論壇話題段落劃分的答案識別

摘要
在論壇話題中識別答案是面向論壇的問答對挖掘中的核心問題. 在論壇話題的討論中通常存在隱式的結構, 這種結構信息非常有助於最佳答案的定位和識別. 本文提出了一種基於中文論壇話題段落劃分的答案識別方法: 首先將論壇話題重新組織爲若干段落的集合, 並基於此劃分提取一組能夠反映話題討論邏輯結構的特徵. 在此基礎上給出了一種可以根據候選答案所在段落類別實現模型選擇的答案識別策略, 從而避免了噪聲信息對模型預測的誤導. 實驗結果表明本文的答案識別方法非常適用於面向在線論壇的問答資源挖掘工作.
關鍵詞   話題段落劃分   非文本特徵   答案識別   在線論壇   問答對挖掘   


Thread Segmentation Based Answer Detection in Chinese Online Forums

Abstract
Detecting answers in the threads is an essential task for the online forum oriented question-answer (QA) pair mining. In the forum threads, there normally exist implicit discussion structures with the valuable indication for locating the best answers. This paper proposes a thread segmentation based answer detecting approach: a forum thread is reorganized into several segments, and a group of features reflecting the discussion structures are extracted based on the segmentation results. Utilizing the segment information, a strategy is put forward to find the best answers. By evaluating the candidate answers in different types of segments with different models, the strategy filters the samples that mislead the decision. The experimental results show that our approach is promising for mining the QA resource in the online forums.
Key words   Thread segmentation   non-textual feature   answer detection   online forum   question-answer (QA) pair mining  

發表評論
所有評論
還沒有人評論,想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.
相關文章