【目標檢測系列：五】2018 CVPR IoU-Net 論文閱讀解析總結

原創

鹿鹿最可爱

2020-06-20 06:51

2018 CVPR

Acquisition of Localization Confidence for Accurate Object Detection

PreciseRoIPooling 代碼
 ECCV 2018 | 曠視科技 Oral 論文解讀：IoU-Net 讓目標檢測用上定位置信度

參考的一篇博客

建議先自己看一遍論文，然後再看下面的總結

IoU-Net

解決問題： nms 過程中，是挑選分類置信度最大的值的框，但是它不一定框的準

Two drawbacks in object localization

the misalignment between classification confidence and localization accuracy
the non-monotonic bounding box regression

joint training

Backbone
ResNet-FPN
FPN
Precise RoI Pooling
Head
works in parallel
based on the same visual feature from the backbone
- IoU predictor
- R-CNN
  - classification and regression brance take 512 RoIs per image from RPNs

Training

img (800,1200)
batch size 16
lr 0.01
iteration 160k
warm up 0.004 ,10k

Training the IoU detector

smooth-L1 loss
IoU labels
normalized , distributed over [-1,1]

Inference

first apply bounding box regression for the initial coordinates
IoU-guide NMS
on all detected bounding boxes
refine using optimization-based algorithm
100 bounding boxes with highest classification confidence

Predict IoU

IoU predictor

aim
- takes features from the FPN
- estimates the localization accuracy (IoU) for each bounding box
data generation
- generate candidate bounding box set
  generate bounding boxes and labels for training the IoU-Net : augmenting the ground-truth,instead of taking proposals from RPNs
  for all ground-truth bounding box in training set , manually transform them with a set of randomized parameters
- remove the bounding box having an IoU < 0.5 with the matched ground-truth
feature
- extracted from the output of FPN with the proposed PrRoI-Pooling layers
- then fed into a two-layer feedforward network for the IoU prediction
use class-aware IoU predictors

IoU-guided NMS

use the predicted IoU instead of the classification confidence as the ranking keyword for bounding boxes.
to determine the classification scores
- select the box having the highest IoU with a ground-truth
- eliminate all other boxes having an overlap greater than threshold nms
- for a group of bounding boxes matching the same ground-truth, we take the most confident prediction for the class label.
  highest IoU 的框的分類置信度是其和他匹配同一gt的並大於閾值被濾掉的框的分類置信度的最大值
Algorithm
- ① 從bounding box集合 B 中依次選取預估IOU（localization confidence）最高的bounding box（記爲 $b_m$ ）
- ② 將與其IOU高於一定閾值的bounding box一個個選出來，並將這些bounding box（包括最開始選的 $b_m$ ）的最高classification confidence記爲 $s$
- ③ 將 $(b_m,s)$ 二元組記錄到集合 D 中 (本質是 bounding box和cls conf的重新分配)

Optimization-based bounding box refinement

Algorithm
- 對於檢測到的bounding box，利用 PrPool 提取內部特徵並算出 IOUnet 預測的IOU，記其梯度爲grad，這個IOU記爲PrevScore
- 然後更新bounding box
- 更新之後重新進行IOU預測結果爲NewScore
- 如果 prevscore 和 newscore 相差小於一個early-stop閾值或者 newscore 比 prevscore 低於一個“定位退化容忍度”，則認爲該bounding box更新完畢。

PrPool

連續
可導

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

【目標檢測系列：五】2018 CVPR IoU-Net 論文閱讀解析總結

IoU-Net

joint training

Predict IoU

10分鐘搞定Mysql主從部署配置

如何使用 JS 判斷用戶是否處於活躍狀態

「Pygors跨平臺GUI」2：安裝MinGW-w64、MSYS2還是WSL2

[轉帖]

python列出centos7內存使用前50的進程信息

「Pygors跨平臺GUI」1：Pygors跨平臺GUI應用研究

一鍵自動化博客發佈工具,用過的人都說好(掘金篇)

lightdb數據庫超時相關控制參數

lightdb秒級增加列和刪除列（not null帶默認值）

Java ThreadPoolShutdown

【leetcode】小彩蛋｜主頁draw

【Algorithm】查找之“折半查找”

【醫學+深度論文：F32】2017 SPIE Automated detection of nerve fiber layer defects on retinal fundus

【Interview】 ResNet系列/ Inception系列/ MobileNet系列/ ShuffleNet系列網絡結構圖

2019 AI procon | 張祥雨高效輕量級深度模型的研究和實踐 AI開發者大會部分內容

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結