@@ -171,8 +171,8 @@ class Yolov3LossOpMaker : public framework::OpProtoAndCheckerMaker {
171
171
172
172
The output of previous network is in shape [N, C, H, W], while H and W
173
173
should be the same, H and W specify the grid size, each grid point predict
174
- given number boxes, this given number, which following will be represented as S,
175
- is specified by the number of anchors, In the second dimension(the channel
174
+ given number bounding boxes, this given number, which following will be represented as S,
175
+ is specified by the number of anchor clusters in each scale. In the second dimension(the channel
176
176
dimension), C should be equal to S * (class_num + 5), class_num is the object
177
177
category number of source dataset(such as 80 in coco dataset), so in the
178
178
second(channel) dimension, apart from 4 box location coordinates x, y, w, h,
@@ -203,7 +203,7 @@ class Yolov3LossOpMaker : public framework::OpProtoAndCheckerMaker {
203
203
thresh, the confidence score loss of this anchor box will be ignored.
204
204
205
205
Therefore, the yolov3 loss consist of three major parts, box location loss,
206
- confidence score loss, and classification loss. The L2 loss is used for
206
+ confidence score loss, and classification loss. The L1 loss is used for
207
207
box coordinates (w, h), and sigmoid cross entropy loss is used for box
208
208
coordinates (x, y), confidence score loss and classification loss.
209
209
0 commit comments