Skip to content

Commit fc24307

Browse files
Y-T-Gactions-user
andauthored
Add YOLO11 P2 and P6 configs (#73)
* Merge pull request #57 from Y-T-G/scale-validation Add support for multiple scales in YAML * Handle invalid `flops` and `params` * Support parsing list of FLOPs and parameters * Add YOLO11-P2 Detect config (#56) * Add YOLO11-P2 Detect config * Update output shapes * Update FLOPs and parameters * Add discussion link to detect/yolo11-p2.yaml * Add YOLO11-P2 Segment config (#58) * Add YOLO11-P2 Segment config * Update yolo11-p2.yaml * Update FLOPs and parameters * Add YOLO11-OBB P2 config (#59) * Add discussion link to obb/yolo11-p2.yaml * Add discussion link to segment/yolo11-p2.yaml * Add YOLO11-Pose P2 config (#63) * Add discussion link to pose/yolo11-p2.yaml * Add YOLO11-Detect P6 config (#65) * Add discussion link to detect/yolo11-p6.yaml * Add YOLO11-Segment P6 config (#66) * Add discussion link to segment/yolo11-p6.yaml * Add YOLO11-Pose P6 config (#68) * Add YOLO11-OBB P6 config (#67) * Add discussion link to obb/yolo11-p6.yaml * Add discussion link to pose/yolo11-p6.yaml --------- Co-authored-by: GitHub Action <action@github.com>
1 parent 2beaf20 commit fc24307

File tree

10 files changed

+826
-33
lines changed

10 files changed

+826
-33
lines changed

cfg/detect/yolo11-p2.yaml

Lines changed: 87 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,87 @@
1+
# Metadata
2+
author: Y-T-G
3+
task: detect
4+
keywords: [cnn, yolo11, small-object]
5+
description: YOLO11 object detection model with an extra P2 scale for small object detecton.
6+
flops:
7+
n: 11.4
8+
s: 29.7
9+
m: 88.9
10+
l: 113.6
11+
x: 245.1
12+
parameters:
13+
n: 2740032
14+
s: 9625968
15+
m: 20591120
16+
l: 26116368
17+
x: 57849456
18+
min_version: 8.3.0
19+
discussion: https://github.com/Y-T-G/community/discussions/60
20+
21+
# Model
22+
nc: 80
23+
strides: [4, 8, 16, 32]
24+
scales:
25+
n: [0.50, 0.25, 1024]
26+
s: [0.50, 0.50, 1024]
27+
m: [0.50, 1.00, 512]
28+
l: [1.00, 1.00, 512]
29+
x: [1.00, 1.50, 512]
30+
31+
backbone:
32+
# Stem
33+
- [-1, 1, Conv, [64, 3, 2]] # (1, 64, 320, 320) - 0
34+
35+
# P2 Backbone
36+
- [-1, 1, Conv, [128, 3, 2]] # (1, 128, 160, 160) - 1
37+
- [-1, 2, C3k2, [256, False, 0.25]] # (1, 256, 160, 160) - 2
38+
39+
# P3 Backbone
40+
- [-1, 1, Conv, [256, 3, 2]] # (1, 256, 80, 80) - 3
41+
- [-1, 2, C3k2, [512, False, 0.25]] # (1, 512, 80, 80) - 4
42+
43+
# P4 Backbone
44+
- [-1, 1, Conv, [512, 3, 2]] # (1, 512, 40, 40) - 5
45+
- [-1, 2, C3k2, [512, True]] # (1, 512, 40, 40) - 6
46+
47+
# P5 Backbone
48+
- [-1, 1, Conv, [1024, 3, 2]] # (1, 512, 20, 20) - 7
49+
- [-1, 2, C3k2, [1024, True]] # (1, 512, 20, 20) - 8
50+
- [-1, 1, SPPF, [1024, 5]] # (1, 512, 20, 20) - 9
51+
- [-1, 2, C2PSA, [1024]] # (1, 512, 20, 20) - 10
52+
53+
head:
54+
# Top-Down Path (Feature Pyramid)
55+
# P4 Head
56+
- [-1, 1, nn.Upsample, [None, 2, nearest]] # (1, 512, 40, 40) - 11
57+
- [[-1, 6], 1, Concat, [1]] # (1, 1024, 40, 40) - 12
58+
- [-1, 2, C3k2, [512, False]] # (1, 512, 40, 40) - 13
59+
60+
# P3 Head
61+
- [-1, 1, nn.Upsample, [None, 2, nearest]] # (1, 512, 80, 80) - 14
62+
- [[-1, 4], 1, Concat, [1]] # (1, 1024, 80, 80) - 15
63+
- [-1, 2, C3k2, [256, False]] # (1, 256, 80, 80) - 16
64+
65+
# P2 Head
66+
- [-1, 1, nn.Upsample, [None, 2, nearest]] # (1, 256, 160, 160) - 17
67+
- [[-1, 2], 1, Concat, [1]] # (1, 512, 160, 160) - 18
68+
- [-1, 2, C3k2, [128, False]] # (1, 128, 160, 160) - 19
69+
70+
# Bottom-Up Path (PAN-FPN)
71+
# P3
72+
- [-1, 1, Conv, [128, 3, 2]] # (1, 128, 80, 80) - 20
73+
- [[-1, 16], 1, Concat, [1]] # (1, 384, 80, 80) - 21
74+
- [-1, 2, C3k2, [256, False]] # (1, 256, 80, 80) - 22
75+
76+
# P4
77+
- [-1, 1, Conv, [256, 3, 2]] # (1, 256, 40, 40) - 23
78+
- [[-1, 13], 1, Concat, [1]] # (1, 768, 40, 40) - 24
79+
- [-1, 2, C3k2, [512, False]] # (1, 512, 40, 40) - 25
80+
81+
# P5
82+
- [-1, 1, Conv, [512, 3, 2]] # (1, 512, 20, 20) - 26
83+
- [[-1, 10], 1, Concat, [1]] # (1, 1024, 20, 20) - 27
84+
- [-1, 2, C3k2, [1024, True]] # (1, 512, 20, 20) - 28
85+
86+
# Detection Head (P2, P3, P4, P5)
87+
- [[19, 22, 25, 28], 1, Detect, [nc]] # 29

cfg/detect/yolo11-p6.yaml

Lines changed: 91 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,91 @@
1+
# Metadata
2+
author: Y-T-G
3+
task: detect
4+
keywords: [cnn, yolo11, large-object]
5+
description: YOLO11 detection model with an extra P6 scale for large object detecton.
6+
flops:
7+
n: 7.0
8+
s: 23.2
9+
m: 70.5
10+
l: 90.3
11+
x: 202.1
12+
parameters:
13+
n: 4391920
14+
s: 16130064
15+
m: 30063504
16+
l: 37883024
17+
x: 85074352
18+
min_version: 8.3.0
19+
discussion: https://github.com/Y-T-G/community/discussions/69
20+
21+
# Model
22+
nc: 80
23+
strides: [8, 16, 32, 64]
24+
scales:
25+
n: [0.50, 0.25, 1024]
26+
s: [0.50, 0.50, 1024]
27+
m: [0.50, 1.00, 512]
28+
l: [1.00, 1.00, 512]
29+
x: [1.00, 1.50, 512]
30+
31+
backbone:
32+
# Stem
33+
- [-1, 1, Conv, [64, 3, 2]] # (1, 64, 320, 320) - 0
34+
35+
# P2 Backbone
36+
- [-1, 1, Conv, [128, 3, 2]] # (1, 128, 160, 160) - 1
37+
- [-1, 2, C3k2, [256, False, 0.25]] # (1, 256, 160, 160) - 2
38+
39+
# P3 Backbone
40+
- [-1, 1, Conv, [256, 3, 2]] # (1, 256, 80, 80) - 3
41+
- [-1, 2, C3k2, [512, False, 0.25]] # (1, 512, 80, 80) - 4
42+
43+
# P4 Backbone
44+
- [-1, 1, Conv, [512, 3, 2]] # (1, 512, 40, 40) - 5
45+
- [-1, 2, C3k2, [768, True]] # (1, 512, 40, 40) - 6
46+
47+
# P5 Backbone
48+
- [-1, 1, Conv, [768, 3, 2]] # (1, 512, 20, 20) - 7
49+
- [-1, 2, C3k2, [768, True]] # (1, 512, 20, 20) - 8
50+
51+
# P6 Backbone
52+
- [-1, 1, Conv, [1024, 3, 2]] # (1, 512, 10, 10) - 9
53+
- [-1, 2, C3k2, [1024, True]] # (1, 512, 10, 10) - 10
54+
- [-1, 1, SPPF, [1024, 5]] # (1, 512, 10, 10) - 11
55+
- [-1, 2, C2PSA, [1024]] # (1, 512, 10, 10) - 12
56+
57+
head:
58+
# Top-Down Path (Feature Pyramid)
59+
# P5 Head
60+
- [-1, 1, nn.Upsample, [None, 2, nearest]] # (1, 512, 20, 20) - 13
61+
- [[-1, 8], 1, Concat, [1]] # (1, 1024, 20, 20) - 14
62+
- [-1, 2, C3k2, [768, False]] # (1, 512, 20, 20) - 15
63+
64+
# P4 Head
65+
- [-1, 1, nn.Upsample, [None, 2, nearest]] # (1, 512, 40, 40) - 16
66+
- [[-1, 6], 1, Concat, [1]] # (1, 1024, 40, 40) - 17
67+
- [-1, 2, C3k2, [512, False]] # (1, 512, 40, 40) - 18
68+
69+
# P3 Head
70+
- [-1, 1, nn.Upsample, [None, 2, nearest]] # (1, 512, 80, 80) - 19
71+
- [[-1, 4], 1, Concat, [1]] # (1, 1024, 80, 80) - 20
72+
- [-1, 2, C3k2, [256, False]] # (1, 256, 80, 80) - 21
73+
74+
# Bottom-Up Path (PAN-FPN)
75+
# P4
76+
- [-1, 1, Conv, [256, 3, 2]] # (1, 256, 40, 40) - 22
77+
- [[-1, 16], 1, Concat, [1]] # (1, 768, 40, 40) - 23
78+
- [-1, 2, C3k2, [512, False]] # (1, 512, 40, 40) - 24
79+
80+
# P5
81+
- [-1, 1, Conv, [512, 3, 2]] # (1, 512, 20, 20) - 25
82+
- [[-1, 13], 1, Concat, [1]] # (1, 1024, 20, 20) - 26
83+
- [-1, 2, C3k2, [768, True]] # (1, 512, 20, 20) - 27
84+
85+
# P6
86+
- [-1, 1, Conv, [768, 3, 2]] # (1, 512, 10, 10) - 28
87+
- [[-1, 10], 1, Concat, [1]] # (1, 1024, 10, 10) - 29
88+
- [-1, 2, C3k2, [1024, True]] # (1, 512, 10, 10) - 30
89+
90+
# Detect Head (P3, P4, P5, P6)
91+
- [[21, 24, 27, 30], 1, Detect, [nc]] # 31

cfg/obb/yolo11-p2.yaml

Lines changed: 87 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,87 @@
1+
# Metadata
2+
author: Y-T-G
3+
task: obb
4+
keywords: [cnn, yolo11, small-object]
5+
description: YOLO11 rotated bounding box model with an extra P2 scale for small object detecton.
6+
flops:
7+
n: 11.7
8+
s: 30.7
9+
m: 93.0
10+
l: 117.6
11+
x: 254.3
12+
parameters:
13+
n: 2777060
14+
s: 9773748
15+
m: 21034132
16+
l: 26559380
17+
x: 58845748
18+
min_version: 8.3.0
19+
discussion: https://github.com/Y-T-G/community/discussions/62
20+
21+
# Model
22+
nc: 80
23+
strides: [4, 8, 16, 32, ]
24+
scales:
25+
n: [0.50, 0.25, 1024]
26+
s: [0.50, 0.50, 1024]
27+
m: [0.50, 1.00, 512]
28+
l: [1.00, 1.00, 512]
29+
x: [1.00, 1.50, 512]
30+
31+
backbone:
32+
# Stem
33+
- [-1, 1, Conv, [64, 3, 2]] # (1, 64, 320, 320) - 0
34+
35+
# P2 Backbone
36+
- [-1, 1, Conv, [128, 3, 2]] # (1, 128, 160, 160) - 1
37+
- [-1, 2, C3k2, [256, False, 0.25]] # (1, 256, 160, 160) - 2
38+
39+
# P3 Backbone
40+
- [-1, 1, Conv, [256, 3, 2]] # (1, 256, 80, 80) - 3
41+
- [-1, 2, C3k2, [512, False, 0.25]] # (1, 512, 80, 80) - 4
42+
43+
# P4 Backbone
44+
- [-1, 1, Conv, [512, 3, 2]] # (1, 512, 40, 40) - 5
45+
- [-1, 2, C3k2, [512, True]] # (1, 512, 40, 40) - 6
46+
47+
# P5 Backbone
48+
- [-1, 1, Conv, [1024, 3, 2]] # (1, 512, 20, 20) - 7
49+
- [-1, 2, C3k2, [1024, True]] # (1, 512, 20, 20) - 8
50+
- [-1, 1, SPPF, [1024, 5]] # (1, 512, 20, 20) - 9
51+
- [-1, 2, C2PSA, [1024]] # (1, 512, 20, 20) - 10
52+
53+
head:
54+
# Top-Down Path (Feature Pyramid)
55+
# P4 Head
56+
- [-1, 1, nn.Upsample, [None, 2, nearest]] # (1, 512, 40, 40) - 11
57+
- [[-1, 6], 1, Concat, [1]] # (1, 1024, 40, 40) - 12
58+
- [-1, 2, C3k2, [512, False]] # (1, 512, 40, 40) - 13
59+
60+
# P3 Head
61+
- [-1, 1, nn.Upsample, [None, 2, nearest]] # (1, 512, 80, 80) - 14
62+
- [[-1, 4], 1, Concat, [1]] # (1, 1024, 80, 80) - 15
63+
- [-1, 2, C3k2, [256, False]] # (1, 256, 80, 80) - 16
64+
65+
# P2 Head
66+
- [-1, 1, nn.Upsample, [None, 2, nearest]] # (1, 256, 160, 160) - 17
67+
- [[-1, 2], 1, Concat, [1]] # (1, 512, 160, 160) - 18
68+
- [-1, 2, C3k2, [128, False]] # (1, 128, 160, 160) - 19
69+
70+
# Bottom-Up Path (PAN-FPN)
71+
# P3
72+
- [-1, 1, Conv, [128, 3, 2]] # (1, 128, 80, 80) - 20
73+
- [[-1, 16], 1, Concat, [1]] # (1, 384, 80, 80) - 21
74+
- [-1, 2, C3k2, [256, False]] # (1, 256, 80, 80) - 22
75+
76+
# P4
77+
- [-1, 1, Conv, [256, 3, 2]] # (1, 256, 40, 40) - 23
78+
- [[-1, 13], 1, Concat, [1]] # (1, 768, 40, 40) - 24
79+
- [-1, 2, C3k2, [512, False]] # (1, 512, 40, 40) - 25
80+
81+
# P5
82+
- [-1, 1, Conv, [512, 3, 2]] # (1, 512, 20, 20) - 26
83+
- [[-1, 10], 1, Concat, [1]] # (1, 1024, 20, 20) - 27
84+
- [-1, 2, C3k2, [1024, True]] # (1, 512, 20, 20) - 28
85+
86+
# OBB Head (P2, P3, P4, P5)
87+
- [[19, 22, 25, 28], 1, OBB, [nc, 1]] # 29

cfg/obb/yolo11-p6.yaml

Lines changed: 91 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,91 @@
1+
# Metadata
2+
author: Y-T-G
3+
task: obb
4+
keywords: [cnn, yolo11, large-object]
5+
description: YOLO11 rotated bounding box model with an extra P6 scale for large object detecton.
6+
flops:
7+
n: 7.2
8+
s: 24.2
9+
m: 74.3
10+
l: 94.1
11+
x: 210.6
12+
parameters:
13+
n: 4493620
14+
s: 16536212
15+
m: 31244436
16+
l: 39063956
17+
x: 87730484
18+
min_version: 8.3.0
19+
discussion: https://github.com/Y-T-G/community/discussions/72
20+
21+
# Model
22+
nc: 80
23+
strides: [8, 16, 32, 64]
24+
scales:
25+
n: [0.50, 0.25, 1024]
26+
s: [0.50, 0.50, 1024]
27+
m: [0.50, 1.00, 512]
28+
l: [1.00, 1.00, 512]
29+
x: [1.00, 1.50, 512]
30+
31+
backbone:
32+
# Stem
33+
- [-1, 1, Conv, [64, 3, 2]] # (1, 64, 320, 320) - 0
34+
35+
# P2 Backbone
36+
- [-1, 1, Conv, [128, 3, 2]] # (1, 128, 160, 160) - 1
37+
- [-1, 2, C3k2, [256, False, 0.25]] # (1, 256, 160, 160) - 2
38+
39+
# P3 Backbone
40+
- [-1, 1, Conv, [256, 3, 2]] # (1, 256, 80, 80) - 3
41+
- [-1, 2, C3k2, [512, False, 0.25]] # (1, 512, 80, 80) - 4
42+
43+
# P4 Backbone
44+
- [-1, 1, Conv, [512, 3, 2]] # (1, 512, 40, 40) - 5
45+
- [-1, 2, C3k2, [768, True]] # (1, 512, 40, 40) - 6
46+
47+
# P5 Backbone
48+
- [-1, 1, Conv, [768, 3, 2]] # (1, 512, 20, 20) - 7
49+
- [-1, 2, C3k2, [768, True]] # (1, 512, 20, 20) - 8
50+
51+
# P6 Backbone
52+
- [-1, 1, Conv, [1024, 3, 2]] # (1, 512, 10, 10) - 9
53+
- [-1, 2, C3k2, [1024, True]] # (1, 512, 10, 10) - 10
54+
- [-1, 1, SPPF, [1024, 5]] # (1, 512, 10, 10) - 11
55+
- [-1, 2, C2PSA, [1024]] # (1, 512, 10, 10) - 12
56+
57+
head:
58+
# Top-Down Path (Feature Pyramid)
59+
# P5 Head
60+
- [-1, 1, nn.Upsample, [None, 2, nearest]] # (1, 512, 20, 20) - 13
61+
- [[-1, 8], 1, Concat, [1]] # (1, 1024, 20, 20) - 14
62+
- [-1, 2, C3k2, [768, False]] # (1, 512, 20, 20) - 15
63+
64+
# P4 Head
65+
- [-1, 1, nn.Upsample, [None, 2, nearest]] # (1, 512, 40, 40) - 16
66+
- [[-1, 6], 1, Concat, [1]] # (1, 1024, 40, 40) - 17
67+
- [-1, 2, C3k2, [512, False]] # (1, 512, 40, 40) - 18
68+
69+
# P3 Head
70+
- [-1, 1, nn.Upsample, [None, 2, nearest]] # (1, 512, 80, 80) - 19
71+
- [[-1, 4], 1, Concat, [1]] # (1, 1024, 80, 80) - 20
72+
- [-1, 2, C3k2, [256, False]] # (1, 256, 80, 80) - 21
73+
74+
# Bottom-Up Path (PAN-FPN)
75+
# P4
76+
- [-1, 1, Conv, [256, 3, 2]] # (1, 256, 40, 40) - 22
77+
- [[-1, 16], 1, Concat, [1]] # (1, 768, 40, 40) - 23
78+
- [-1, 2, C3k2, [512, False]] # (1, 512, 40, 40) - 24
79+
80+
# P5
81+
- [-1, 1, Conv, [512, 3, 2]] # (1, 512, 20, 20) - 25
82+
- [[-1, 13], 1, Concat, [1]] # (1, 1024, 20, 20) - 26
83+
- [-1, 2, C3k2, [768, True]] # (1, 512, 20, 20) - 27
84+
85+
# P6
86+
- [-1, 1, Conv, [768, 3, 2]] # (1, 512, 10, 10) - 28
87+
- [[-1, 10], 1, Concat, [1]] # (1, 1024, 10, 10) - 29
88+
- [-1, 2, C3k2, [1024, True]] # (1, 512, 10, 10) - 30
89+
90+
# OBB Head (P3, P4, P5, P6)
91+
- [[21, 24, 27, 30], 1, OBB, [nc, 1]] # 31

0 commit comments

Comments
 (0)