The best options are the backbones with fewer than 6M parameters and at most 0.5 GFLOPs: MobileNetV3, EfficientNet-B0, ShuffleNetV2 and Lite-HRNet.

Backbone | Parameters (M) | GFLOPs |
---|---|---|
VGG-19 | 138 | 19.6 |
HRNet-W32 | 28.5 | 7.1 |
ResNet-50 | 25.6 | 4.1 |
MobileNetV3 | 5.4 | 0.35 |
EfficientNet-B0 | 5.3 | 0.39 |
ShuffleNetV2 | 3.5 | 0.15 |
Lite-HRNet | 1.1 | 0.5 |
SqueezeNet | 0.7 | 0.8 |
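
The selection rule can be sketched as a simple filter over the table above. The thresholds (strictly fewer than 6M parameters, at most 0.5 GFLOPs) are an assumed reading of the criterion, and the names `BACKBONES` and `is_lightweight` are purely illustrative.

```python
# (name, parameters in millions, GFLOPs), copied from the table above.
BACKBONES = [
    ("VGG-19", 138, 19.6),
    ("HRNet-W32", 28.5, 7.1),
    ("ResNet-50", 25.6, 4.1),
    ("MobileNetV3", 5.4, 0.35),
    ("EfficientNet-B0", 5.3, 0.39),
    ("ShuffleNetV2", 3.5, 0.15),
    ("Lite-HRNet", 1.1, 0.5),
    ("SqueezeNet", 0.7, 0.8),
]

MAX_PARAMS_M = 6.0  # assumed: strictly fewer than 6M parameters
MAX_GFLOPS = 0.5    # assumed: at most 0.5 GFLOPs (inclusive)


def is_lightweight(params_m, gflops):
    """Return True if a backbone fits both budgets."""
    return params_m < MAX_PARAMS_M and gflops <= MAX_GFLOPS


candidates = [name for name, params_m, gflops in BACKBONES
              if is_lightweight(params_m, gflops)]
print(candidates)
```

SqueezeNet has the fewest parameters in the table but is excluded here by its 0.8 GFLOPs.
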

To match the output resolution of VGG-19 (H/8), the feature map may need to be upsampled or downsampled. To match its channel count (512), a 1×1 convolution layer may be needed.

Backbone | Resolution | Channels | Used In |
---|---|---|---|
VGG-19 | H/8 | 512 | OpenPose |
MobileNetV3 | H/16 | 160 | Mediapipe |
EfficientNet-B0 | H/32 | 1280 | EfficientPose |
ShuffleNetV2 | H/8 | 464 | NanoDet |
Lite-HRNet | H/4 | 40 | MMPose |
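
As a rough sketch of what a drop-in replacement for VGG-19 would need, the snippet below derives the resize factor and the size of a 1x1 convolution for each backbone from the table above. The helper `plan_adapter` and the choice to count a bias term are assumptions made for illustration; the code only does the arithmetic and does not build any layers.

```python
# Target feature map: VGG-19 as listed in the table (stride 8, 512 channels).
TARGET_STRIDE = 8
TARGET_CHANNELS = 512

# (name, output stride, output channels), copied from the table above.
BACKBONES = [
    ("MobileNetV3", 16, 160),
    ("EfficientNet-B0", 32, 1280),
    ("ShuffleNetV2", 8, 464),
    ("Lite-HRNet", 4, 40),
]


def plan_adapter(stride, channels):
    """Hypothetical helper: describe the adapter needed to mimic VGG-19 features."""
    # A stride-16 map is half the size of a stride-8 map, so scale = 16 / 8 = 2.
    scale = stride / TARGET_STRIDE
    if scale > 1:
        resize = f"upsample x{scale:g}"
    elif scale < 1:
        resize = f"downsample x{1 / scale:g}"
    else:
        resize = "none"
    # A 1x1 convolution has one weight per (input, output) channel pair,
    # plus one bias per output channel (assumed here).
    needs_conv = channels != TARGET_CHANNELS
    conv_params = channels * TARGET_CHANNELS + TARGET_CHANNELS if needs_conv else 0
    return {"resize": resize, "conv_1x1_params": conv_params}


plans = {name: plan_adapter(stride, channels)
         for name, stride, channels in BACKBONES}
for name, plan in plans.items():
    print(f"{name}: {plan}")
```

Under these assumptions, ShuffleNetV2 needs no resizing, Lite-HRNet needs a 2x downsample, and even the largest adapter (EfficientNet-B0) adds only about 0.66M parameters.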