Agriculture Vision 2021
Results Summary
NOTE all our single model’s scores are computed with just single-scale (512x512) and single feed-forward inference without TTA. TTA denotes test time augmentation (e.g. flip and mirror). Ensemble_TTA (checkpoint1,2) denotes two core.net.(checkpoint1, and checkpoint2) ensemble with TTA, and (checkpoint1, 2, 3) denotes three core.net.ensemble.
Models |
mIoU (%) |
Background |
Cloud shadow |
Double plant |
Planter skip |
Standing water |
Waterway |
Weed cluster |
---|---|---|---|---|---|---|---|---|
MSCG-Net-50 (ckpt1) |
54.7 |
78.0 |
50.7 |
46.6 |
34.3 |
68.8 |
51.3 |
53.0 |
*MSCG-Net-101 (ckpt2)* |
*55.0* |
*79.8* |
*44.8* |
*55.0* |
*30.5* |
*65.4* |
*59.2* |
*50.6* |
MSCG-Net-101_k31 (ckpt3) |
54.1 |
79.6 |
46.2 |
54.6 |
9.1 |
74.3 |
62.4 |
52.1 |
Ensemble_TTA (ckpt1,2) |
59.9 |
80.1 |
50.3 |
57.6 |
52.0 |
69.6 |
56.0 |
53.8 |
Ensemble_TTA (ckpt1,2,3) |
60.8 |
80.5 |
51.0 |
58.6 |
49.8 |
72.0 |
59.8 |
53.8 |
Ensemble_TTA (new_5model) |
62.2 |
80.6 |
48.7 |
62.4 |
58.7 |
71.3 |
60.1 |
53.4 |
Model Size
NOTE all backbones used pretrained weights on ImageNet that can be imported and downloaded from the link. And MSCG-Net-101_k31 has exactly the same architecture wit MSCG-Net-101, while it is trained with extra 1/3 validation set (4,431) instead of just using the official training images (12,901).
Models |
Backbones |
Parameters |
GFLOPs |
Inference time (CPU/GPU ) |
---|---|---|---|---|
MSCG-Net-50 |
Se_ResNext50_32x4d |
9.59 |
18.21 |
522 / 26 ms |
MSCG-Net-101 |
Se_ResNext101_32x4d |
30.99 |
37.86 |
752 / 45 ms |
MSCG-Net-101_k31 |
Se_ResNext101_32x4d |
30.99 |
37.86 |
752 / 45 ms |