Test Conditions:
Test Board: S100.
Number of Test Cores: Single core.
Frequency to obtain model performance data: Average of performance parameters over a 5-minute period.
Python version: Python 3.10.
Models: Models under the samples/ucp_tutorial/dnn/ai_benchmark/s100 path in the OE package.
Runtime Environment: Linux.
Table Header Acronyms:
C = Computation, in GOPs (i.e., billion operations per second), obtained by calling the hbm_perf interface.
FPS = Frame(s) Per Second, obtained by running the fps.sh script with multi thread of different models in the ai_benchmark sample package/script on the dev board. Post-processing included.
ITC = Inference Time Consumption, in ms (millisecond), obtained by running the latency.sh script with single thread of different models in the ai_benchmark sample package/script on the dev board. Post-processing not included.
TCPP = Postprocess Time Consumption, in ms (millisecond), obtained by running the latency.sh script with single thread of different models in the ai_benchmark sample package/script on the dev board.
RV = Read Volume in a single inference, in mb (Mbit), obtained by calling the hbm_perf interface.
WV = Write Volume in a single inference, in mb (Mbit), obtained by calling the hbm_perf interface.
| MODEL NAME | INPUT SIZE | C(GOPs) | FPS | ITC(ms) | TCPP(ms) | ACCURACY | Dataset |
|---|---|---|---|---|---|---|---|
| MobileNetv1 | 1x3x224x224 | 1.14 | 4594.40 | 0.466 | 0.033 | Top1: 0.7374(FLOAT)/0.7295(INT8) | ImageNet |
| MobileNetv2 | 1x3x224x224 | 0.63 | 4591.90 | 0.487 | 0.033 | Top1: 0.7218(FLOAT)/0.7146(INT8) | ImageNet |
| ResNet50 | 1x3x224x224 | 7.72 | 1152.80 | 1.161 | 0.033 | Top1: 0.7704(FLOAT)/0.7673(INT8) | ImageNet |
| GoogleNet | 1x3x224x224 | 3.00 | 2858.20 | 0.636 | 0.032 | Top1: 0.7018(FLOAT)/0.6998(INT8) | ImageNet |
| EfficientNet_Lite0 | 1x224x224x3 | 0.77 | 4068.20 | 0.559 | 0.032 | Top1: 0.7479(FLOAT)/0.7454(INT8) | ImageNet |
| EfficientNet_Lite1 | 1x240x240x3 | 1.20 | 3231.60 | 0.624 | 0.032 | Top1: 0.7652(FLOAT)/0.7614(INT8) | ImageNet |
| EfficientNet_Lite2 | 1x260x260x3 | 1.72 | 2478.80 | 0.723 | 0.032 | Top1: 0.7734(FLOAT)/0.7697(INT8) | ImageNet |
| EfficientNet_Lite3 | 1x280x280x3 | 2.77 | 1898.20 | 0.841 | 0.032 | Top1: 0.7917(FLOAT)/0.7896(INT8) | ImageNet |
| EfficientNet_Lite4 | 1x300x300x3 | 5.11 | 1300.50 | 1.082 | 0.032 | Top1: 0.8063(FLOAT)/0.8043(INT8) | ImageNet |
| Vargconvnet | 1x3x224x224 | 9.06 | 1464.80 | 0.974 | 0.032 | Top1: 0.7793(FLOAT)/0.7770(INT8) | ImageNet |
| Efficientnasnet_m | 1x3x300x300 | 4.53 | 1477.10 | 0.971 | 0.032 | Top1: 0.7935(FLOAT)/0.7923(INT8) | ImageNet |
| Efficientnasnet_s | 1x3x280x280 | 1.44 | 3327.70 | 0.590 | 0.031 | Top1: 0.7441(FLOAT)/0.7524(INT8) | ImageNet |
| ResNet18 | 1x3x224x224 | 3.63 | 2567.80 | 0.671 | 0.032 | Top1: 0.7170(FLOAT)/0.7162(INT8) | ImageNet |
| YOLOv2_Darknet19 | 1x3x608x608 | 62.94 | 227.64 | 4.712 | 0.293 | [IoU=0.50:0.95]= 0.2760(FLOAT)/0.2700(INT8) | COCO |
| YOLOv3_Darknet53 | 1x3x416x416 | 65.86 | 210.85 | 5.101 | 1.611 | [IoU=0.50:0.95]= 0.3370(FLOAT)/0.3360(INT8) | COCO |
| YOLOv5x_v2.0 | 1x3x672x672 | 243.85 | 62.44 | 16.437 | 5.683 | [IoU=0.50:0.95]= 0.4810(FLOAT)/0.4670(INT8) | COCO |
| SSD_MobileNetv1 | 1x3x300x300 | 2.30 | 3223.40 | 0.687 | 0.184 | mAP: 0.7345(FLOAT)/0.7269(INT8) | VOC |
| Centernet_resnet101 | 1x3x512x512 | 90.53 | 186.75 | 5.714 | 0.928 | [IoU=0.50:0.95]= 0.3420(FLOAT)/0.3270(INT8) | COCO |
| YOLOv3_VargDarknet | 1x3x416x416 | 42.82 | 307.08 | 3.616 | 1.575 | [IoU=0.50:0.95]= 0.3280(FLOAT)/0.3270(INT8) | COCO |
| Deeplabv3plus_efficientnetb0 | 1x3x1024x2048 | 30.77 | 152.01 | 6.968 | 0.254 | mIoU: 0.7630(FLOAT)/0.7571(INT8) | Cityscapes |
| Fastscnn_efficientnetb0 | 1x3x1024x2048 | 12.48 | 253.93 | 4.278 | 0.264 | mIoU: 0.6997(FLOAT)/0.6914(INT8) | Cityscapes |
| Deeplabv3plus_efficientnetm1 | 1x3x1024x2048 | 77.04 | 92.28 | 11.184 | 0.264 | mIoU: 0.7794(FLOAT)/0.7754(INT8) | Cityscapes |
| Deeplabv3plus_efficientnetm2 | 1x3x1024x2048 | 124.15 | 64.83 | 15.818 | 0.255 | mIoU: 0.7882(FLOAT)/0.7854(INT8) | Cityscapes |
| Bev_gkt_mixvargenet_multitask | image: 6x3x512x960 points(0-8): 6x64x64x2 | 207.16 | 68.54 | 15.425 | 4.204 | NDS: 0.2810(FLOAT)/0.2782(INT8) MeanIOU: 0.4852(FLOAT)/0.4836(INT8) mAP: 0.1990(FLOAT)/0.1997(INT8) | Nuscenes |
| Bev_ipm_4d_efficientnetb0_multitask | image: 6x3x512x960 points: 6x128x128x2 prev_feat: 1x164x28x128 prev_point: 1x128x128x2 | 53.58 | 112.15 | 9.898 | 4.163 | NDS: 0.3721(FLOAT)/0.3728(INT8) MeanIOU: 0.5287(FLOAT)/0.5388(INT8) mAP: 0.2200(FLOAT)/0.2216(INT8) | Nuscenes |
| Bev_ipm_efficientnetb0_multitask | image: 6x3x512x960 points: 6x128x128x2 | 52.97 | 115.33 | 9.443 | 4.250 | NDS: 0.3054(FLOAT)/0.3041(INT8) MeanIOU: 0.5145(FLOAT)/0.5104(INT8) mAP: 0.2170(FLOAT)/0.2163(INT8) | Nuscenes |
| Bev_lss_efficientnetb0_multitask | image: 6x3x256x704 points(0&1): 10x128x128x2 | 24.06 | 186.93 | 6.116 | 4.191 | NDS: 0.3007(FLOAT)/0.2991(INT8) MeanIOU: 0.5180(FLOAT)/0.5147(INT8) mAP: 0.2062(FLOAT)/0.2040(INT8) | Nuscenes |
| Detr3d_efficientnetb3 | coords(0-3): 6x4x256x2 image: 6x3x512x1408 masks: 1x4x256x24 | 227.71 | 32.09 | 31.709 | 1.111 | NDS: 0.3304(FLOAT)/0.3285(INT8) mAP: 0.2753(FLOAT)/0.2708(INT8) | Nuscenes |
| Petr_efficientnetb3 | image: 6x3x512x1408 pos_embed: 1x96x44x256 | 219.17 | 18.89 | 53.507 | 1.127 | NDS: 0.3765(FLOAT)/0.3731(INT8) mAP: 0.3038(FLOAT)/0.2925(INT8) | Nuscenes |
| Bevformer_tiny_resnet50_detection | img: 6x3x480x800 prev_bev: 1x2500x256 prev_bev_ref: 1x50x50x2 queries_rebatch_grid: 6x20x32x2 restore_bev_grid: 1x100x50x2 reference_points_rebatch: 6x640x4x2 bev_pillar_counts: 1x2500x1 | 387.29 | 31.29 | 41.686 | 1.200 | NDS: 0.3713(FLOAT)/0.3680(INT8) mAP: 0.2673(FLOAT)/0.2619(INT8) | Nuscenes |
| Flashocc_henet_lss_occ3d_nuscenes | img: 6x3x512x960 points: 10x128x128x2 points_depth: 10x128x128x2 | 126.75 | 96.38 | 11.232 | 40.562 | mIoU: 0.3674(FLOAT)/0.3693(INT8) | Nuscenes |
| Horizon_swin_transformer | 1x3x224x224 | 8.98 | 308.77 | 3.549 | 0.032 | Top1: 0.8024(FLOAT)/0.7959(INT8) | ImageNet |
| Mixvargenet | 1x3x224x224 | 2.07 | 4742.70 | 0.467 | 0.033 | Top1: 0.7075(FLOAT)/0.7049(INT8) | ImageNet |
| Vargnetv2 | 1x3x224x224 | 0.72 | 4423.60 | 0.520 | 0.033 | Top1: 0.7342(FLOAT)/0.7326(INT8) | ImageNet |
| Vit_small | 1x3x224x224 | 9.20 | 568.63 | 2.054 | 0.032 | Top1: 0.7950(FLOAT)/0.7937(INT8) | ImageNet |
| Centerpoint_pointpillar | points: 300000x5 voxel_feature: 1x5x20x40000 coors: 40000x4 | 127.73 | 125.15 | 15.344 | 12.889 | NDS: 0.5832(FLOAT)/0.5816(INT8) mAP: 0.4804(FLOAT)/0.4784(INT8) | Nuscenes |
| Detr_efficientnetb3 | 1x3x800x1333 | 67.39 | 51.95 | 19.619 | 0.351 | [IoU=0.50:0.95]= 0.3720(FLOAT)/0.3600(INT8) | MS COCO |
| Detr_resnet50 | 1x3x800x1333 | 203.07 | 40.20 | 25.334 | 0.345 | [IoU=0.50:0.95]= 0.3569(FLOAT)/0.3160(INT8) | MS COCO |
| FCOS3D_efficientnetb0 | 1x3x512x896 | 19.94 | 448.94 | 2.987 | 2.723 | NDS: 0.3061(FLOAT)/0.3030(INT8) mAP: 0.2133(FLOAT)/0.2069(INT8) | nuscenes |
| Fcos_efficientnetb0 | 1x3x512x512 | 5.02 | 1079.10 | 1.513 | 0.053 | [IoU=0.50:0.95]= 0.3626(FLOAT)/0.3553(INT8) | MS COCO |
| Ganet_mixvargenet | 1x3x320x800 | 10.74 | 1574.50 | 0.950 | 0.206 | F1Score: 0.7949(FLOAT)/0.7883(INT8) | CuLane |
| Keypoint_efficientnetb0 | 1x3x128x128 | 0.45 | 4533.80 | 0.500 | 0.071 | PCK(alpha=0.1): 0.9433(FLOAT)/0.9432(INT8) | Carfusion |
| Pointpillars_kitti_car | 150000x4 | 66.82 | 147.90 | 30.921 | 0.431 | APDet= 0.7731(FLOAT)/0.7678(INT8) | Kitti3d |
| Deformable_detr_resnet50 | 1x3x800x1333 | 408.94 | 5.38 | 186.460 | 15.689 | [IoU=0.50:0.95]= 0.4413(FLOAT)/0.4194(INT8) | MS COCO |
| Stereonetplus_mixvargenet | 2x3x544x960 | 48.57 | 223.90 | 4.863 | 1.874 | EPE: 1.1270(FLOAT)/1.1341(INT8) | SceneFlow |
| Centerpoint_mixvargnet_multitask | points: 300000x5 voxel_feature: 1x5x20x40000 coors: 40000x4 | 51.45 | 186.72 | 13.003 | 11.736 | NDS: 0.5809(FLOAT)/0.5753(INT8) MeanIOU: 0.9128(FLOAT)/0.9121(INT8) mAP: 0.4727(FLOAT)/0.4626(INT8) | Nuscenes |
| Unet_mobilenetv1 | 1x3x1024x2048 | 7.36 | 811.04 | 1.676 | 0.116 | mIoU: 0.6802(FLOAT)/0.6758(INT8) | Cityscapes |
| Motr_efficientnetb3 | image: 1x800x1422x3 track_query: 1x2x128x156 ref_points: 1x2x128x4 mask_query: 1x1x256x1 | 64.43 | 73.73 | 13.737 | 4.958 | MOTA: 0.5798(FLOAT)/0.5762(INT8) | Mot17 |
| Densetnt_vectornet | goals_2d: 30x1x2048x2 goals_2d_mask: 30x1x2048x1 instance_mask: 30x1x96x1 lane_feat: 30x9x64x11 traj_feat: 30x19x32x9 | 12.50 | 149.20 | 7.543 | 2.248 | minFDA: 1.2975(FLOAT)/1.3023(INT8) | Argoverse 1 |
| Maptroe_henet_tinym_bevformer | img: 6x3x480x800 osm_mask: 1x1x50x100 queries_rebatch_grid: 6x20x100x2 restore_bev_grid: 1x100x100x2 reference_points_rebatch: 6x2000x4x2 bev_pillar_counts: 1x5000x1 | 134.57 | 75.21 | 13.826 | 0.258 | mAP: 0.6633(FLOAT)/0.6565(INT8) | Nuscenes |
| Qcnet_oe | valid_mask: 1x30x10 valid_mask_a2a: 1x10x30x30 agent_type: 1x30x1 x_a_cur: 1x1x30x1,1x1x30x1,1x1x30x1,1x1x30x1 r_pl2a_cur: 1x1x30x80,1x1x30x80,1x1x30x80 r_t_cur: 1x1x30x6,1x1x30x6,1x1x30x6,1x1x30x6 r_a2a_cur: 1x1x30x30,1x1x30x30,1x1x30x30 x_a_mid_emb: 1x30x2x128 x_a: 1x30x6x128 pl_type,is_intersection: 1x80 r_pl2pl: 1x1x80x80,1x1x80x80,1x1x80x80 r_pt2pl: 1x1x80x50,1x1x80x50,1x1x80x50 mask_pl2pl: 1x80x80 magnitude,pt_type,side,mask: 1x80x50 mask_a2m: 1x30x30 mask_dst: 1x30x1 type_pl2pl: 1x80x80 | 7.85 | 234.88 | 5.384 | 0.897 | hitrate: 0.8026(FLOAT)/0.7953(INT8) | Argoverse 2 |