On the NVIDIA Jetson Orin Nano Super with TensorRT FP16, D-FINE-X leads accuracy at 61.4 mAP@50-95 and still clears real time at 14.2 FPS. If you need higher frame rates, DEIMv2-Atto reaches 89.1 FPS at 27.5 mAP. Unlike a bare CPU board, the Orin runs the accuracy leaders fast enough to use, so the choice is about how much frame rate you trade for mAP. mAP is shown here in percent form.
Every row below is TensorRT FP16 on the Orin Nano Super. FP16 keeps the largest detectors in real-time range, so the accuracy table and the speed table overlap far more than they do on a CPU target. Start from the accuracy ceiling, then check the throughput table to confirm the model clears your frame budget.
| # | Model | mAP@50-95 | FPS | ms/image | Params (M) | License |
|---|---|---|---|---|---|---|
| 1 | D-FINE-X | 6144.0 | 14.2 | 70.45 | 62.0 | Apache-2.0 |
| 2 | ECDet-X | 6098.0 | 5.7 | 174.81 | 49.0 | Apache-2.0 |
| 3 | DEIMv2-X | 6075.0 | 5.4 | 184.54 | 50.3 | Apache-2.0 |
| 4 | ECDet-L | 5999.0 | 6.2 | 161.52 | 31.0 | Apache-2.0 |
| 5 | D-FINE-L | 5998.0 | 18.6 | 53.64 | 31.0 | Apache-2.0 |
| 6 | RT-DETRv4-X | 5996.0 | 13.8 | 72.69 | 62.0 | Apache-2.0 |
| 7 | DEIM-X | 5960.0 | 14.1 | 70.93 | 62.0 | Apache-2.0 |
| 8 | RF-DETR-L | 5868.0 | 8.4 | 119.33 | 128.0 | Apache-2.0 |
| 9 | DEIMv2-L | 5852.0 | 6.2 | 160.81 | 32.2 | Apache-2.0 |
| 10 | ECDet-M | 5825.0 | 8.3 | 119.70 | 18.0 | Apache-2.0 |
| 11 | D-FINE-M | 5792.0 | 23.7 | 42.26 | 19.0 | Apache-2.0 |
| 12 | DEIM-L | 5786.0 | 18.5 | 53.96 | 31.0 | Apache-2.0 |
| 13 | RT-DETR-X | 5785.0 | 17.0 | 58.90 | 67.0 | Apache-2.0 |
| 14 | RT-DETRv4-L | 5782.0 | 18.1 | 55.15 | 31.0 | Apache-2.0 |
| 15 | RF-DETR-M | 5740.0 | 12.5 | 79.75 | 33.7 | Apache-2.0 |
The accuracy-speed frontier
The measured frontier runs from DEIMv2-Atto at 27.5 mAP up to D-FINE-X at 61.4 mAP. The D-FINE family holds most of the high end: DEIMv2-Atto, DEIMv2-Femto, YOLOX-Tiny, DEIMv2-Pico, D-FINE-N, DEIM-N, RT-DETRv2-R18, RT-DETRv2-R34, D-FINE-S, RT-DETRv2-R50m, YOLOv9-M, YOLOv9-C, D-FINE-M, D-FINE-L, D-FINE-X. Anything off this list is beaten on both accuracy and speed by a model on it.
| # | Model | mAP@50-95 | FPS | ms/image | Params (M) | License |
|---|---|---|---|---|---|---|
| 1 | DEIMv2-Atto | 2749.0 | 89.1 | 11.23 | 0.5 | Apache-2.0 |
| 2 | DEIMv2-Femto | 3451.0 | 65.7 | 15.23 | 1.0 | Apache-2.0 |
| 3 | YOLOX-Tiny | 3545.0 | 51.6 | 19.39 | 5.1 | Apache-2.0 |
| 4 | YOLOX-Nano | 2877.0 | 51.0 | 19.62 | 0.9 | Apache-2.0 |
| 5 | DEIMv2-Pico | 4227.0 | 44.5 | 22.50 | 1.5 | Apache-2.0 |
| 6 | D-FINE-N | 4579.0 | 42.1 | 23.77 | 4.0 | Apache-2.0 |
| 7 | DEIM-N | 4677.0 | 41.8 | 23.91 | 4.0 | Apache-2.0 |
| 8 | DEIMv2-N | 4664.0 | 41.2 | 24.25 | 3.6 | Apache-2.0 |
| 9 | RT-DETRv2-R18 | 5077.0 | 39.9 | 25.06 | 20.0 | Apache-2.0 |
| 10 | PicoDet-S | 3042.0 | 39.6 | 25.26 | 1.0 | Apache-2.0 |
Picks by latency budget
For 30 FPS work with a 33 ms budget, RT-DETRv2-R34 is the most accurate model that fits, at 53.2 mAP (30.59 ms). If you can spend up to 100 ms per image, D-FINE-X is reachable at its full 61.4 mAP (70.45 ms), which buys nearly 8 more mAP points over the 33 ms pick. Between the two, pick by whether your pipeline is frame-rate bound or accuracy bound.
Every number on this page comes from the verified dataset: same 500-image COCO val2017 slice, conf 0.001, IoU 0.6, max 300 detections, pycocotools mAP, identical protocol across all hardware and runtimes. The full protocol is on the methodology page. To rerun this comparison with your own filters, open compare. Accuracy is measured on LibreYOLO retrained checkpoints; other weight sources can yield different values.
