On the NVIDIA RTX 5070 Ti with TensorRT FP16, D-FINE-X leads accuracy at 61.5 mAP@50-95 and runs 73.0 FPS. The fastest measured model is RT-DETRv2-R34 at 132.9 FPS (53.2 mAP). Every model in this table clears real time on this card, so accuracy is the deciding axis, not speed. mAP is shown here in percent form.
Every row below is TensorRT FP16 on the RTX 5070 Ti. On a desktop GPU this fast, the speed table stops being the constraint: even the largest detectors run well above 30 FPS. That makes the accuracy table the one that matters, with the speed table only ruling out the few models you cannot afford at your target frame rate.
| # | Model | mAP@50-95 | FPS | ms/image | Params (M) | License |
|---|---|---|---|---|---|---|
| 1 | D-FINE-X | 6148.0 | 73.0 | 13.70 | 62.0 | Apache-2.0 |
| 2 | ECDet-X | 6103.0 | 41.3 | 24.22 | 49.0 | Apache-2.0 |
| 3 | DEIMv2-X | 6077.0 | 37.9 | 26.40 | 50.3 | Apache-2.0 |
| 4 | D-FINE-L | 6002.0 | 69.3 | 14.42 | 31.0 | Apache-2.0 |
| 5 | ECDet-L | 6000.0 | 41.7 | 24.00 | 31.0 | Apache-2.0 |
| 6 | RT-DETRv4-X | 5995.0 | 71.0 | 14.07 | 62.0 | Apache-2.0 |
| 7 | DEIM-X | 5950.0 | 70.7 | 14.15 | 62.0 | Apache-2.0 |
| 8 | RF-DETR-L | 5861.0 | 47.6 | 21.01 | 128.0 | Apache-2.0 |
| 9 | DEIMv2-L | 5851.0 | 38.7 | 25.86 | 32.2 | Apache-2.0 |
| 10 | ECDet-M | 5823.0 | 48.9 | 20.46 | 18.0 | Apache-2.0 |
| 11 | RT-DETR-X | 5785.0 | 82.3 | 12.15 | 67.0 | Apache-2.0 |
| 12 | RT-DETRv4-L | 5774.0 | 78.0 | 12.83 | 31.0 | Apache-2.0 |
| 13 | D-FINE-M | 5759.0 | 74.5 | 13.42 | 19.0 | Apache-2.0 |
| 14 | RF-DETR-M | 5737.0 | 62.9 | 15.91 | 33.7 | Apache-2.0 |
| 15 | YOLOv9-C | 5720.0 | 74.7 | 13.39 | 25.5 | MIT |
The accuracy-speed frontier
The measured frontier is short: RT-DETRv2-R34, RT-DETR-L, RT-DETR-R101, RT-DETRv2-R101, RT-DETR-X, D-FINE-X. It spans 53.2 to 61.5 mAP. The small detectors do not earn a spot on this GPU: they save little wall-clock time over these models, so a larger model matches their frame rate at higher accuracy.
| # | Model | mAP@50-95 | FPS | ms/image | Params (M) | License |
|---|---|---|---|---|---|---|
| 1 | RT-DETRv2-R34 | 5319.0 | 132.9 | 7.52 | 31.0 | Apache-2.0 |
| 2 | DEIMv2-Atto | 2579.0 | 132.1 | 7.57 | 0.5 | Apache-2.0 |
| 3 | RT-DETR-L | 5579.0 | 122.0 | 8.20 | 32.0 | Apache-2.0 |
| 4 | RT-DETRv2-R18 | 5076.0 | 117.8 | 8.49 | 20.0 | Apache-2.0 |
| 5 | RT-DETR-R18 | 4975.0 | 113.4 | 8.82 | 20.0 | Apache-2.0 |
| 6 | RT-DETRv2-R50m | 5474.0 | 113.0 | 8.85 | 36.0 | Apache-2.0 |
| 7 | RT-DETR-R50m | 5385.0 | 110.1 | 9.08 | 36.6 | Apache-2.0 |
| 8 | RT-DETR-R34 | 5226.0 | 108.4 | 9.22 | 31.0 | Apache-2.0 |
| 9 | RT-DETRv2-R50 | 5567.0 | 103.6 | 9.65 | 42.0 | Apache-2.0 |
| 10 | YOLOX-Nano | 2873.0 | 100.0 | 9.99 | 0.9 | Apache-2.0 |
Picks by latency budget
Under 10 ms per image, RT-DETR-L is the most accurate fit at 55.8 mAP (8.2 ms). Give it a 33 ms budget and D-FINE-X reaches the top of the table at 61.5 mAP (13.7 ms); a 100 ms budget does not change that pick. On this card even the accuracy leader fits inside a single 33 ms frame.
Every number on this page comes from the verified dataset: same 500-image COCO val2017 slice, conf 0.001, IoU 0.6, max 300 detections, pycocotools mAP, identical protocol across all hardware and runtimes. The full protocol is on the methodology page. To rerun this comparison with your own filters, open compare. Accuracy is measured on LibreYOLO retrained checkpoints; other weight sources can yield different values.
