Verdict

At matched compute on an RTX 5070 Ti, D-FINE is the more accurate family by a wide margin: it averages 6.4 mAP points higher and wins all 3 nearest-parameter pairs. RT-DETR answers on speed, running 33.8% faster on average in PyTorch. The choice is accuracy against throughput.

D-FINE fields 5 measured variants; RT-DETR fields 7. Both run the same COCO val2017 protocol at 640 px on an RTX 5070 Ti, so the two ladders align by parameter count.

ModelFamilyParams (M)mAP@50-95FPS
D-FINE-ND-FINE3.84575.032.5
D-FINE-SD-FINE10.35339.034.2
D-FINE-MD-FINE19.65782.028.8
RT-DETR-R18RT-DETR20.24979.047.2
D-FINE-LD-FINE31.25996.021.9
RT-DETR-R34RT-DETR31.45223.040.2
RT-DETR-LRT-DETR32.95577.027.7
RT-DETR-R50mRT-DETR36.65382.035.3
RT-DETR-R50RT-DETR42.95588.029.8
D-FINE-XD-FINE62.66143.018.1
RT-DETR-XRT-DETR67.45794.021.8
RT-DETR-R101RT-DETR76.65677.023.6
D-FINE and RT-DETR variant ladders interleaved by parameter count on NVIDIA RTX 5070 Ti, PyTorch FP32, batch 1. mAP in percent form. The family column shows where each frontier sits at every size.
Live chartverified data
Accuracy vs parameters on COCO val2017. Every D-FINE and RT-DETR variant highlighted against the full set.

Accuracy at matched compute

mAP is shown in percent. Pairing by nearest parameter count, D-FINE wins all 3 pairs and averages 6.4 mAP points higher. At the top end D-FINE-X reaches 61.4 mAP against RT-DETR-R101 at 56.8. This is the largest accuracy gap in this set of family comparisons.

Speed

RT-DETR is the faster family: 33.8% faster on average in PyTorch FP32. Its smallest measured variant, RT-DETR-R18, runs 47.2 FPS against D-FINE-N at 32.5.

Licensing
D-FINE
Apache-2.0 (permissive)
RT-DETR
Apache-2.0 (permissive)
Commercial use
Both families ship under Apache-2.0

Which family to pick

Pick D-FINE when accuracy leads: it is more accurate at every matched pair and reaches 61.4 mAP at the top. Pick RT-DETR when throughput is the constraint, since it runs faster at matched compute and fields more variants, 7 to 5. Both ship under Apache-2.0. See the per-variant specs on the D-FINE and RT-DETR model pages.

Every number on this page comes from the verified dataset: same 500-image COCO val2017 slice, conf 0.001, IoU 0.6, max 300 detections, pycocotools mAP, identical protocol across all hardware and runtimes. The full protocol is on the methodology page. To rerun this comparison with your own filters, open compare. Accuracy is measured on LibreYOLO retrained checkpoints; other weight sources can yield different values.