Vision Analysis

RT-DETRv4-M

rtdetrv4

transformer detector with HGNetv2 backbone

Parameters: 19.0M
GFLOPs: 57.0
Input Size: 640px
Best mAP: 53.6%
License: Apache-2.0

Architecture

Type: transformer
Backbone: HGNetv2
Neck: HybridEncoder
Head: DETR

Benchmark Results

Performance on COCO val2017 across different hardware configurations

Hardware             Runtime        mAP@50-95   FPS    Latency   VRAM
NVIDIA RTX 5070 Ti   PyTorch FP32   53.6%       35.1   28.5ms    188 MB
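
The FPS and latency figures describe the same measurement: throughput is the reciprocal of per-image latency. A minimal sketch of that conversion, assuming the reported latency is end-to-end for one image at a time (the function name is illustrative and not part of any library):

```python
def fps_from_latency_ms(latency_ms: float) -> float:
    """Convert per-image latency in milliseconds to frames per second."""
    if latency_ms <= 0:
        raise ValueError("latency must be positive")
    return 1000.0 / latency_ms


# Latency reported for the RTX 5070 Ti in the benchmark results.
fps = fps_from_latency_ms(28.5)
print(f"{fps:.1f} FPS")  # 35.1 FPS
```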

Speed Breakdown (NVIDIA RTX 5070 Ti)

Preprocess: 5.1ms
Inference: 22.5ms
Postprocess (NMS): 0.9ms
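
The three stage timings add up to the end-to-end latency in the benchmark results (5.1 + 22.5 + 0.9 = 28.5 ms). A short sketch that checks the sum and reports each stage's share of the total; the dictionary keys simply mirror the stage labels on this page:

```python
# Stage timings in milliseconds, copied from the speed breakdown.
stages_ms = {"Preprocess": 5.1, "Inference": 22.5, "Postprocess": 0.9}

total_ms = sum(stages_ms.values())
shares = {name: ms / total_ms for name, ms in stages_ms.items()}

for name, ms in stages_ms.items():
    print(f"{name:<12} {ms:>5.1f} ms  {shares[name]:6.1%}")
print(f"{'Total':<12} {total_ms:>5.1f} ms")
```

By this arithmetic the forward pass accounts for roughly 79% of the measured latency, and preprocessing for most of the remainder.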

Usage with LibreYOLO

from libreyolo import LIBREYOLO

# Load model (auto-downloads from HuggingFace if not found locally)
model = LIBREYOLO("librertdetrv4m.pth")

# Run inference
result = model("image.jpg", conf=0.25, iou=0.45)

# Process results
print(f"Found {len(result)} objects")
print(result.boxes.xyxy)   # bounding boxes (N, 4)
print(result.boxes.conf)   # confidence scores (N,)
print(result.boxes.cls)    # class IDs (N,)
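
A common next step is to keep only confident detections of certain classes. The sketch below works on plain Python lists with the same shapes as the fields printed above, (N, 4), (N,) and (N,). The sample values are made up and `filter_detections` and `box_area` are hypothetical helpers, not part of LibreYOLO. With real results, one option (an assumption about the field types) is to call `.tolist()` on each field first.

```python
def filter_detections(boxes, scores, class_ids, min_conf=0.5, keep_classes=None):
    """Return (box, score, class_id) tuples that pass the filters."""
    kept = []
    for box, score, class_id in zip(boxes, scores, class_ids):
        if score < min_conf:
            continue  # drop low-confidence detections
        if keep_classes is not None and int(class_id) not in keep_classes:
            continue  # drop classes we are not interested in
        kept.append((tuple(float(v) for v in box), float(score), int(class_id)))
    return kept


def box_area(box):
    """Area of an (x1, y1, x2, y2) box in square pixels."""
    x1, y1, x2, y2 = box
    return max(0.0, x2 - x1) * max(0.0, y2 - y1)


# Made-up stand-in data, not real model output.
boxes = [[10, 20, 110, 220], [50, 60, 90, 100], [0, 0, 30, 40]]
scores = [0.91, 0.42, 0.77]
class_ids = [0, 2, 0]

kept = filter_detections(boxes, scores, class_ids, min_conf=0.5, keep_classes={0})
for box, score, class_id in kept:
    print(class_id, f"{score:.2f}", box, box_area(box))
```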

detr, nms-free
Paper: 53.5% mAP

Related Models (rtdetrv4)