D-FINE-N

dfine

transformer detector with HGNetv2 backbone

Paper Code

Parameters4.0M

GFLOPs7.0

Input Size640px

Best mAP45.8%

LicenseApache-2.0

Architecture

Type

transformer

Backbone

HGNetv2

Neck

HybridEncoder

Head

FDR (Fine-grained Distribution Refinement)

Benchmark Results

Performance on COCO val2017 across different hardware configurations

Hardware	Runtime	mAP@50-95	FPS	Latency	VRAM
NVIDIA A100	PyTorch FP32	42.8%	25.6	39.1ms	68 MB
NVIDIA Jetson Orin Nano Super 8GB	ONNX Runtime FP32	45.8%	17.5	57.0ms	-
NVIDIA Jetson Orin Nano Super 8GB	PyTorch FP32	45.8%	11.7	85.5ms	64 MB
NVIDIA Jetson Orin Nano Super 8GB	TensorRT FP16	45.8%	42.1	23.8ms	-
NVIDIA Jetson Orin Nano Super 8GB	TensorRT FP32	45.8%	33.5	29.9ms	-
NVIDIA RTX 5070 Ti	ONNX Runtime FP32	45.8%	78.9	12.7ms	-
NVIDIA RTX 5070 Ti	PyTorch FP32	45.8%	32.5	30.7ms	70 MB
NVIDIA RTX 5070 Ti	TensorRT FP16	45.8%	79.7	12.5ms	-
NVIDIA RTX 5070 Ti	TensorRT FP32	45.8%	98.7	10.1ms	-

Speed Breakdown(NVIDIA A100)

6.1ms

21.3ms

11.6ms

Preprocess

Inference

Postprocess (NMS)

Usage with LibreYOLO

from libreyolo import LibreYOLO

# Load model (auto-downloads from HuggingFace if not found locally)
model = LibreYOLO("LibreDFINEn.pt")

# Run inference
result = model("image.jpg", conf=0.25, iou=0.45)

# Process results
print(f"Found {len(result)} objects")
print(result.boxes.xyxy)   # bounding boxes (N, 4)
print(result.boxes.conf)   # confidence scores (N,)
print(result.boxes.cls)    # class IDs (N,)

detrnms-freePaper: 42.8% mAP

Related Models (dfine)

D-FINE-S D-FINE-M D-FINE-L D-FINE-X