ECDet-S

transformer detector with ECViT (compact ViT) backbone

Paper Code

Parameters9.9M

GFLOPs26.0

Input Size640px

Best mAP54.4%

LicenseApache-2.0

Architecture

Type

transformer

Backbone

ECViT (compact ViT)

Neck

HybridEncoder

Head

DETR

Benchmark Results

Performance on COCO val2017 across different hardware configurations

Hardware	Runtime	mAP@50-95	FPS	Latency	VRAM
NVIDIA Jetson Orin Nano Super 8GB	ONNX Runtime FP32	54.3%	5.2	192.3ms	-
NVIDIA Jetson Orin Nano Super 8GB	PyTorch FP32	54.3%	6.0	168.2ms	102 MB
NVIDIA Jetson Orin Nano Super 8GB	TensorRT FP16	54.3%	10.6	94.8ms	-
NVIDIA Jetson Orin Nano Super 8GB	TensorRT FP32	54.4%	10.6	94.8ms	-
NVIDIA RTX 5070 Ti	ONNX Runtime FP32	54.3%	42.8	23.4ms	-
NVIDIA RTX 5070 Ti	PyTorch FP32	54.4%	27.5	36.4ms	103 MB
NVIDIA RTX 5070 Ti	TensorRT FP16	54.4%	43.5	23.0ms	-
NVIDIA RTX 5070 Ti	TensorRT FP32	54.4%	63.4	15.8ms	-

Speed Breakdown(NVIDIA Jetson Orin Nano Super 8GB)

16.3ms

148.9ms

3.0ms

Preprocess

Inference

Postprocess (NMS)

Usage with LibreYOLO

from libreyolo import LibreYOLO

# Load model (auto-downloads from HuggingFace if not found locally)
model = LibreYOLO("LibreECs.pt")

# Run inference
result = model("image.jpg", conf=0.25, iou=0.45)

# Process results
print(f"Found {len(result)} objects")
print(result.boxes.xyxy)   # bounding boxes (N, 4)
print(result.boxes.conf)   # confidence scores (N,)
print(result.boxes.cls)    # class IDs (N,)

detrnms-freePaper: 51.7% mAP

Related Models (ec)

ECDet-M ECDet-L ECDet-X EC-Seg-S EC-Seg-M EC-Seg-L EC-Seg-X