Skip to content

support YOLO-World [Ultralytics version]#682

Open
ZhengQuancheng wants to merge 1 commit intomarcoslucianops:masterfrom
ZhengQuancheng:master
Open

support YOLO-World [Ultralytics version]#682
ZhengQuancheng wants to merge 1 commit intomarcoslucianops:masterfrom
ZhengQuancheng:master

Conversation

@ZhengQuancheng
Copy link

@ZhengQuancheng ZhengQuancheng commented Dec 29, 2025

Description

This PR adds support for YOLO-World models to DeepStream.

YOLO-World is an open-vocabulary detection model that can detect objects based on text descriptions. Unlike traditional YOLO models, YOLO-World uses CLIP text embeddings to enable flexible class definition without retraining.

Features

  • Support for all YOLO-World model variants (yolov8s/m/l/x-worldv2)
  • Custom class support via --custom-classes parameter (e.g., --custom-classes "person, car, dog")
  • Default COCO 80 classes when no custom classes specified
  • CLIP text embeddings are pre-computed and baked into the ONNX model for efficient inference

Files Added

  • utils/export_yoloworld.py - Export script for YOLO-World models
  • docs/YOLOWorld.md - Documentation for YOLO-World usage
  • config_infer_primary_yoloworld.txt - DeepStream inference configuration
  • README.md - Update content related to YOLO-World

Usage

# Export with default COCO 80 classes
python3 export_yoloworld.py -w yolov8s-worldv2.pt --dynamic --simplify
# Export with custom classes
python3 export_yoloworld.py -w yolov8s-worldv2.pt --custom-classes "person, car, dog" --dynamic --simplify

Testing

  • Tested on DeepStream 8.0 with CUDA 12.8
  • Successfully exported ONNX model and generated TensorRT engine
  • Verified inference pipeline with sample video
image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant