Skip to content

One-step-off-policy does not support IPv6 in distributed training #4771

@lzxdjb

Description

@lzxdjb

System Info

I am using the one-step-off-policy method for multi-machine training, but I encounter the following error during execution:

Image

After investigating, I traced the error to the following code path:

File: ./recipe/one_step_off_policy/distributed_util.py

Line: 61
Image

At this line, the code calls a utility function from vLLM:
File: ./vllm/distributed/utils.py

Image

However, the implementation in vllm/distributed/utils only supports IPv4 addresses. When the training environment uses IPv6, this results in a failure during distributed initialization.

Therefore, I guess the one-step-off-policy distributed training pipeline currently does not support IPv6?

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions