Skip to content

Error executing job with overrides, please help #56

@wondrxin

Description

@wondrxin

And remember to star the repo on GitHub! --> https://github.com/gensyn-ai/rl-swarm
wandb: Tracking run with wandb version 0.21.1
wandb: W&B syncing is set to offline in this directory. Run wandb online or set WANDB_MODE=online to enable cloud syncing.
wandb: Run data is saved locally in ./logs/wandb/offline-run-20250827_150447-dvq6tlnf
[2025-08-27 15:04:47,672][genrl.logging_utils.global_defs][INFO] - Reasoning Gym Data Manager initialized with config: rgym_exp/src/datasets.yaml
[2025-08-27 15:04:47,672][genrl.logging_utils.global_defs][INFO] - Loaded composite dataset with 1000 samples
[2025-08-27 15:04:47,672][genrl.logging_utils.global_defs][INFO] - Train samples: 2, Eval samples: 0
[2025-08-27 15:04:47,672][genrl.logging_utils.global_defs][INFO] - Dataset weights: arc_1d: 1, basic_arithmetic: 1, base_conversion: 1, bf: 1, binary_matrix: 1, calendar_arithmetic: 1, decimal_arithmetic: 1, fraction_simplification: 1, propositional_logic: 1
Aug 27 15:04:47.769 [INFO] Checking that identity from /Users/wangxing/Desktop/project/rl-swarm/swarm.pem is not used by other peers
Error executing job with overrides: []
Traceback (most recent call last):
File "/Users/wangxing/Desktop/project/rl-swarm/.venv/lib/python3.11/site-packages/hydra/_internal/instantiate/_instantiate2.py", line 92, in _call_target
return target(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^
File "/Users/wangxing/Desktop/project/rl-swarm/.venv/lib/python3.11/site-packages/genrl/communication/hivemind/hivemind_backend.py", line 84, in init
self.dht = DHT(
^^^^
File "/Users/wangxing/Desktop/project/rl-swarm/.venv/lib/python3.11/site-packages/hivemind/dht/dht.py", line 87, in init
self.run_in_background(await_ready=await_ready)
File "/Users/wangxing/Desktop/project/rl-swarm/.venv/lib/python3.11/site-packages/hivemind/dht/dht.py", line 148, in run_in_background
self.wait_until_ready(timeout)
File "/Users/wangxing/Desktop/project/rl-swarm/.venv/lib/python3.11/site-packages/hivemind/dht/dht.py", line 151, in wait_until_ready
self._ready.result(timeout=timeout)
File "/Users/wangxing/Desktop/project/rl-swarm/.venv/lib/python3.11/site-packages/hivemind/utils/mpfuture.py", line 254, in result
return super().result(timeout)
^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/homebrew/Cellar/python@3.11/3.11.13/Frameworks/Python.framework/Versions/3.11/lib/python3.11/concurrent/futures/_base.py", line 456, in result
return self.__get_result()
^^^^^^^^^^^^^^^^^^^
File "/opt/homebrew/Cellar/python@3.11/3.11.13/Frameworks/Python.framework/Versions/3.11/lib/python3.11/concurrent/futures/_base.py", line 401, in __get_result
raise self._exception
hivemind.p2p.p2p_daemon_bindings.utils.P2PDaemonError: Daemon failed to start: 2025/08/27 15:05:02 failed to connect to bootstrap peers

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
File "", line 198, in _run_module_as_main
File "", line 88, in _run_code
File "/Users/wangxing/Desktop/project/rl-swarm/rgym_exp/runner/swarm_launcher.py", line 29, in
main()
File "/Users/wangxing/Desktop/project/rl-swarm/.venv/lib/python3.11/site-packages/hydra/main.py", line 94, in decorated_main
_run_hydra(
File "/Users/wangxing/Desktop/project/rl-swarm/.venv/lib/python3.11/site-packages/hydra/_internal/utils.py", line 394, in _run_hydra
_run_app(
File "/Users/wangxing/Desktop/project/rl-swarm/.venv/lib/python3.11/site-packages/hydra/_internal/utils.py", line 457, in _run_app
run_and_report(
File "/Users/wangxing/Desktop/project/rl-swarm/.venv/lib/python3.11/site-packages/hydra/_internal/utils.py", line 223, in run_and_report
raise ex
File "/Users/wangxing/Desktop/project/rl-swarm/.venv/lib/python3.11/site-packages/hydra/_internal/utils.py", line 220, in run_and_report
return func()
^^^^^^
File "/Users/wangxing/Desktop/project/rl-swarm/.venv/lib/python3.11/site-packages/hydra/_internal/utils.py", line 458, in
lambda: hydra.run(
^^^^^^^^^^
File "/Users/wangxing/Desktop/project/rl-swarm/.venv/lib/python3.11/site-packages/hydra/_internal/hydra.py", line 132, in run
_ = ret.return_value
^^^^^^^^^^^^^^^^
File "/Users/wangxing/Desktop/project/rl-swarm/.venv/lib/python3.11/site-packages/hydra/core/utils.py", line 260, in return_value
raise self._return_value
File "/Users/wangxing/Desktop/project/rl-swarm/.venv/lib/python3.11/site-packages/hydra/core/utils.py", line 186, in run_job
ret.return_value = task_function(task_cfg)
^^^^^^^^^^^^^^^^^^^^^^^
File "/Users/wangxing/Desktop/project/rl-swarm/rgym_exp/runner/swarm_launcher.py", line 22, in main
game_manager = instantiate(cfg.game_manager)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/Users/wangxing/Desktop/project/rl-swarm/.venv/lib/python3.11/site-packages/hydra/_internal/instantiate/_instantiate2.py", line 226, in instantiate
return instantiate_node(
^^^^^^^^^^^^^^^^^
File "/Users/wangxing/Desktop/project/rl-swarm/.venv/lib/python3.11/site-packages/hydra/_internal/instantiate/_instantiate2.py", line 342, in instantiate_node
value = instantiate_node(
^^^^^^^^^^^^^^^^^
File "/Users/wangxing/Desktop/project/rl-swarm/.venv/lib/python3.11/site-packages/hydra/_internal/instantiate/_instantiate2.py", line 347, in instantiate_node
return _call_target(target, partial, args, kwargs, full_key)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/Users/wangxing/Desktop/project/rl-swarm/.venv/lib/python3.11/site-packages/hydra/_internal/instantiate/_instantiate2.py", line 97, in _call_target
raise InstantiationException(msg) from e
hydra.errors.InstantiationException: Error in call to target 'genrl.communication.hivemind.hivemind_backend.HivemindBackend':
P2PDaemonError('Daemon failed to start: 2025/08/27 15:05:02 failed to connect to bootstrap peers')
full_key: game_manager.communication
wandb:
wandb: You can sync this run to the cloud by running:
wandb: wandb sync ./logs/wandb/offline-run-20250827_150447-dvq6tlnf

An error was detected while running rl-swarm. See /Users/wangxing/Desktop/project/rl-swarm/logs for full logs.
Shutting down trainer...
zsh: terminated ./run_rl_swarm.sh

swarm_launcher.log

yarn.log

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions