|
In my mind it is very unlikely that an update of the runner image causes the CI to break. But here is the diff between the image releases: link. You can check it to see whether you spot something interesting. |
I tried looking for something interesting there but couldn't find anything. I'm suspicious of the runner image for two reasons:
|
|
Can we print the filesystem tree or similar just before building? I am just looking for a way to debug this issue. |
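A minimal sketch of what such a debug step could look like, assuming we can run a small Python script in the workflow just before the build. The function name format_tree and the depth limit are made up for illustration; nothing here comes from action-ros-ci itself.

```python
import os


def format_tree(root, max_depth=2):
    """Return indented lines listing the contents of root, at most max_depth levels deep."""
    lines = []

    def walk(path, depth):
        if depth > max_depth:
            return
        try:
            entries = sorted(os.listdir(path))
        except OSError:
            return  # unreadable directory: skip it instead of failing the debug step
        for name in entries:
            full = os.path.join(path, name)
            # Do not descend into symlinked directories, to avoid loops.
            is_dir = os.path.isdir(full) and not os.path.islink(full)
            lines.append("  " * (depth - 1) + name + ("/" if is_dir else ""))
            if is_dir:
                walk(full, depth + 1)

    walk(root, 1)
    return lines
```

Calling print("\n".join(format_tree("."))) from the workspace root would dump the top two levels into the job log.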
|
@MatthijsBurgh GitHub Actions seems to be down for now. |
|
Yes, indeed 😢 |
|
@MatthijsBurgh I have no more ideas for things I can check for now. |
|
honestly? I had CI fail on one of my repos and deleting all the github actions caches and rerunning solved it for me. Try that? EDIT: oh, there are no caches here... |
There is only a cache from the pre-commit job. It shouldn't affect anything, but I cleared it anyway and nothing changed.
|
@MatthijsBurgh Any idea how to debug this further? |
|
Preferably we can print the state midway through the ros-ci-action. Are there any hooks we can define? |
We could just try running the commands manually, instead of using the action |
|
We can try checking the colcon logs. See here how to store the logs as artifacts |
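Once the logs are available locally, something like the sketch below could help narrow things down. It assumes the logs were downloaded into a directory of plain-text files; the helper name find_log_lines is hypothetical and not part of colcon.

```python
import os


def find_log_lines(log_dir, needle):
    """Return a list of (path, line_number, line) for every line under log_dir containing needle."""
    hits = []
    # os.walk does not follow symlinked directories by default,
    # so linked copies of a log directory are not scanned twice.
    for dirpath, _dirnames, filenames in os.walk(log_dir):
        for name in sorted(filenames):
            path = os.path.join(dirpath, name)
            try:
                with open(path, errors="replace") as fh:
                    for number, line in enumerate(fh, start=1):
                        if needle in line:
                            hits.append((path, number, line.rstrip("\n")))
            except OSError:
                continue  # skip files that cannot be read
    return hits
```

For example, find_log_lines("log", "ModuleNotFoundError") would list every log line that mentions the error, together with the file it came from.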
|
@bjsowa when running on the runner, the logs show a PYTHONPATH containing a backslash and a semicolon. These show that some part of the tooling, either colcon/ament or the action-ros-ci, thinks we are on Windows. I think this is something we should investigate. |
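To make that check repeatable, here is a rough heuristic sketch. It assumes that on a POSIX runner a PYTHONPATH should use ":" as the separator and contain no backslashes; the function names are hypothetical and the paths in the usage note are invented, not taken from the actual logs.

```python
import os


def looks_like_windows_path_list(value):
    """Heuristic: True if value contains ';' separators or backslashes, as Windows path lists do."""
    return ";" in value or "\\" in value


def check_pythonpath(value, os_name=os.name):
    """Return a warning string if a POSIX system has a Windows-style PYTHONPATH, else None."""
    if os_name == "posix" and looks_like_windows_path_list(value):
        return "PYTHONPATH looks Windows-style on a POSIX runner: " + value
    return None
```

Running check_pythonpath(os.environ.get("PYTHONPATH", "")) in a workflow step would return a warning for a value such as "/a/b;\\c\\d" and None for "/a/b:/c/d".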
|
@bjsowa I don't know what changed in December causing the issues. But the issue is But by overriding it in the CI config, we make it work for now. Might be fixed by colcon/colcon-powershell#42 |
|
Thank you @MatthijsBurgh for finding the culprit! Should we remove the other changes or leave it as it is? |
|
I kept these changes as I think both changes (bash as default shell; upload log artifacts) are useful. So you can merge if you agree. |
Co-authored-by: Matthijs van der Burgh <MatthijsBurgh@outlook.com>
chore: disable colcon-powershell (RobotWebTools#1143)
This is a hotfix for the CI failing due to a ModuleNotFoundError: No module named 'ament_package' error (see here for example). The workflow has been failing for about a month now, and I suspect it is due to some recent change to the GitHub runner images. I couldn't find any other solution than running the job in a container.
The other test is failing due to recent changes in rclpy that changed some executor behavior (which I authored 😅). It will be fixed in another PR later.