Skip to content

Commit 25d50a2

Browse files
authored
Merge pull request #374 from flox/troubleshooting
feat: add installation troubleshooting steps
2 parents cba60d5 + 2174f11 commit 25d50a2

File tree

3 files changed

+54
-2
lines changed

3 files changed

+54
-2
lines changed

docs/k8s/install/self-managed.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -84,7 +84,7 @@ Check the `version` line at the beginning of the file and use the matching confi
8484
runtime_type = "io.containerd.runc.v2"
8585
pod_annotations = [ "flox.dev/*" ]
8686
container_annotations = [ "flox.dev/*" ]
87-
[plugins."io.containerd.grpc.v1.cri".containerd.runtimes.flox.options]
87+
[plugins."io.containerd.grpc.v1.cri".containerd.runtimes.flox.options]
8888
SystemdCgroup = true
8989
```
9090

@@ -94,7 +94,7 @@ Check the `version` line at the beginning of the file and use the matching confi
9494
runtime_type = "io.containerd.runc.v2"
9595
pod_annotations = [ "flox.dev/*" ]
9696
container_annotations = [ "flox.dev/*" ]
97-
[plugins."io.containerd.cri.v1.runtime".containerd.runtimes.flox.options]
97+
[plugins."io.containerd.cri.v1.runtime".containerd.runtimes.flox.options]
9898
SystemdCgroup = true
9999
```
100100

Lines changed: 51 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,51 @@
1+
---
2+
title: "Troubleshooting"
3+
description: "Troubleshooting Imageless Kubernetes installation"
4+
---
5+
6+
This guide describes possible issues and solutions that may arise during the installation of Imageless Kubernetes.
7+
8+
## Pods stuck in `ContainerCreating`
9+
10+
If your pods are stuck in the `ContainerCreating` state with a message like `no runtime for "flox" is configured`, the shim installation may have been disrupted or failed.
11+
12+
### Configuration conflicts
13+
14+
The Flox additions to `/etc/containerd/config.toml` may be getting overridden by competing entries in an imported configuration file.
15+
16+
Verify the Flox runtime configuration is present in the active containerd config:
17+
18+
1. Dump the active containerd configuration and verify the Flox runtime is present:
19+
20+
```bash
21+
containerd config dump | grep -A 10 "flox"
22+
```
23+
24+
2. Check if `containerd config dump` has an `imports` section that might be loading other configuration files.
25+
26+
3. Confirm the relevant sections exist in the output:
27+
28+
```toml
29+
[plugins."io.containerd.grpc.v1.cri".containerd.runtimes.flox]
30+
runtime_path = "/usr/local/bin/containerd-shim-flox-v2"
31+
runtime_type = "io.containerd.runc.v2"
32+
pod_annotations = [ "flox.dev/*" ]
33+
container_annotations = [ "flox.dev/*" ]
34+
[plugins."io.containerd.grpc.v1.cri".containerd.runtimes.flox.options]
35+
SystemdCgroup = true
36+
```
37+
38+
4. If the configuration is missing or incorrect, and an `imports` section is present, you may need to manually add the Flox runtime section to the competing imported file and restart `containerd`.
39+
40+
!!! note "Note"
41+
This scenario is more likely if the NVIDIA Container Toolkit is installed on the same node as Imageless Kubernetes.
42+
43+
### EKS node shim installation failure
44+
45+
The Flox `containerd` shim may have failed to install properly on the EC2 instance during setup.
46+
47+
Check the system logs in the EC2 console to identify any errors:
48+
49+
1. Navigate to the EC2 console and select your instance.
50+
2. Click **Actions****Monitor and troubleshoot****Get system log**.
51+
3. Review the logs for any errors related to containerd or the Flox shim installation.

mkdocs.yml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -57,6 +57,7 @@ nav:
5757
- Installation:
5858
- Amazon EKS: k8s/install/eks.md
5959
- Self-managed: k8s/install/self-managed.md
60+
- Troubleshooting: k8s/install/troubleshooting.md
6061
- Upgrading: k8s/install/upgrading.md
6162
- Uninstall: k8s/install/uninstall.md
6263
- Examples:

0 commit comments

Comments
 (0)