Replies: 7 comments 2 replies
-
got passed lscpu flag by installing py_cpuinfo then rain into triton flag so edited the name in all files I saw from 2.0.0 to 1.0.0 and now I'm stuck on building the async IO bc there's no libaio-dev version for windows that I can find. it's only on apt for now |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
was able to capture the code for async.io.py with all the things it's looking for while trying to install: Copyright 2020 The Microsoft DeepSpeed Team from .builder import OpBuilder class AsyncIOBuilder(OpBuilder):
|
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
forceWrite(self.choices[index], end) when trying to run Accelerate Config after enabling deepspeed option |
Beta Was this translation helpful? Give feedback.
-
Used WSL to generate a Stage 2 optimized cpu offload Accelerate config and copied it into my huggingface cache folder in windows where it's normally stored to get around it and was able to start training with deepspeed including sparse attention support both enabled. Still no NVMe offload bc I have no way to build the libaio-dev version it uses for it for windows but my RAM usage spiked the second it started training. |
Beta Was this translation helpful? Give feedback.
-
I still oomed bc I only have an rtx 3060 ti with 8gb vram and only 16gb total system RAM but it's a start... |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
To preface this I'm on Cuda 11.7, python 3.10.8, most recent dev build of xformers, accelerate 0.15.0, and I've gotten both to recognize that I have triton 2.0.0 installed thanks to the recent thread about somebody finally cracking it but I do not know how to get past the next hurdle because it's a command buried deep in deepspeed "lscpu" which is what linux uses to poll cpu information (cores, threads, etc) that does not have an equivalent in Windows... Need help from a wrinklier brain than mine to get this figure this out :) we're so close!
Beta Was this translation helpful? Give feedback.
All reactions