Differences of v0.3 docker images and connection refused from API call #1255
                  
                    
                      lililolo0927
                    
                  
                
                  started this conversation in
                General
              
            Replies: 0 comments
  
    Sign up for free
    to join this conversation on GitHub.
    Already have an account?
    Sign in to comment
  
        
    
Uh oh!
There was an error while loading. Please reload this page.
-
In docker hub for KTransformers, I've found several kinds of tags for the new version(v0.3) images such as v0.3-AVX2, lastest-AVX2, v0.3-NATIVE, latest-NATIVE, v0.3-FANCY, lastest-FANCY, v0.3-AVX512, latest-AVX512 .
What exactly is the difference between these tags?
I'm not sure which image I should download.
Currently, I downloaded
approachingai/ktransformers:v0.3-AVX2and tried running the Qwen3-30B-A3B model.There are no errors, but I can't access through the API.
When I run
curl -X 'GET' 'http://localhost:10002/api/tags', I get a "connection refused" response.I've confirmed that all the necessary ports inside the container are open, but API access still doesn't work.
The command I used is as follows:
In addition , I tried the option
--optimize_config_path ktransformers/optimize/optimize_rules/Qwen3Moe-serve-amx.yamlbut it returned me an error even though my machine support AMX.I would appreciate your opinions on this.
Thanks in advance.
Beta Was this translation helpful? Give feedback.
All reactions