feat: support LongCat-Image-Edit on cuda device. by Dragonliu2018 · Pull Request #957 · jd-opensource/xllm

Dragonliu2018 · 2026-02-27T17:00:25Z

This PR supports LongCat-Image-Edit on cuda device.

The test program and the generated image are as follows.

OS: Ubuntu22.04
device: NVIDIA A800-SXM4-40GB
model: LongCat-Image-Edit

import requests
import json
import base64
from PIL import Image
from io import BytesIO

# Test prompt for image generation
url = "http://localhost:9977/v1/image/generation"

img = Image.open("cat.png").convert("RGB")
buf = BytesIO()
img.save(buf, format="PNG")        # 和服务端 OpenCV 解码兼容
img_bytes = buf.getvalue()
image_base64 = base64.b64encode(img_bytes).decode("utf-8")

prompt = "将猫变成狗"
request_data = {
    "model": "LongCat-Image-Edit",
    "input": {
        "prompt": prompt,
        "negative_prompt": "",
        "image": image_base64
    },
    "parameters": {
        "guidance_scale": 1,
        "num_inference_steps": 8,
        "num_images_per_prompt": 1,
        "seed":43
    }
}

print("Testing LongCat-Image-Edit model...")
print(f"Request URL: {url}")
print(f"Request data: {json.dumps(request_data, indent=2, ensure_ascii=False)}")

response = requests.post(url, json=request_data)
if response.status_code != 200:
    print(f"Error: {response.status_code}")
    print(f"Response: {response.text}")
else:
    try:
        result = json.loads(response.text)
        print("Success! Response:")
        print(json.dumps(result, indent=2, ensure_ascii=False))
        
        # Handle image response
        if "output" in result and "results" in result["output"]:
            for i, image_data in enumerate(result["output"]["results"]):
                if "image" in image_data:
                    # Decode base64 image
                    image_bytes = base64.b64decode(image_data["image"])
                    image = Image.open(BytesIO(image_bytes))
                    
                    # Save image
                    filename = f"edited_image_{i+1}.png"
                    image.save(filename)
                    print(f"\nGenerated image saved as: {filename}")
                    print(f"Image size: {image_data.get('width', 'unknown')}x{image_data.get('height', 'unknown')}")
                    print(f"Seed: {image_data.get('seed', 'unknown')}")
    except json.JSONDecodeError as e:
        print(f"Failed to parse JSON response: {e}")
        print(f"Raw response: {response.text}")

Input image:

Output image:

gemini-code-assist

Code Review

This pull request adds support for the LongCat-Image-Edit model on CUDA devices. The changes include a new pipeline implementation, modifications to the model loader to handle preprocessor configurations, and several improvements and bug fixes in the attention and rotary embedding kernels. The implementation of the new pipeline is comprehensive. I've identified one issue in the model loader's error handling for the preprocessor configuration that could lead to silent failures.

xllm/core/framework/dit_model_loader.cpp

xllm/core/kernels/ops_api.cpp

xllm/core/platform/vmm_torch_allocator.h

Dragonliu2018 requested review from DongheJin, JimHsiung, RobbieLeung, XuZhang99, liutongxuan, walsonyang and yq33victor as code owners February 27, 2026 17:00

gemini-code-assist bot reviewed Feb 27, 2026

View reviewed changes

xllm/core/framework/dit_model_loader.cpp Show resolved Hide resolved

XuZhang99 reviewed Feb 28, 2026

View reviewed changes

xllm/core/kernels/ops_api.cpp Outdated Show resolved Hide resolved

xllm/core/platform/vmm_torch_allocator.h Show resolved Hide resolved

Dragonliu2018 force-pushed the lzl/feat/support_longcat_image_edit_on_cuda branch from 153bb27 to a72b41a Compare February 28, 2026 04:37

feat: support LongCat-Image-Edit on cuda device.

941c362

Dragonliu2018 force-pushed the lzl/feat/support_longcat_image_edit_on_cuda branch from a72b41a to 941c362 Compare February 28, 2026 08:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: support LongCat-Image-Edit on cuda device.#957

feat: support LongCat-Image-Edit on cuda device.#957
Dragonliu2018 wants to merge 1 commit intojd-opensource:mainfrom
Dragonliu2018:lzl/feat/support_longcat_image_edit_on_cuda

Dragonliu2018 commented Feb 27, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Dragonliu2018 commented Feb 27, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants