How to switch the language model to the Qwen 0.5B int8 quantized version? #2

@hakukohaku

Hello! Thank you very much for your work; it has been very helpful for my research. Int8 and int4 quantized versions of Qwen1.5 0.5B have been released. Can the language model in the current framework be replaced with one of these quantized versions? If so, how can this be done?
