Conversation
ngxson
left a comment
also wait for @CISC @ggerganov reviews for libllama changes
Revert, but don't touch
Perfect.
This comment was marked as off-topic.
Thanks. I will change rsplit to split in chat_template.json.
Don't, just use it properly instead, as I suggested here: https://huggingface.co/tencent/Youtu-LLM-2B/discussions/1
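For anyone following the rsplit/split exchange above, here is a minimal Python sketch of how the two string methods relate when a template only needs the text after the last delimiter. The linked suggestion is not reproduced in this thread, so this is not necessarily what it proposes; the `</think>` delimiter, the sample string, and both helper names are hypothetical and chosen only for illustration.

```python
def tail_with_rsplit(text: str, sep: str) -> str:
    # Split once from the right and keep the last piece.
    return text.rsplit(sep, 1)[-1]


def tail_with_split(text: str, sep: str) -> str:
    # Split on every occurrence and keep the last piece.
    # This yields the same tail without relying on rsplit.
    return text.split(sep)[-1]


# Hypothetical sample with the delimiter appearing more than once.
sample = "<think>a</think>b</think>final answer"

print(tail_with_rsplit(sample, "</think>"))
print(tail_with_split(sample, "</think>"))
```

Both helpers return the whole input unchanged when the delimiter is absent, so a template that only needs the trailing segment can likely get by with `split` alone, assuming the template engine supports negative indexing.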
@ngxson @ggerganov
Lint CI fails, please fix before we can merge.
LOL, I think it just picks up the previous @f291400 Try rebasing, should fix the CI.
If it's fixed on
Besides, @f291400, if you want reviews to be fast and efficient, read the contribution guidelines and validate your changes carefully. This PR was created from your master branch, so maintainers cannot push fixes directly here; the PR has also been moved 2-3 times, which makes our work extremely inefficient.
I ran
Thank you for acknowledging this submission, which will greatly advance the use of youtu-llm on llama.cpp.
I attempted to merge via GH web UI but failed, so unfortunately you need to fix the merge conflict yourself @f291400
So, it seems GitHub started allowing this now? |
No idea, probably allowed via web UI only? I never have problems applying patches via web UI. But if I do
I'm pretty sure merging from
* Support Youtu-VL Model
* merge code
* fix bug
* revert qwen2 code & support rsplit in minja.hpp
* update warm info
* fix annotation
* u
* revert minja.hpp
* fix
* Do not write routed_scaling_factor to gguf when routed_scaling_factor is None
* fix expert_weights_scale
* LGTM after whitespace fixes
* fix
* fix
* fix
* layers to layer_index
* enum fix

---------

Co-authored-by: Xuan-Son Nguyen <son@huggingface.co>
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>