Open
Labels
type:feature (New feature or request)
Description
Hi Keras Team,
I have successfully ported the Llama-3-Groq-8B-Tool-Use model (originally by Groq/Meta) to KerasHub. This model is currently state of the art among 8B-class models for tool use and agentic workflows, and I believe it would be a valuable addition to the library's llama3_presets.py.
Work Done
I have converted the weights from the original PyTorch source, handled the transpose operations, and verified the model outputs using KerasHub.
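For reviewers who want to see what the transpose step involves, here is a minimal NumPy sketch. It is not the actual conversion script. It assumes the usual layout difference: a PyTorch `Linear` weight is stored as `(out_features, in_features)`, while a Keras dense kernel is `(in_features, out_features)`. The per-head reshape for attention projections is also an assumption about the target layout, and the helper names and toy dimensions are made up for illustration.

```python
import numpy as np


def torch_linear_to_keras_kernel(weight):
    """Turn a PyTorch-style Linear weight (out, in) into a Keras-style kernel (in, out)."""
    return np.transpose(weight, (1, 0))


def torch_proj_to_head_kernel(weight, num_heads, head_dim):
    """Turn an attention projection (num_heads * head_dim, hidden) into
    a per-head kernel of shape (hidden, num_heads, head_dim).

    Assumption: the target layer expects the heads split out as separate axes.
    """
    hidden_dim = weight.shape[1]
    transposed = np.transpose(weight, (1, 0))
    return np.reshape(transposed, (hidden_dim, num_heads, head_dim))


# Toy dimensions (hypothetical, far smaller than the real 8B model).
rng = np.random.default_rng(0)
hidden_dim, num_heads, head_dim = 8, 2, 4
x = rng.standard_normal((3, hidden_dim)).astype("float32")
w_torch = rng.standard_normal((num_heads * head_dim, hidden_dim)).astype("float32")

# PyTorch convention: y = x @ W^T
y_torch = x @ w_torch.T

# Keras dense convention: y = x @ kernel
kernel = torch_linear_to_keras_kernel(w_torch)
y_keras = x @ kernel

# Per-head convention: contract over the hidden axis, keep (heads, head_dim).
head_kernel = torch_proj_to_head_kernel(w_torch, num_heads, head_dim)
y_heads = np.einsum("bh,hnd->bnd", x, head_kernel)

# Both converted layouts reproduce the original projection.
outputs_match = bool(
    np.allclose(y_torch, y_keras, atol=1e-5)
    and np.allclose(y_torch, y_heads.reshape(3, -1), atol=1e-5)
)
print("converted kernels reproduce the original projection:", outputs_match)
```

Comparing outputs on random inputs, as done here, is a cheap sanity check to run per layer before comparing full-model logits.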
Artifacts
- Model Weights (Kaggle): https://www.kaggle.com/models/jaytiwari04/llama-3-groq-8b-tool-use/
- Validation Notebook: https://www.kaggle.com/code/jaytiwari04/llama-3-groq-tool-use-01
Next Steps
I would love to contribute this as an official preset.
Could you please review the artifacts? If everything looks good, I am happy to transfer the weights to the official Keras Kaggle account so I can open a PR to add the preset to the codebase.
Thanks!
cc: @mattdangerw @abheesht17 @sachinprasadhs @divyashreepathihalli