Claude Code skills for transformers-api #43340
What does this PR do?
Fixes #42971
This PR adds a Claude Skill for `huggingface/transformers` to help contributors navigate the codebase and common development workflows more efficiently.

What's included
What’s not included
The original issue mentions a plugin request as well, but this PR focuses on delivering the Skill first as a minimal, useful step. Plugin support can be handled in a follow-up PR.
How to test
A few of the many questions I tested:
API existence / anti-hallucination check:
“Does Transformers have a public argument called `temperature_decay` on `generate()`? If yes, show the exact signature location. If no, point to the closest real knobs and where they're defined.”
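For context on the expected answer: `temperature_decay` is deliberately made up. A minimal sketch of the real sampling knobs on `generate()`, which are declared on `GenerationConfig`; the checkpoint name is only an example:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Example checkpoint only; any causal LM works.
tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

inputs = tokenizer("The capital of France is", return_tensors="pt")
# There is no `temperature_decay`; the real sampling knobs are fields on
# `GenerationConfig` (src/transformers/generation/configuration_utils.py).
outputs = model.generate(
    **inputs,
    do_sample=True,
    temperature=0.7,
    top_p=0.9,
    max_new_tokens=20,
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```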
Repo navigation / backend dispatch:
“Where is the logic that decides which backend (PyTorch vs TensorFlow vs Flax) gets used when calling `AutoModel.from_pretrained()`? Point to the exact files and decision flow.”
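A sketch of the answer I'd expect the Skill to surface: framework choice happens at the Auto-class level rather than inside `from_pretrained()`; the file paths below reflect the current repo layout:

```python
from transformers import AutoModel

# AutoModel -> PyTorch, TFAutoModel -> TensorFlow, FlaxAutoModel -> Flax.
# Each Auto class resolves the config's model_type through its own mapping in
# src/transformers/models/auto/ (modeling_auto.py, modeling_tf_auto.py,
# modeling_flax_auto.py), so the "backend decision" is made by which class
# you call, not by runtime dispatch.
model = AutoModel.from_pretrained("bert-base-uncased")  # example checkpoint
print(type(model))  # e.g. transformers.models.bert.modeling_bert.BertModel
```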
Generation internals / repetition debugging:
“I'm getting repetitive text in long generations even with `repetition_penalty` set. What knobs interact most strongly with repetition, and which files apply these penalties during decoding?”
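A minimal sketch of the knobs in play here; `repetition_penalty` and `no_repeat_ngram_size` are applied by logits processors in src/transformers/generation/logits_process.py:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")  # example checkpoint
model = AutoModelForCausalLM.from_pretrained("gpt2")
inputs = tokenizer("Once upon a time", return_tensors="pt")

outputs = model.generate(
    **inputs,
    do_sample=True,
    repetition_penalty=1.2,   # RepetitionPenaltyLogitsProcessor
    no_repeat_ngram_size=3,   # NoRepeatNGramLogitsProcessor
    temperature=0.8,          # sampling temperature also interacts with repetition
    max_new_tokens=200,
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```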
Quantization & loading performance troubleshooting:
“Loading a 7B causal LM with 4-bit quantization and `device_map="auto"` is causing slow CPU offload and high RAM usage. What are the likely causes in the loading path, what knobs should I change, and where are they handled in code?”
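A minimal sketch of the loading setup being described, assuming `bitsandbytes` is installed; the checkpoint name is only an example. Layers that don't fit in VRAM get offloaded to CPU by `device_map="auto"`, which is typically the slow path the question refers to:

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_compute_dtype=torch.bfloat16,  # compute dtype for the 4-bit layers
)
model = AutoModelForCausalLM.from_pretrained(
    "mistralai/Mistral-7B-v0.1",   # example 7B checkpoint
    quantization_config=bnb_config,
    device_map="auto",  # accelerate places layers across GPU/CPU/disk
)
```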
Serving/export reality check:
“Is there a supported CLI command `transformers serve` for text generation with batching? If not, what are the supported alternatives in the Transformers ecosystem, and where are the relevant docs/code in this repo?”

PS: This is just an initial draft I put together so maintainers and other community folks can try it out first. Once people test it and share feedback, we can iterate on it and polish it further.
For review: @Rocketknight1, @stevhliu, @ArthurZucker
CC: @Emasoft, @coolgalsandiego