Option to Disable kv_cache for Generative Models in ai_edge_torch #723

@Hardik-Choraria

Description of the bug:

The current ai_edge_torch.generative pipeline supports KV cache layout and update strategies (INPLACE, PREPEND_LEFT), but there is no explicit high-level option to disable the KV cache entirely during model export.
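To illustrate what such an option might control, here is a minimal pure-Python sketch. It is not ai_edge_torch code: `ExportOptions`, `use_kv_cache`, `project_kv`, `attend`, and `decode` are all hypothetical names made up for this example. It only shows that a toy causal attention loop can produce the same outputs with or without a cache, at the cost of recomputing keys and values for the whole prefix at every step.

```python
import math
from dataclasses import dataclass


@dataclass
class ExportOptions:
    # Hypothetical flag for illustration; not an existing ai_edge_torch option.
    use_kv_cache: bool = True


def project_kv(x):
    """Toy key/value projection standing in for learned weight matrices."""
    key = [2.0 * a for a in x]
    value = [a + 1.0 for a in x]
    return key, value


def attend(q, keys, values):
    """Single-head scaled dot-product attention for one query vector."""
    scale = 1.0 / math.sqrt(len(q))
    scores = [scale * sum(a * b for a, b in zip(q, k)) for k in keys]
    top = max(scores)  # subtract the max for a numerically stable softmax
    weights = [math.exp(s - top) for s in scores]
    total = sum(weights)
    dim = len(values[0])
    return [sum(w * v[i] for w, v in zip(weights, values)) / total
            for i in range(dim)]


def decode(tokens, options):
    """Run causal attention over `tokens`, one position at a time."""
    outputs = []
    cache_k, cache_v = [], []
    for t, x in enumerate(tokens):
        if options.use_kv_cache:
            # Cached path: project only the new token and append to the cache.
            k, v = project_kv(x)
            cache_k.append(k)
            cache_v.append(v)
            keys, values = cache_k, cache_v
        else:
            # Cache-free path: recompute keys/values for the whole prefix.
            projected = [project_kv(p) for p in tokens[: t + 1]]
            keys = [k for k, _ in projected]
            values = [v for _, v in projected]
        outputs.append(attend(x, keys, values))
    return outputs


if __name__ == "__main__":
    tokens = [[1.0, 0.0], [0.5, 0.5], [0.0, 2.0]]
    with_cache = decode(tokens, ExportOptions(use_kv_cache=True))
    without_cache = decode(tokens, ExportOptions(use_kv_cache=False))
    print(with_cache == without_cache)
```

In this sketch the cache-free path takes no cache tensors as inputs or outputs, which is the property the request is after: an exported model whose signature does not carry KV cache state.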

Actual vs expected behavior:

No response

Any other information you'd like to share?

No response
