Description of the bug:
While the current ai_edge_torch.generative pipeline supports KV cache layout and update strategies (INPLACE, PREPEND_LEFT), there’s no explicit high-level option to disable the use of KV cache entirely during model export.
Actual vs expected behavior:
No response
Any other information you'd like to share?
No response