Currently, ollama.chat() and ollama.generate() create their own internal AbortController.
The only way to abort is via stream.abort() or ollama.abort(), but neither takes effect immediately: the async iterator keeps waiting for the server.
As a result, the UI lags after an abort because it gets no immediate feedback.
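
One way to picture the instant feedback the UI needs is to race each pending chunk against an abort signal on the consumer side. The sketch below is only an illustration under assumptions: `abortable` is a hypothetical helper, not part of ollama-js, and it assumes the stream can be consumed as an ordinary async iterable.

```typescript
// Hypothetical helper (not part of ollama-js): ends iteration as soon as
// `signal` aborts, even while the source is still waiting on the server.
async function* abortable<T>(
  source: AsyncIterable<T>,
  signal: AbortSignal,
): AsyncGenerator<T> {
  if (signal.aborted) return;

  const iterator = source[Symbol.asyncIterator]();

  // Promise that resolves with a marker object once the signal aborts.
  let onAbort: () => void = () => {};
  const aborted = new Promise<{ aborted: true }>((resolve) => {
    onAbort = () => resolve({ aborted: true });
    signal.addEventListener("abort", onAbort, { once: true });
  });

  try {
    while (true) {
      // Whichever settles first wins: the next chunk or the abort marker.
      const result = await Promise.race([iterator.next(), aborted]);
      if ("aborted" in result) return; // stop without waiting for the server
      if (result.done) return;
      yield result.value;
    }
  } finally {
    signal.removeEventListener("abort", onAbort);
    // Best-effort cleanup of the source; deliberately not awaited so a
    // stalled source cannot delay the consumer.
    void iterator.return?.().catch(() => {});
  }
}
```

A caller could wrap the stream in `abortable(stream, controller.signal)` and call `controller.abort()` from the UI. This only unblocks the consumer; the underlying request would still need to be cancelled separately, for example with stream.abort().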